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CHAPTER 1 


Introduction to Calculus 


Pe = 1.4 Velocity and Distance 1 


The right way to begin a calculus book is with calculus, This chapter will jump 
directly into the two problems that the subject was invented to solve. You will see 
what the questions are, and you will see an important part of the answer. There are 
plenty of good things left for the other chapters, so why not get started? 

The book begins with an example that is familiar to everybody who drives a car, 
It is calculus in action—-the driver sees it happening. The example is the relation 
between the speedometer and the odometer. One measures the speed (or velocity); 
the other measures the distance traveled. We wil] write v for the velocity, and f for 
how far the car has gone. The two instruments sit together on the dashboard: 


Fig. 1.4 Velocity » and total distance f (at one instant of time). 


Notice that the units of measurement are different for vp and f. The distance f is 
measured in kilometers or miles {it is easier to say miles). The velocity v is measured 
in km/hr or miles per hour. A unit of time enters the velocity but not the distance. 
Every formula to compute v from f will have { divided by time. 


The central question of calculus is the relation between v and f. ‘ 
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Can you find v if you know /, and vice versa, and how? If we know the velocity over 
the whole history of the car, we should be able to compute the total distance traveled. 
In other words, if the speedometer record is complete but the odometer is missing, 
its information could be recovered. One way to do it (without calculus) is to put in 
a new odometer and drive the car all over again at the right speeds. That seems like 
a hard way; calculus may be easier. But the point is that the information is there. 
If we know everything about v, there must be a method to find f. 


What happens in the opposite direction, when f is known? If you have a complete 
record of distance, could you recover the complete velocity? In principle you could drive 
the car, repeat the history, and read off the speed. Again there must be a better way. 


The whole subject of calculus is built on the relation between v and {. The question 
we are raising here is not some kind of joke, alter which the book will get serious 
and the mathematics will get started. On the contrary, / am serious now—and the 
mathematics has already started. We need to know how to find the velocity from a 
record of the distance. (That is called differentiation, and it is the central idea of 
differential calculus.) We also want to compute the distance from a history of the 
velocity. (That is integration, and it is the goal of integral calculus.) 

Differentiation goes from f to v; integration goes from v to f. We look first 
at examples in which these pairs can be computed and understood. 


CONSTANT VELOCITY 
Suppose the velocity is fixed at p= 60 (miles per hour). Then f increases at this 
constant rate. After two hours the distance is f= 120 (miles). After four hours 


{= 240 and after t hours f= 60t. We say that f increases Hnearly with time—its 
graph is a straight line. 


velocity v(?) distance f(t) 


time f 


2 4 
Fig. 4.2 Constant velocity » = 60 and linearly increasing distance f= 601. 


Notice that this example starts the car at full velocity. No time is spent picking up 
speed. (The velocity is a ‘step function.”’) Notice also that the distance starts at zero; 
the car is new. Those decisions make the graphs of v and f as neat as possible. One 
is the horizontal line » = 60. The other is the sloping line f= 601. This v, f, t relation 
needs algebra hut not calculus: 


if v is constant and { starts at zero then f= vt. 


The opposite is also true. When f increases linearly, p is constant. The division by 
time gives the slope. The distance is f; = 120 miles when the time is ¢, = 2 hours. 
Later f, = 240 at t, = 4. At both points, the ratio f/t is 60 miles/hour. Geometrically, 
the velocity is the slope of the distance graph: 


change in distance _ vt 
change in time t 


slope = =v. 
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-30e------ 4 y=-30 f=-301 


Fig. 4.3. Straight lines f= 20 + 60¢ (slope 60) and f= —30¢ (slope — 30). 


The slope of the f-graph gives the v-graph. Figure 1.3 shows two morc possibilitics: 


1. The distance starts at 20 instead of 0. The distance formula changes from 60t 
to 20+ 60t. The number 20 cancels when we compute change in distancc—so 
the slope is stil] 60. 

2. When pv is negative, the graph of f goes downward. The car goes backward and 
the slope of f= — 30t is ve = — 30. 


] don’t think speedometers go below zero. But driving backwards, it’s not that safe 
to watch. If you go fast enough, Toyota says they measure ‘‘absolute values’’—the 
speedometer reads + 30 when the velocity is — 30. For the odometer, as far as I know 
it just stops. It should go backward.t 


VELOCITY vs. DISTANCE: SLOPE vs. AREA 


How do you compute f{ from v? The point of the question is to see f= vt on the 
graphs. We want to start with the graph of v and discover the graph of f. Amazingly, 
the opposite of slope is area. 

The distance f ts the area under the v-graph. When v is constant, the region under 
the graph is a rectangle. Its height is v, its width is ¢, and its area is v times ft. This is 
integration, to go from v to f by computing the area. We are glimpsing two of the 
central facts of calculus. 


4A The slope of the f-graph gives the velocity v. The area under the v-graph 


gives the distance f. 


That is certainly not obvious, and I hesitated a long time before [ wrote it down in 
this first section. The best way to understand it is to look first at more examples. The 
whole point of calculus is to deal with velocities that are mot constant, and from now 
on vu has several values. 


EXAMPLE (Forward and back) There is a motion that you will understand right away. 
The car goes forward with velocity V, and comes back at the same speed. To say it 
more correctly, the velocity in the second part is — V. If the forward part lasts until 
t= 3, and the backward part continues tor = 6, the ear will come back where it started. 
The total distance after both parts will be f= 0. 


+This actually happened in Ferris Buelfer’s Day Off, when the hero borrowed his father’s sports 
car and ran up the mileage. At home he raised the car and drove in reverse. I forget if it 
worked. 
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v(t) = slope of f(2) ay f(D 


velocity V velocity -V 


Ag. 1.4 Velocities + V and —V give motion forward and back, ending at f(6)=0. 


The v-graph shows velocities + V and — V. The distance starts up with slope +V 
and reaches f= 3V. Then the car starts backward. The distance goes down with slope 
— V and returns to f=0 at :=6. 

Notice what that means. The total area “‘under”’ the v-graph is zero! A negative 
velocity makes the distance graph go downward (negative slope). The car is moving 
backward. Area below the axis in the v-graph is counted as negative. 


FUNCTIONS 


This forward-back example gives practice with a crucially important idea—the con- 
cept of a ‘‘function.”” We seize this golden opportunity to explain functions: 


The number v(t) is the value of the function v at the time ¢. 


The time ¢ is the input to the function. The velocity v(t) at that time is the output. 
Most people say “‘v of t’” when they read v(t). The number “vy of 2” is the velocity 
when t = 2. The forward-back example has v(2)= + V and v(4) = — V. The function 
contains the whole history, like a memory bank that has a record of v at each ¢. 

It is simple to convert forward-back motion into a formula. Here is v(t): 


+V if O<t<3 
v(t) = ? if r=3 
—-V if 3<1<6 


The right side contains the instructions for finding v(t). The input ¢ is converted into 
the output + V or —V. The velocity v(t) depends on t. In this case the function is 
“discontinuous,” because the needle jumps at t = 3. The velocity is not defined at that 
instant. There is no v(3). (You might argue that v is zero at the jump, but that leads 
to trouble.) The graph of f has a corner, and we can’t give its slope. 

The problem also involves a second function, namely the distance. The principle 
behind f(t) is the same: f(t) is the distance at time t. It is the net distance forward, 
and again the instructions change at ¢ = 3. In the forward motion, f(t) equals Vt as 
before. In the backward half, a calculation is built into the formula for /(t): 


Vt if O<r<3 
t= 
fw Vi6-t) if 3<7<6 


At the switching time the right side gives two instructions (one on each line). This 
would be bad except that they agree: {(3)=3V.t The distance function is “con- 


+A function is only allowed one value f(t} or v(t) at each time r- 
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tinuous.” There is no jump in f, even when there is a jump in v. Alter ¢ = 3 the distance 
decreases because of — Vt. At t= 6 the second instruction correctly gives f(6}= 0. 

Notice something more. The functions were given by graphs before they were given 
by formulas. The graphs tell you f and v at every time t—sometimes more clearly 
than the formulas. The values f(t) and v(t) can also be given by tables or equations 
or a set of instructions. (In some way all functions are instructions—the function 
tells how to find f at time t.) Part of knowing f is knowing all its inputs and 
outputs—its domain and range: 


The domain of a function is the set of inputs. The range is the set of outputs. 


The domain of f consists of all times 0<t<6. The range consists of all distances 
O<f(t)<3V. (The range of » contains only the two velocities +V and —V.) 
We mention now, and repeat later, that every “linear” function has a formula 
f(t) = vt + C. Its graph is a line and v is the slope. The constant C moves the line up 
and down. It adjusts the line to go through any desired starting point. 


SUMMARY: MORE ABOUT FUNCTIONS 


May I collect together the ideas brought out by this example? We had two functions 
vy and f. One was velocity, the other was distance. Each function had a domain, 
and a range, and most important a graph. For the f-graph we studied the slope 
{which agreed with p). For the v-graph we studied the area (which agreed with /). 
Calculus produces functions in pairs, and the best thing a book can do early is to 
show you more of them. 


in inputt —+ functionf — output f(t) in 
the input 2 — functiony —» output v(2) the 
domain input? — f(n=2t+6 = f(7)=20 range 


Note about the definition of a function. The idea behind the symbol f(t) is absolutely 
crucial to mathematics. Words don’t do it justice! By definition, a function is a “rule” 
that assigns one member of the range to each member of the domain. Or, a function 
is a set of pairs (t, f(t)) with no ¢ appearing twice. (These are “ordered pairs” because 
we write t before f(t).) Both of those definitions are correct— but somehow they are 
too passive. 

In practice what matters is the active part. The number f(t} is produced from the 
number ft. We read a graph, plug into a formula, solve an equation, run a computer 
program. The input ¢ is “mapped” to the output f(t), which changes as ¢ changes. 
Calculus is about the rate of change. This rate is our other function v. 


f(-2) = 21-3 


. 


range fi) =2r+1 


I 


domain 


domain | 


Aig. 1.5 Subtracting 2 from f affects the range. Subtracting 2 from t affects the domain. 


fe) 4 
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It is quite hard at the beginning, and not automatic, to see the difference between 
f{(t}—2 and f(t — 2). Those are both new functions, created out of the original f(r). 
In f(t)— 2, we subtract 2 from all the distances. That moves the whole graph down. 
In f(t — 2), we subtract 2 from the time. That moves the graph over to the right. 
Figure 1.5 shows both movements, starting from f(t) = 2t~+ 1. The formula to find 


f{t — 2) is 2(f — 2)+ 1, which is 2¢ — 3. 


A graphing calculator also moves the graph, when you change the viewing window. 
You can pick any rectangle A <¢< B, C <f(t)< D. The screen shows that part of 
the graph. But on the calculator, rhe function f(t) remains the same. It is the axes that 
get renumbered. In our figures the axes stay the same and the function is changed. 

There are two more basic ways to change a function. (We are always creating new 
functions—that is what mathematics is all about.) Instead of subtracting or adding, 
we can multiply the distance by 2. Figure 1.6 shows 2/(¢). And instead of shifting the 
time, we can speed it up. The function becomes (22). Everything happens twice as 
fast (and takes half as Jong). On the calculator those changes correspond to a 
“zoom —on the f axis or the ¢ axis. We soon come back to zooms. 


6 


Tage 


flt)= 21 | < 
slope 2 


domain J 


2ft)=4e4+2 


slope 4 


ipel 


{2 =4e+ 1 
slope 4 


t 
} 1/2 


Fig. 1.6 Doubling the distance or speeding up the time doubles the slope. 


4.4 EXERCISES 


Each section of the book contains read-through questions. They 
allow you to outline the section yourself—more actively than 
reading a summary. This is probably the best way to remember 
the important ideas. 


Starting from f(0)=0 at constant velocity xv, the distance 
function is f(t)=__o . When f(t}=55r the velocity is 
t= ob. When f(r) = 551 + 1000 the velocity is still __¢ 
and the starting value is f(Q)=__d__. [In each case ¢ ts the 
eof the graph of f When __ft _ is negative, the graph 


of _g goes downward. In that case area in the r-graph 
counts as oh 


Forward motion from /(0)=0to f(2)= 10 hast =__i_. 
Then backward motion to f{4)=0 has r=_ i. The dis- 


tance function is f(t)= 51 for 0 <1 <2 and then f(r}- __k 


(not —5t). The slopes are | and _m_. The distance 
{Q)=__n_. The area under the e-graph up to time 1.5 is 

o. The domain of f is the time interval __p__, and the 
range is the distance interval _q__. The range of p{r} is only 


r 


The value of f{t}= 31+ latt=2is f(2)=__s__. The value 
19 equals f(__t__). The diflerence f(4) —f(1}=__w_. That 
is the change in distance, when 4 -- | is the change in __¥ 
The ratio of those changes equals _ w__, which is the __ x 
of the graph. The formula for f{t)+2 is 3¢+3 whereas 
f(t+2)equals _y _. Those functions have the same __z 
as f: the graph of f(t)+2 is shifted A and f(t+2} is 
shifted _ 8 . The formula for f(5r) is _¢ . The formula 


for 5f(t)is _D . The slope has jumped from 3 to __£ 


4.4 Velocity and Distance 7 


The set of inputs to a function is its _F . The set of 


outputs is its _@ _. The functions f(t)=7 + 3(t—2) and 
with slopes 


f(j}=vt+C are _H_. Their graphs are __1| 
equal to__J. and _K_. They are the same function, if 


v=_1t andC=_M 


Draw the velocity graph that goes with each distance graph. 


tf 
60 
OS 
f t 
2 4 6 1 2 
2 3sd+s f 

10 

ft t 


10 20 30 T 2F 3F 


3 Write down three-part formulas for the velocities v{t) in 
Problem 2, startmg from v(t) = 2 for 0<t < 10. 


4 The distance in 1b starts with f(t) = 10—l0rfor0<t< 1. 


Give a formula for the second part. 
§ In the middle of graph 2a find f(15S) and f({t2) and /({e). 
6 In graph 2b find f(1.4T). If T=3 what is f(4)? 


7 Find the average speed between t= 0 and t = 5 in graph 
la. What is the speed at t = 5? 


8 What is the average speed between t = and ¢=2in cai 
1b? The average speed is zero between t = 4 and ¢ = 


9 (recommended) A car goes at speed v = 20 into a brick 
wall at distance f= 4. Give two-part formulas for u(t) and 
J (t) (before and after), and draw the graphs. 

10 Draw any reasonable graphs of v(t) and f(t) when 
{a} the driver backs up, stops to shift gear, then goes fast; 
(b) the driver stows to 55 for a police car; 
(c) in a rough gear change, the car accelerates in jumps; 
(d) the driver waits for a light that turns green. 


11. Your bank account eams simple interest on the opening 
balance f(0). What are the interest rates per year? 


88 
fi 80 


! 2 : l 2 
12 The earth’s population is growing at v= 100 million a 


year, Starting from f= 4.2 billion in 1990. Graph f(t) and find 
(2000). 


Draw the distance graph that goes with each velocity graph 
Start from {= 0 at t =0 and mark the distance. 


13 v 
30 
f 
2 4 6 
~30 13b 
14 v v 
40 40 
20 
t 
i» 2 T 2F 3F 
-40 I4a 14b 


15 Write down formulas for v(t) in Problem 14, starting with 
v= —40 for 0<t< 1. Find the average velocities to t = 2.5 
and ¢ = 37. 


16 Give 3-part formulas for the areas f(t) under v(t} in 13. 


17 The distance in 14a starts with f(t}=—40: forO <r <t 
Find f(t) in the other part, which passes through f= Oat ¢ = 2. 


18 Draw the velocity and distance graphs if v(t)=8 for 
O<1<2, f(t)=20+¢ for2<1<3, 


19 Draw rough graphs of yo x and y=,/x—4 and 


y= Jx — 4. They are “‘half-parabolas” with infinite slope at 
the start. 


20 What is the break-even point if x yearbooks cost 
$1200 + 30x to produce and the income is 40x? The slope of 
the cost line is (cost per additional book), If it goes 
above you can’t break even. 


21 What are the domains and ranges of the distance functions 
in [4a and 14b—all values of t and f(t) if f(0}=0? 


22 What is the range of v{t) in 14b? Why is ¢=1 not in the 
domain of v(t) in 14a? 

Problems 23-28 inyolye dinear functions f(t) = vt + C. Find the 
constants v and C. 

23 What linear function has f(0) = 3 and f(2)=—11? 


24 Find two linear functions whose domain ts 0<¢ <2 and 
whose range is 1 <f(t) <9 


25 Find the linear function with f{1) =4 and slope 6. 
26 What functions have f(t + 1) =f{t) + 2? 


27 Find the linear function with f({t+2)=f(t})+6 and 
f{1) = 10. 


28 Find the only f= vt that has {(2t) = 4f(z)}. Show that every 
f=4art? has this property. To go times as far in 
twice the time, you must accelerate. 


8 4 


29 Sketch the graph of f(t)=|5—2t| (absolute value) for 
|t]} <2 and find its slopes and range. 


30 Sketch the graph of f(t) =4—~¢—|4—¢| for 2<1<5 and 
find its slope and range, 


31 Suppose v = 8 up to time T, and after that v = —2. Starting 
from zero, When does f return to zero? Give formulas for v(t) 
and f(t). 

32 Suppose »=3 up to time T=4. What new velocity will 
lead to f(7) = 30 if f{0} = 0? Give formulas for v(t} and f(é). 


33 What function F(C) converts Celsius temperature C to 
Fahrenheit temperature F? The slope is which is 
the number of Fahrenheit degrees equivalent to 1°C. 


34 What function C(F) converts Fahrenheit to Celsius (or 
Centigrade), and what is its slope? 


35 What function converts the weight w in grams to the 
weight f(w} in kilograms? Interpret the slope of f(w). 


36 (Newspaper of March 1989) Ten hours after the accident 
the alcohoi reading was .061. Blood alcohol is eliminated at 
.015 per hour. What was the reading at the time of the acci- 
dent? How much later would it drop to .04 (the maximum set 
by the Coast Guard)? The usual limit on drivers is .10 percent. 


Which points between ¢ = 0 and ¢ = 5 can be in the domain of 
f(t)? With this domain find the range in 37-42. 


37 f(t)=f/t-1 38 f(s lf /t-1 

39 f(t) =|t— 4] (absolute value) 40 /(2) = 1/(t 4)? 

AN f(t)=2! 42 f(t)=27! 

43 (a) Draw the graph of f(t) = 4 + 3 with domain0 <¢<2. 
Then give a formula and graph for 
(b) fle)+1 (ce) f(e+ I) 
(d) 4f(¢) (e) (40). 

44 (a) Draw the graph of U(t) =step function = {0 for t <0, 
1 for ¢ > 0}. Then draw 
{b) U(t)4+2 
(d) 3U@) 


(c} U{e +2) 
(e) U(32). 


4.2 Calculus Without Limits 
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45 (a) Draw the graph of f(t}=1+1 for —l<et<l. Find 
the domain, range, slope, and formula for 
(b) 2f(2} = (c)_ f(t-3) (d) —f(t} =e) f(-2). 
46 Tf f(t) =4—1 what are 2f(32) and f(1 —¢) and f(t—1)? 
47 In the forward-back example find {(47} and {(? 7). Verify 


that those agree with the areas “under” the »-graph in 
Figure 1.4, 


48 Find formulas for the outputs /,(t) and f,(t) which come 
from the input f: 
(1) inside = input «3 (2) inside + input +6 
output = inside + 3 output < inside*3 
Note BASIC and FORTRAN (and calculus itself) use = 
instead of <. But the symboi < or «= is in some ways better. 


The instruction ¢ + t+ 6 produces a new ¢ equal io the old ¢ 
plus six. The equation t =¢ + 6 is not intended. 


49 Your computer can add and multiply. Starting with the 
number ! and the input called ¢, give a list of instructions to 
iead to these outputs: 


AM=P+t AO=AKhO) LO=Ae+ D. 


50 In fifty words or less explain what a function ts. 


The last questions are challenging but possible. 
51 If f(r) =3t—1 for O<t <2 give formulas (with domain) 
and find the slopes of these six functions: 

(a) f(t +2) (b) f(t)+2 (c) 2f(t) 

(d) f(2t) {e) f(i-) (f} £(f()). 
$2 For f(t}=vt+C find the formulas and slopes of 

(a) 3f(e)+ 1 (b} f(3t +1) (c) 2f(4t) 

(d) f(-2) (e} f(t) -/O) (f) UP). 
53 (hardest) The forward-back function is f(t)=2¢ for 
O<t <3, f(t)=12—2t for 3<¢¢<6. Graph f(f(t)) and find 
its four-part formula, First try t = 1.5 and 3. 
54 (a} Why is the letter X not the graph of a functton? 

(b) Which capital letters are the graphs of functions? 

(c) Draw graphs of their slopes. 


The next page is going to reveal one of the key ideas behind calculus. The discussion 
is just about numbers—functions and slopes can wait. The numbers are not even 
special, they can be any numbers. The crucial point is to look at their differences: 


Suppose the numbers are f= 0 2 6 7 4 9 
Their diflerences are » = 2 4 | -—3 5 


The differences are printed in between, to show 2—0=2and6—2=4and7—6=1. 


4.2 Calculus Without Limits 


Notice how 4 — 7 gives a negative answer —3. The numbers in f can go up or down, 
the differences in v can be positive or negative. The idea behind calculus comes when 


you add up those differences: 
2+4+1-3+5=9 


The sum of differences is 9. This is the last number on the top line (in /). Is this an 
accident, or is this always true? If we stop earlier, after 2+ 4+ 1, we get the 7 in f. 
Test any prediction on a second example: 


Suppose the numbers are f= 1 3 7 8 5 10 
Their differences are v = 2 @ 1 =-3 5 


The f’s are increased by |. The differences are exactly the same—no change. The 
sum of differences is still 9. But the last f is now 10. That prediction is not right, we 
don’t always get the last ff. 

The first f is now |. The answer 9 (the sum of differences) is 10—1, the last f 
minus the first f. What happens when we change the /’s in the middle? 


Suppose the numbers are f= 1 5 12 7 = 10 
Their differences are v = 4 7 -5 3 


The differences add to 4+ 7—5+3=9. This is still 10 — 1. No matter what f’s we 
choose or how many, the sum of differences is controlled by the first f and last /. 
If this is always true, there must be a clear reason why the middle f’s cancel out. 


The sum of differences is (5 — 1) + (12 —5)+(7 — 12)+(10—7)= 10-1. 


The 5’s cancel, the 12’s cancel, and the 7’s cancel. It is only 10 — 1 that doesn’t cancel. 
This is the key to calculus! 


4B The differences of the f’s add up to (fiast —Srirst)- 


EXAMPLE 4 The numbers grow linearly: f= 2 3 4 5 6 7 
Their differences are constant: v = 1 l ] 1 l 


The sum of differences is certainly 5. This agrees with 7 — 2 = fias: —Srirse- The numbers 
in v remind us of constant velocity. The numbers in f remind us of a straight line 
f=vt+C. This example has v=1 and the f’s start at 2. The straight line would 
come from f=1t+ 2. 


EXAMPLE 2 The numbers are squares: f= 0 1 4 9 16 
Their differences grow linearly;v= 1 3 5S 7 


1+3+5+7 agrees with 47 = 16. It is a beautiful fact that the first j odd numbers 
always add up to j”. The v’s are the odd numbers, the f’s are perfect squares. 


Note The letter j is sometimes useful to tell which number in f we are looking at. 
For this example the zeroth number is fy = 0 and the jth number is f; =j?. This is a 
part of algebra, to give a formula for the /’s instead of a list of numbers. We can also 
use j to tell which difference we are looking at. The first v is the first odd number 
v; =1. The jth difference is the jth odd number v; = 2j = 1. (Thus vg is 8— 1 =7.) It 
is better to start the differences with j = 1, since there is no zeroth odd number vp. 

With this notation the jth difference is v;=f;—f;-,. Sooner or later you will get 
comfortable with subscripts like j and j — 1, but it can be later. The important point 
is that the sum of the v’s equals f\,., —ffirse- We now connect the v’s to slopes and the 
f’s to areas. 
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I 2 3 4 


Fig. 1.7 Linear increase in v = 1, 3, 5, 7. Squares in the distances f= 0, 1, 4, 9, 16. 


Figure 1.7 shows a natural way to graph Example 2, with the odd numbers in v and 

the squares in f. Notice an important difference between the v-graph and the f-graph. 

The graph of f is “piecewise linear.” We plotted the numbers in f and connected 

them by straight lines. The graph of v is “piecewise constant.” We plotted the differ- 

ences as constant over each piece. This reminds us of the distance-velocity graphs, 

when the distance f(t) is a straight line and the velocity v(t) is a horizontal line. 
Now make the connection to slopes: 


distance up _ change in f _ 


Ehpsiope Of essen distance across changeint — 


Over each piece, the change in t (across) is 1. The change in f (upward) is the difference 

that we are calling v. The ratio is the slope v/1 or just v. The slope makes a sudden 

change at the breakpoints t= 1, 2,3,.... At those special points the slope of the 

f-graph is not defined—we connected the v’s by vertical lines but this is very 

debatable. The main idea is that between the breakpoints, the slope of f(t) is v(t). 
Now make the connection to areas: 


The total area under the v-graph is fas, —Srirst- 


This area, underneath the staircase in Figure 1.7, is composed of rectangles. The base 
of every rectangle is 1. The heights of the rectangles are the v’s. So the areas also 
equal the v’s, and the total area is the sum of the v’s. This area is fiast —Srirst« 

Even more is true. We could start at any time and end at any later time 
—not necessarily at the special times t=0, 1, 2,3,4. Suppose we stop at t = 3.5. 
Only half of the last rectangular area (under v = 7) will be counted. The total area is 
1+3+4+5+4(7)=12.5. This still agrees with fas: — fries: = 12.5 — 0. At this new ending 
time t = 3.5, we are only halfway up the last step in the f-graph. Halfway between 
9 and 16 is 12.5. 


1C The v’s are slopes of f(t). The area under the v-graph is f (tena) —f (tetart)- 


This is nothing less than the Fundamental Theorem of Calculus. But we have only 
used algebra (no curved graphs and no calculations involving limits). For now the 
Theorem is restricted to piecewise linear f(t) and piecewise constant v(t). In Chapter 5 
that restriction will be overcome. 


Notice that a proof of 1 +3+5+7=4? is suggested by Figure 1.7a. The triangle 
under the dotted line has the same area as the four rectangles under the staircase. 
The area of the triangle is }- base - height = }-4+8, which is the perfect square 47. 
When there are j rectangles instead of 4, we get 4-j+2j =j? for the area. 


1.2 Calculus Without Limits 


The next examples show other patterns, where f and v increase exponentially or 
oscillate around zero. I hope you like them but I don’t think you have to learn them. 
They are like the special functions 2‘ and sin t and cos t—except they go in steps. 
You get a first look at the important functions of calculus, but you only need algebra. 
Calculus is needed for a steadily changing velocity, when the graph of f is curved, 


The last example will be income tax—which really does go, in steps. Then Sec- 
tion 1.3 will introduce the slope of a curve. The crucial step for curves is working 
with fimits. That will take us from algebra to calculus. 


EXPONENTIAL VELOCITY AND DISTANCE 


Start with the numbers f= 1, 2, 4, 8, 16. These are “powers of 2.” They start with the 
zeroth power, which is 2° = 1. The exponential starts at | and not 0. After j steps there 
are j factors of 2, and f; equals 2’. Please recognize the difference between 2j and }? 
and 2), The numbers 2j grow linearly, the numbers j? grow quadratically, the numbers 
2? grow exponentially. At j= 10 these are 20 and 100 and 1024. The exponential 2/ 
quickly becomes much larger than the others. 

The differences of f= 1, 2, 4, 8, 16 are exactly » = 1, 2, 4, 8. We get the same beauti- 
ful numbers. When the f’s are powers of 2, so are the v's. The formula »;=2/~' is 
slightly diferent from f;= 2/, because the first v is numbered v,. (Then v, =2°=1. 
The zeroth power of every number is 1, except that 0° is meaningless.) The two graphs 
in Figure 1.8 use the same numbers but they look different, because f is piecewise 
linear and v is piecewise constant. 


1 2 3 4 i 2 3 4 
Aig. 1.8 The velocity and distance grow exponentially (powers of 2). 


Where will calculus come in? It works with the smooth curve f(t) = 2'. This expo- 
nential growth is critically important for population and money in a bank and the 
national debt. You can spot it by the following test: v{t) is proportional to f(£). 


Remark The function 2' is trickier than t?, For f=¢? the slope is v=2z. It is 
proportional to t and not ¢?. For f=2' the slope is v= c2', and we won't find the 
constant c= .693... until Chapter 6. (The number c is the natural logarithm of 2.} 
Problem 37 estimates ¢ with a calculator—the important thing is that it’s constant. 


OSCILLATING VELOCITY AND DISTANCE 


We have seen a forward-back motion, velocity V followed by — V. That is oscillation 
of the simplest kind. The graph of f goes linearly up and linearly down. Figure 1.9 
shows another oscillation that returns to zero, but the path is more interesting. 

The numbers in f are now 0, 1, 1,0, —1, —1,0. Since f, =0 the motion brings us 
back to the start. The whole oscillation can be repeated. 
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The differences in » are 1,0, —1, —1,0, 1. They add up to zero, which agrees with 
Sia —Stirn- [t is the same oscillation as in f (and also repeatable), but shifted in time. 

The f-graph resembles (roughly) a sine curve, The v-graph resembles (even more 
roughly) a cosine curve. The waveforms in nature are smooth curves, while these are 
“digitized”—the way a digital watch goes forward in jumps. You recognize that the 
change from analog to digital brought the computer revolution. The same revolution 
is coming in CD players. Digital signals (off or on, 0 or I} seem to win every time. 

The piecewise v and / start again at t= 6. The ordinary sine and cosine repeat at 
t = 2n. A repeating motion is periodic—here the “period” is 6 or 22. (With t in degrees 
the period is 360-—a full circle. The period becomes 2x when angles are measured in 
radians. We virtually always use radians—which are degrees times 27/360.) A watch 
has a period of 12 hours. If the dial shows am and pM, the period is 


Fig. 1.9 Piecewise constant “cosine” and piecewise linear “sine.” They both repeat. 


A SHORT BURST OF SPEED 


The next example is a car that is driven fast for a short time. The speed is V until 
the distance reaches f= 1, when the car suddenly stops. The graph of f goes up 
linearly with slope V, and then across with slope zero: 


Vo upto t=T Vt upto t=T 
v(t) = f= 
0 after t=T 1 after t=T 


This is another example of “function notation.” Notice the general time ¢ and the 
particular stopping time T. The distance is f(t). The domain of f (the inputs) includes 
all times ¢ > 0. The range of f (the outputs) includes ail distances O<f< 1. 

Figure 1.10 allows us to compare three cars—a Jeep and a Corvette and a Maserati. 
They have different speeds but they all reach f= 1. So the areas under the v-graphs 
are all 1. The rectangles have height V and base T= 1/V. 


EQUAL AREAS EQUAL DISTANCES 
Maserati 


" 
delta) function 
u 
u step 


function 


a 
| Corvette 


Tf. Fr 


Fig. 1.40 Bursts of speed with My Tis = WT = VT; = 1. Step function has infinite slope. 


Optional remark It is natural to think about faster and faster speeds, which means 
steeper slopes. The f-graph reaches ] in shorter times. The extreme case is a step 
function, when the graph of f goes straight up. This is the unit step U(t), which is 
zero up to ¢=0 and jumps immediately to U = 1 for ¢>0. 


4.2 Calculus Without Limits 


What is the slope of the step function? It is zero except at the jump. At that moment, 
which is t = 0, the slope is infinite. We don’t have an ordinary velocity v(t)—instead 
we have an impulse that makes the car jump. The graph is a spike over the single 
point t= 0, and it is often denoted by 6—so the slope of the step function is called 
a “delta function.” The area under the infinite spike is 1. 

You are absolutely not responsible for the theory of delta functions! Calculus is 
about curves, not jumps. 

Our last example is a real-world application of slopes and rates—to explain “how 
taxes work.” Note especially the difference between tax rates and tax brackets and 
total tax. The rates are v, the brackets are on x, the total tax is 


EXAMPLE 3. Income tax is piecewise linear. The slopes are the tax rates .15, .28, 31. 


Suppose you are single with taxable income of x dollars (Form 1040, line 37—after 
all deductions). These are the 1991 instructions from the Internal Revenue Service: 


If x is not over $20,350, the tax is 15% of x. 
If $20,350 < x < $49,300, the tax is $3052.50 + 28% of the amount over $20,350. 
If x is over $49,300, the tax is $11,158.50 + 31% of the amount over $49,300. 


The first bracket is 0 < x < $20,350. (The IRS never uses this symbol <, but I think 
it is OK here. We know what it means.) The second bracket is $20,350 < x < $49,300. 
The top bracket x > $49,300 pays tax at the top rate of 31%. But only the income in 
that bracket is taxed at that rate. 

Figure 1.11 shows the rates and the brackets and the tax due. Those are not average 
rates, they are marginal rates. Total tax divided by total income would be the average 
rate. The marginal rate of .28 or .31 gives the tax on each additional dollar of income— 
it is the slope at the point x. Tax is like area or distance—it adds up. Tax rate is like 
slope or velocity—it depends where you are. This is often unclear in the news media. 


tax to pay f(x) 


up 180 31% 


across 3 


11,158 


tax rate = 
slope .28 


(2) =40 
f 3,052 


t 


| 
\ 
\ 
i 
1 
| 
2 5 


20,350 49,300 
Fig. 4.44 The tax rate is v, the total tax is f. Tax brackets end at breakpoints. 


Question What is the equation for the straight line in the top bracket? 

Answer The bracket begins at x = $49,300 when the tax is f(x) = $11,158.50. The 
slope of the line is the tax rate .31. When we know a point on the line and the slope, 
we know the equation. This is important enough to be highlighted. 


Section 2.3 presents this “point-slope equation” for any straight line. Here you see it 
for one specific example. Where does the number $11,158.50 come from? It is the tax 
at the end of the middle bracket, so it is the tax at the start of the top bracket. 


taxable income 
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Figure 1.11 also shows a distance-velocity example. The distance at t=2 is 
J(2)=40 miles. After that time the velocity is 60 miles per hour. So the line with 
slope 60 on the f-graph has the equation 


f(t) =starting distance + extra distance = 40 + 60(t — 2). 
The starting point is (2, 40). The new speed 60 multiplies the extra time t — 2. The 
point-slope equation makes sense. We now review this section, with comments. 


Central idea Start with any numbers in f, Their differences go in v. Then the sum 
of those differences is figs —Srisst- 


Subscript notation The numbers are fo, f,,... and the first difference is vp, =f, — fo. 


A typical number is f; and the jth difference is v;=f;—f;_,. When those differences 
are added, all f’s in the middle (like f,) cancel out: 


yntoto ta=(h-fot+h ft +G-f-D=f-f- 
Examples f,=j or j” or 2’. Then v;= | (constant) or 2j — 1 (odd numbers) or 2/77, 


Functions Connect the f’s to be piecewise linear. Then the slope v is piecewise 
constant. The area under the »-graph from any t,,,,, to any t.,,g equals f(t..4) —S (tsar): 


Units Distance in miles and velocity in miles per hour. Tax in dollars and tax rate 
in (dollars paid) / (dollars earned), Tax rate is a percentage like .28, with no units. 


4.2 EXERCISES 


Read-through questions 


Start with the numbers f= 1,6, 2,5. Their differences are 
v= _°__. Thesum of those differencesis__> _. This is equal 
to fiz, Minus ¢ . The numbers 6 and 2 have no effect on 
this answer, because in (6 — 1) + (2 — 6) + (5 — 2} the numbers 
6 and 2_9__. The slope of the line between {(0)= 1 and 


{(lI)=6is © . The equation of that line is f(t)=__t 


With distances 1, 5, 25 at unit times, the velocities are 
@_. These are the ___ of the f-graph. The slope of the 


tax graph is the tax __+ __. If f(t) is the postage cost for t 
ounces or ¢ grams, the slope is the _J per _k_. For 
distances 0,1,4,9 the velocities are i. The sum of the 


first j odd numbers is f;=__™_. Then fig is__"___ and the 
velocity vjo is __° 


The piecewise linear sine has slopes ___P__. Those form a 
piecewise __ _ cosine. Both functions have ___' equal to 
6, which means that f(t+6)=__$ for every t. The veloci- 
ties v= 1,2,4,8,... have v;=__! _. In that case fo =1 and 
f,;=_4__. The sum of 1, 2,4, 8,16 is_v__. The difference 


2/—2/-! equals _w__. After a burst of speed V to time T, 
the distance is_ x __. If f(T)= 1 and V increases, the burst 
lasts only to T=__¥__. When V approaches infinity, f(t) 
approaches a _2 _ function. The velocities approach a 

A__ function, which is concentrated at t= 0 but has area 


8 under its graph. The slope of a step function is_¢ 


Problems 1-4 are about numbers f and differences v. 


1 From the numbers f=0, 2, 7, 10 find the differences » and 
the sum of the three v’s. Write down another f that leads 
to the same v’s. For f=0,3,12,10 the sum of the v’s is 
still 


2 Starting from f= !, 3, 2, 4 draw the f-graph (lmear pieces) 
and the p-graph, What are the areas “under” the »-graph that 
add to 4— 1}? If the next number in f is 11, what is the area 
under the next v? 


3 From v=1,2,1,0,—1 find the f’s starting at fy = 3. 
Graph v and f, The maximum value of f occurs when 
v= . Where is the maximum f when v= I, 2, 1, -1? 


4 For f=1,5,c,7 find the differences v,,».,v3 and add 
them up. Do the same for f=a,b,c,7. Do the same for 
f=a, b,c, d. 


Problems 5--11 are about linear fuections and constant slopes. 
5 Write down the slopes of these linear functions: 
{a} f(t)= ble (b) f()=1-2t © f()=44 5-6. 
Compute /(6) and (7) for each function and confirm that 
fS()—f(6) equals the slope. 


6 If f(t)=5+4+3(t—1) and g(t)=1.5+2.5(¢—1) what is 
h(t) = f(t) — g(t)? Find the slopes of f, g, and h. 


4.2 Calculus Without Limits 


7 Suppose v(t) = 2 fort <5 and v(t}=3 for t> 5. 
(a) Ff f{0)=0 find a two-part formula for f(:). 
(b) Check that f(10) equals the area under the graph of 
v(t) (two rectangles} up to t= 10. 

8 Suppose off) = 10 for t< 1/10, v(t} = 0 for ¢ > 1/10, Start- 
ing from f(0}= 1 find f(t} in two pieces, 

9 Suppose g(t) = 2t+ 1 and f(t} = 41. Find g(3) and f(g(3)) 
and f(g(t}). How is the slope of {(ge(t}} related to the slopes 
of f and g? 

10 For the same functions, what are f(3} and g(f(3)} and 


gC f(t)? When ¢ is changed to 41, distance increases 
times as fast and the velocity is multiplied by 


11 Compute f(6} and f(8) for the functions in Problem S. 
Confirm that the slopes v agree with 


£(8)—F(6) 
~S=6° 


_ change in f 
ee ~ change int’ 


Problems 12-18 are based on Example 3 about income taxes. 


12 What are the income taxes on x=$10,000 and 
x = $30,000 and x = $50,000? 


13 What is the equation for income tax f(x} in the second 
bracket $20,350 < x < $49,300? How is the number 11,158.50 
connected with the other numbers in the tax instructions? 


14 Write the tax function F(x) for a married couple if the IRS 
treats them as two single taxpayers each with taxable income 
x/2. (This is not done.) 


15 In the 15% bracket, with 5% state tax as a deduction, the 
combined rate is not 20% but . Think about the tax 
on an extra $100. 


16 A piecewise linear function is continuous when f(t) at the 
end of each interval equals f(t} at the start of the following 
interval. If f(f}=5t up to t=1 and v(t)=2 for t> 1, define 
f beyond ¢=1 so it is (a) continuous (b} discontinuous. 
(c) Define a tax function f(x) with rates .15 and .28 so you 
would lose by earning an extra dollar beyond the breakpoint. 


17 The difference between a tax credit and a deduction from 
income is the difference between f(x)— and f(x — d). Which 
is more desirable, a credit of c=$1000 or a deduction of 
d = $1000, and why? Sketch the tax graphs when f(x) =.15x. 


18 The average tax rate on the taxable income x is a{x)= 
{(x)}/x. This is the slope between (0, 0) and the point (x, f(x). 
Draw a rough graph of a(x), The average rate a is below the 
marginal rate v because 


Problems 19-30 involve numbers fo, f,. o,... and their differ- 
ences v;=/;—f;_1. They give practice with subscripts 0, ..., j. 


19 Find the velocities v,, v2, v3 and formulas for v; and fj: 


(a) f=1,3,5,7-... (b) f=, 10,1... © f=04,2,8,... 


20 Find f,, fo, fs and a formula for f, with fo =0: 
(a) v=1,2,4,8, ... (b) v=—-1,1,-1,1,... 


21 The areas of these nested squares are 17, 27, 37,.... What 
are the areas of the L-shaped bands (the differences between 
squares)? How does the figure show that 1+34+547=47? 


12 3 j; 1 2 3 j 


22 From the area under the staircase (by rectangles and then 
by triangles) show that the first j whole numbers | to j add 
up to 4)? +4). Find 142+ -- + 100, 


23 If v=1,3,5,... then’ f= /?. If e=1,1,1,... 
. Add those to find the sum of 2, 4, 6, ..., 
by 2 to find the sum of 1, 2,3 


then f;= 
2j. Divide 
, soa j- (Compare Problem 22.) 
24 True (with reason) or fatse (with example). 

(a) When the f’s are increasing so are the v's. 

(b) When the »’s are increasing so are the f’s. 

(c} When the /’s are periodic so are the v’s. 

{d) When the v’s are periodic so are the f’s. 


25 If f(t)=¢?, compute f(99) and f(101). Between those 
times, what is the increase in f divided by the increase in t? 


26 If f(t)=17 +2, compute f(99) and (101). Between those 
times, what is the increase in f divided hy the increase in t? 


27 Wf f,=j? + 7+ | find a formula for »;. 


28 Suppose the v’s increase by 4 at every step. Show by 
example and then by algebra that the “second difference” 
Sin ~ 2G 447-1 equals 4. 

29 Suppose fo =0 and the v’s are 1, 4, 4, 4, $, 4, 4, .... For 
which j does fj = 

3 Show that a,=fj..-—24+5-1 wii equals v,,; — 0,. If 
vis velocity then a stands for 


Problems 31-34 involve periodic f’s and v’s (like sint and 


cos tf). 

31 For the discrete sine f=0,1, 1,0, —1, —1,0 find the 
second differences a, =f, — 2f, + fo and a, =f; —2/, +f, and 
a,. Compare a, with f;. 

32 If the sequence v,, v2, ... has period 6 and w,, w3, ... has 
period 10, what is the period of », + w,, v2 + Wa, ...? 


33 Draw the graph of f(t) starting from fo =0 when v= 1, 
—1, -I, 1. If v has period 4 find f(12), f(13), (100.1). 
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34 Graph f(t) from fg=0 to fy=4 when v=1,2,1,0. If v 
has period 4, find /(12) and (14) and (16). Why doesn’t f 
have period 4? 

Problems 35~—42 are about exponential v’s amd {’s. 

35 Find the v’s for f= 1, 3,9, 27. Predict v, and v;. Algebra 
gives 3/— 37-1 =(3 1341. 

3% Find 1+2+4+4+ - +32 and also 1+44+44+-- +H. 


37 Estimate the slope of f(t}=2* at t=0. Use a calculator 
to compute (increase in f}/(increase in t) when ¢ ts small: 
fQ)-fO) 2-1 27-1 2%] 


2-001} 
, eg ea 


38 Suppose fp =1 and v,=2f,_,. Find fg. 


39 (a) From f= 1, 4, 4, 4 find v,, v2, v3 and predict v;. 
(b) Check fy —fo=0) +02 +03 and f,—fj_. =0;. 


40 Suppose v,=r/, Show that f;=(r/*! — 1)/(r— 1) starts 
from fo=1 and has f;—f,., = »;. (Then this is the correct 
fj=l+r4 + +r/=sum of a geometric series.) 


41 From f,=(—I)/ compute v;. What is v, + 02+ +: +0;? 


42 Estimate the slope of f(t)=e' at +0, Use a calculator 
that knows ¢ (or else take e = 2,78) to compute 


Fij)—-f@)_e-1 ei] et] 
fo ae ae oe 


Problems 43-47 are about U(z) =step from 0 to ] at ¢=0. 


43 Graph the four functions U(¢— 1} and U(t)—2 and U(3s) 
and 4U(t). Then graph f(r) = 4U(3t — 1)—2. 


4.3. The Velocity at an Instant 


44 Graph the square wave U{t) — U(t — 1). If this is the veloc- 

ity v(t), graph the distance f(t). If this is the distance f(t), 

graph the velocity. 

45 Two bursts of speed lead to the same distance f= 10: 
v= to t= .001 

As V- o the limit of the /{t)’s is 


v=Vtot= 


46 Draw the staircase function U(t)+ U(t — I) + U(t — 2). Its 
slope is a sum of three functions, 


47 Which capital letters like L are the graphs of functions 
when steps are allowed? The slope of L is minus a delta func- 
tion. Graph the slopes of the others. 


48 Write a subroutine FINDV whose input is a sequence 
So: fis -sfy and whose output is 24, 02,...,0,y. Include 
graphical output if possible. Test on f;= 2j and j? and 2/, 


49 Write a subroutine FINDF whose input is 2,, ..., 9, and 
fo, and whose output is fo, f;, -.., fx. The default value of fy 
is zero. Include graphical output if possible. Test v,; = j. 


50 If FINDV is applied to the output of FINDF, what 
sequence is returned? If FINDF is applied to the output of 
FINDY, what sequence is returned? Watch fy. 


51] Arrange 2j and j? and 2/ and \/j in increasing order 
(a) when j is large: ;=9 (b) when j is small: j =$. 


§2. The average age of your family since 1970 ts a piecewise 
linear function A(z). Is it continuous or does it jump? What 
is its slope? Graph it the best you can. 


We have arrived at the central problems that calculus was invented to solve. There 
are two questions, in opposite directions, and I hope you could see them coming. 


1. If the velocity is changing, how can you compute the distance traveled? 
2. If the graph of f(t) is not a straight line, what is its slope? 


Find the distance from the velocity, find the velocity from the distance. Our goal is 
to do both—but not in one section. Calculus may be a good course, but it is not 
magic. The first step is to let the velocity change in the steadiest possible way. 


Question 4 Suppose the velocity at each time ¢ is v(t) = 2¢. Find f(t). 


With v= 2t, a physicist would say that the acceleration is constant (it equals 2). The 
driver steps on the gas, the car accelerates, and the speedometer goes steadily up. 
The distance goes up too—faster and faster. If we measure t in seconds and v in feet 
per second, the distance f comes out in feet. After 10 seconds the speed is 20 feet 
per second. After 44 seconds the speed is 88 feet/second (which is 60 miles/hour). 
The acceleration is clear, but Aow far has the car gone? 


1.3 The Veloctty at an Instant 


Question 2. The distance traveled by time t is {(t)=t?. Find the velocity v(t). 


The graph of f(t) = t? is on the right of Figure 1.12. It is a parabola. The curve starts 
at zero, when the car is new. At t = 5 the distance is f= 25. By t= 10, f reaches 100. 

Velocity is distance divided by time, but what happens when the speed is changing? 
Dividing f= 100 by t= 10 gives v= !0—the average velocity over the first ten 
seconds, Dividing f= 121 by t= 11 gives the average speed over 11 seconds. But how 
do we find the iastantaneous velocity—the reading on the speedometer at the exact 
instant when t= 10? 


change in slope 21 + fA 
, distance approaches 
i(t+hy? -2 v= 2s 


lime f¢ ft tth t 


Fig. 4.42 The velocity » = 2¢ is linear. The distance f= t? is quadratic. 


I hope you see the problem. As the car goes faster, the graph of t? gets steeper— 
because more distance is covered in each second, The average velocity between t = 10 
and ¢= 11 is a good approximation— but only an approximation—to the speed at 
the moment ¢= 10. Averages are easy to find: 


distance at t= 10 is f(10) = 10? = 100 distance at r= 11 is f(11)= 11? = 121 


FCN) — FO) _ 121— 100 _ 


11-10 I on 


average velocity is 


The car covered 21 feet in that 1 second. Its average speed was 21 feet/second. Since 
it was gaining speed, the velocity at the beginning of that second was below 21. 


Geometnially, what is the average? It is a slope, but not the slope of the curve. 
The average velocity is the slope of a straight line. The line goes between two points 
on the curve in Figure 1.12. When we compute an average, we pretend the velocity 
is constant—so we go back to the easiest case. It only requires a division of distance 
by time: 
change in f 


average velocity = 5 
6 y change in t 


(1) 


Calculus and the Law You enter a highway at 1:00. If you exit 150 miles away at 
3:00, your average speed is 75 miles per hour. I'm not sure if the police can give you 
a ticket. You could say to the judge, “When was I doing 75?” The police would have 
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to admit that they have no idea—but they would have a definite feeling that you 
must have been doing 75 sometime.t 


We return to the central problem—computing v(10) at the instant t= 10. The 
average velocity over the next second is 21. We can also find the average over the 
half-second between t = 10.0 and t = 10.5. Divide the change in distance by the change 
in time: 


10.5 — 10.0 5 o 


That average of 20.5 is closer to the speed at t= 10. It is still not exact. 

The way to find v(10) is to keep reducing the time interval. This is the basis for 
Chapter 2, and the key to differential calculus. Find the slope between points that are 
closer and closer on the curve. The “limit” is the slope at a single point. 


Algebra gives the average velocity between t= 10 and any later time t= 10+ h. 
The distance increases from 10? to (10 + h)?. The change in time is h. So divide: 


+h) - 2 + + Ae — 
irs * ee ae a Oe k 2) 


This formula fits our previous calculations. The interval from t= 10 to t= 11 had 
h=1, and the average was 20+ h= 21. When the time step was h= 4, the average 
was 20+4= 20.5. Over a millionth of a second the average will be 20 plus 
1/1,000,000— which is very near 20. 


Conclusion: The velocity at ¢ = 10 is v= 20. That is the slope of the curve. It agrees 
with the v-graph on the left side of Figure 1.12, which also has v(10) = 20. 


We now show that the two graphs match at all times. If f(t)=t? then v{t} = 21. 
You are seeing the key computation of calculus, and we can put it into words before 
equations. Compute the distance at time t+ h, subtract the distance at time t, and 
divide by h. That gives the average velocity: 


+ — , a | 2 + ey | 
Repeal - fore Ese tanh AF sede 6) 


This fits the previous calculation, where t was 10. The average was 20 + h. Now the 
average is 2: + h. It depends on the time step h, because the velocity is changing. But 
we can see what happens as h approaches zero. The average is closer and closer to 
the speedometer reading of 2t, at the exact moment when the clock shows time t: 


4E As h approaches zero, the average velocity 2t+ 4 approaches v{t) = 28. 


Note The computation (3) shows how calculus needs algebra. If we want the whole 
v-graph, we have to let time be a ‘“‘variable.” It is represented by the letter ¢. Numbers 
are enough at the specific time t = 10 and the specific step h = 1—but algebra gets 
beyond that. The average between any t and any + his 2t + A. Please don’t hesitate 
to put back numbers for the letters—that checks the algebra. 


+This is our first encounter with the much despised “Mean Value Theorem.” If the judge can 
prove the theorem, you are dead. A few v-graphs and f-graphs will confuse the situation 
(possibly also a delta function), 


4.3 The Velocity at an Instant 


There is also a step beyond algebra! Calculus requires the &mit of the average. As 
h shrinks to zero, the points on the graph come closer. “Average over an interval” 
becomes “velocity at an instant.” The general theory of limits is not particularly 
simple, but here we don’t need it. {It isn’t particularly hard either.} In this example 
the limiting value is easy to identify. The average 2t+h approaches 2t, as h 0. 


What remains to do in this section? We answered Question 2—to find velocity 
from distance. We have not answered Question 1. If v(t) = 2¢ increases linearly with 
time, what is the distance? This goes in the opposite direction (it is integration). 

The Fundamental Theorem of Calculus says that no new work is necessary, If the 
slope of f(t) leads to v(t), then the area under that v-graph leads back to the {-graph. 
The odometer readings f=? produced speedometer readings v = 2t. By the Funda- 
mental Theorem, the area under 2t should be ¢t?. But we have certainly not proved 
any fundamental theorems, so it is better to be safe—by actually computing the area. 

Fortunately, it is the area of a triangle. The base of the triangle is t and the height 
is v= 2t. The area agrees with f(z): 


area = }(base)(height) = 4(¢)(2t) = ¢?. (4) 


EXAMPLE 4 The graphs are shifted in time. The car doesn’t start until t = 1. Therefore 
v=0 and f=0 up to that time. After the car starts we have v= 2(t— 1} and 
f=(t— 1)*. You see how the time delay of 1 enters the formulas. Figure 1.13 shows 
how it affects the graphs. 


v=2(¢-1) 


l 2 ] 2 
Fig. 1.43 Delayed velocity and distance, The pairs » = at +b and f= 4at? + bt. 


EXAMPLE 2 The acceleration changes from 2 to another constant a. The velocity 
changes from v= 2t to p= at. The acceleration is the slope of the velocity curve! The 
distance is also proportional to a, but notice the factor 4: 


acceleration a <> velocityy=at <> distance f= at? 


If a equals 1, then v= t and f= }¢*. That is one of the most famous pairs in calculus. 
If a equals the gravitational constant g, then »v = gt is the velocity of a falling body. 
The speed doesn’t depend on the mass (tested by Galileo at the Leaning Tower of 
Pisa). Maybe he saw the distance f= }g¢* more easily than the speed v = gt. Anyway, 
this is the most famous pair in physics. 
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EXAMPLE 3 Suppose /(t)=3t+t?. The average velocity from ¢ to t +h is 


; _fttth-fO) _ 3+ At(et+ hy? — 3-2? 
ave h h = 


The change in distance has an extra 3h (coming from 3(t + #) minus 3t). The velocity 
contains an additional 3 (coming from 3h divided by A), When 3t is added to the 
distance, 3 is added to the velocity. If Galileo had thrown a weight instead of dropping 
it, the starting velocity v, would have added uot to the distance. 


FUNCTIONS ACROSS TIME 


The idea of slope is not difficult—for one straight line. Divide the change in f by 
the change in t. In Chapter 2, divide the change in y by the change in x. Experience 
shows that the hard part is to see what happens to the slope as the line moves. 

Figure 1.14a shows the line between points A and B on the curve. This is a “secant 
line.” [ts slope is an average velocity. What calculus does is to bring that point B 
down the curve toward A. 


speed 
60 
car C 

vorf? 

30 

car D 
t for v? 
t tth ake, ete 32" ll 
4 2 4 


Fig. 4.44 Slope of line, slope of curve. Two velocity graphs. Which is which? 


. Question 4 What happens to the “change in f’’—the height of B above A? 
Answer The change in f decreases to zero. So does the change int. 


Question 2 As 8 approaches A, does the slope of the line increase or decrease? 
Answer lam not going to answer that question. It is too important. Draw another 
secant line with B closer to A. Compare the slopes. 


This question was created by Steve Monk at the University of Washington— where 
57% of the class gave the right answer. Probably 97% would have found the right 
slope from a formula. Figure 1.14b shows the opposite problem. We know the veloc- 
ity, not the distance, But calculus answers questions about both functions. 


Question 3) Which car is going faster at time t = 3/4? 
Answer Car C has higher speed. Car D has greater acceleration. 


Question 4 If the cars start together, is D catching up to C at the end? Between 
t= 4 and t= 1, do the cars get closer or further apart? 
Answer This time more than half the class got it wrong. You won't but you can see 
why they did. You have to look at the speed graph and imagine the distance graph. 
When car C is going faster, the distance between them 
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To repeat: The cars start together, but they don’t finish together. They reach the 
same speed at t= 1, not the same distance. Car C went faster. You really should draw 


their distance graphs, to see how they bend. 


These problems help to emphasize one more point. Finding the speed (or slope) is 
entirely different from finding the distance (or area): 


1. To find the slope of the f-graph at a-particular time t, you don’t have to know 


the whole history. 


2. To find the area under the v-graph up to a particular time ¢, you do have to 


know the whole history. 


A short record of distance is enough to recover v(t). Point B moves toward point A. 
The problem of slope is focai—the speed is completely decided hy f(t) near point A. 

In contrast, a short record of speed is not enough to recover the total distance. We 
have to know what the mileage was earlier. Otherwise we can only know the increase 


in mileage, not the total. 


4.3 


Read-through questions 


Between the distances f(2) = 100 and f(6) = 200, the average 
velocity is _9@ . If f(t)=42? then f(6)=_b and 
f(8}=__©__. The average velocity in between is_ 4. The 


instantaneous velocities at t=6 and t=8 are _ 9 and 
t 


The average velocity is computed from f(c) and f(t + A) by 
Viye=_ 9. If f(t)=e? then »,,,= 4, From t=1 to 
t=1.1 the average ts _ |. The instantaneous velocity 
is the _| _ of v,,.. If the distance is {(t}=4ar? then the 


velocity is v(t}=__k___ and the acceleration is __! 


On the graph of f(¢), the average velocity between A and 
Bis the slope of _™ _. The velocity at A isfound by _"_. 
The velocity at B is found by _@ . When the velocity is 


positive, the distance is__P __. When the velocity is increas- 
ing, the car is__ 4 


1 Compute the average velocity between t = 5 and ¢=8: 


{a) f(t) =6t (b) f(t)=6t+2 
(ce) f(t} =4ar? dd) f@)=t—-2 
(e} f(t)=6 (f} v(t} = 2¢ 


2 For the same functions compute [f(t + 4) —f(e)]/h. This 
depends on ¢ and &, Find the limit as h 30. 


3 If the odometer reads f{t)=t?+t¢ (f in miles or kilo- 
meters, ¢ in hours), find the average speed between 
(a) t=1 and t4=2 
(b) t=Landt=1.1 
{c) t=Landr=1+h 
(d) t=1 and t=.9 (note # = —.1) 


4 For the same f{t)= ¢? +1, find the average speed between 
(a) t=Gand1 (b} t=Oand4 (c) t=Oand &. 

5 In the answer to 3(c), find the limit as h > 0. What does 

that limit tell us? 


6 Set h=0 in your answer to 4c). Draw the graph of 
f(t}=t* +t and show its slope at ¢ =0. 


7 Draw the graph of o(t)=1+42t. From geometry find 
the area under it from 0 to f. Find the slope of that area 
function f(t). , 


8 Draw the graphs of o(t)= 3 — 2¢ and the area f(t). 


9 True or false 

(a) If the distance f(t) is positive, so is v(t). 

(b) If the distance f(t) is increasing, so ts v{t). 

(c) If f(t} 1s positive, v(t) is increasing. 

(d) If v(t) is positive, f(t) is increasing. 
10 If f(z) = 62? find the slope of the f-graph and also the 
v-graph. The slope of the p-graph is the ; 


Il If f(t) = 2? what is the average velocity between ¢ = 9 and 
t= 1.1? What is the average between ¢—h and ¢+h? 


12 (a) Show that for f(t) = 4at? the average velocity between 
t—h and ¢ +h is exactly the velocity att. 
(b} The area under v(t) = at from t—h# to? + his exactly 
the base 2h times : 

13 Find f(t} from v(t} = 201 if (0) = 12. Also if f(1) = 12. 

14 True or faise, for any distance curves. 


(a) The slope of the line from A to Bis the average velocity 
between those points, 
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{b) Secant lines have smaller slopes than the curve, 


(c) If f(t} and F(t} start together and finish together, the 
average velocities are equal. 


(d) If oft) and V{c) start together and finish together, the 
increases in distance are equal. 
15 When you jump up and fall back your height is y = 2¢ — 2? 
in the right units. 
(a} Graph this parabola and its slope. 
(b) Find the time in the air and maximum height. 
(c} Prove: Half the time you are above y =}. 
Basketball players “hang” in the air partly because of (c). 
16 Graph f(t}=t? and g(t)=f(t}—2 and Att)=f(2z), all 
from f =0 to t= 1. Find the velocities. 
17 (Recommended) An up and down velocity is v(t} = 2t for 


<3, v(t)=12—2¢ for +23. Draw the piecewise parabola 
f(t). Check that /(6} = area under the graph of v(t). 


18 Suppose o(t) = +t for ¢<2 and v(t) =2 for ¢ > 2. Draw the 
graph of f(t) out to ¢ = 3. 


19 Draw f(t) up to :=4 when v(t) increases linearly from 
{a} Oto 2 (b) —1 to 1 (c) —2to 0. 


20 (Recommended) Suppose v(t) is the piecewise linear sine 
function of Section 1.2. (In Figure 1.8 it was the distance.} 


4.4 Circular Motion 


Find the area under v(t) between t=0 and t= 1, 2, 3, 4, 5, 6. 
Plot those points f(1), ..., §(6) and draw the complete piece- 
wise parabola f(t). 


21 Draw the graph of f(t}=|1—:7| for O<¢<2. Find a 
three-part formula for v(t). 
22 Draw the graphs of f(t) for these velocities (to t = 2): 
(a) o(t)=1-¢ 
(b) o(t}=|1—¢| 
{c) v(t} =(t -—2) + [1 -¢l. 
23 When does f(t)=!*—31 reach 10? Find the average 


velocity up to that ttme and the instantaneous velocity at that 
time. 


24 If f(t}=4at? + bt +c, what is v(t)? What is the slope of 
p(t)? When does f{t) equal 41, ifa=b=c=t? 


25 If f(t)=2? then v(t) =2t. Does the speeded-up function 
{4} have velocity v(4) or 4v(t) or 4v(42)? 


26 If f(t)=t—2t? find v(t) and f(3t}. Does the slope of f(3t) 
equal v(3t) or 3v(t} or 3v(3t)? 


27 For f(t)=¢7 find v,y-(t) between 0 and ¢. Graph »,,,(t) 
and v(£). 


78 If you know the average velocity v,,.(t), how can you find 
the distance f(t)? Start from (0) =0. 


This section introduces completely new distances and velocities—the sines and cosines 
from trigonometry. As | write that last word, I ask myself how much trigonometry it 
is essential to know. There will be the basic picture of a right triangle, with sides cos t 
and sin t and [. There will also be the crucial equation (cos t)? + (sin t)’ = 1, which 
is Pythagoras’ law a? + b? = c*, The squares of two sides add to the square of the 
hypotenuse (and the 1 is really 17). Nothing else is needed immediately. If you don’t 
know trigonometry, don’t stop—an important part can be learned now. 

You will recognize the wavy graphs of the sine and cosine. We intend to find the 
slopes of those graphs. That can be done without using the formulas for sin(x + y) 
and cos (x + y}—which later give the same slopes in a more algebraic way. Here it is 
only basic things that are needed.t And anyway, how complicated can a triangle be? 


Remark You might think trigonometry is only for surveyors and navigators (people 
with triangles). Not at all! By far the biggest applications are to retation and vibration 
and oscillation. It is fantastic that sines and cosines are so perfect for ‘repeating 


motion” —around a circle or up and down. 


+Sines and cosines are so important that I added a review of trigonometry in Section 1.5. But 
the concepts in this section can be more valuable than formulas. 


4.4 Circular Motion 


cost 


Fig. 4.45 As the angle ¢ changes, the graphs show the sides of the right triangle. 


Our underlying goal is to offer one more example in which the velocity can be 
computed by common sense. Calculus is mainly an extension of common sense, but 
here that extension is not needed. We will find the slope of the sine curve. The straight 
line f= vt was easy and the parabola f= 4at? was harder. The new example also 
involves realistic motion, seen every day. We start with circudar motion, in which the 
position is given and the velocity will be found. 


A ball goes around a circle of radius one. The center is at x = 0, = 0 (the origin). 
The x and y coordinates satisfy x? + y? = 17, to keep the bail on the circle. We specify 
its position in Figure 1.16a by giving its angle with the horizontal. And we make the 
ball travel with constant specd. by requiring that the angle is equal to the time ¢. The 
ball goes counterclockwise. At time { it reaches the point where the angle equals 1. 
The angle is measured in radians rather than degrees. so a full circle is completed at 
t= 2x instead of t = 360. 

The bail starts on the x axis, where the angle is zero. Now find it at time t: 


The ball is at the point where <= cos t and y = sin t. 


This is where trigonometry is useful. The cosine oscillates between 1 and — 1, as the 
ball goes from far right to far left and back again. The sine also oscillates between | 
and —1, starting from sinQ=0. At time 7/2 the sine (the height) increases to one. 
The cosine is zero and the ball reaches the top point x = 0, y = 1. At time x the cosine 
is —! and the sine is back to zero—the coordinates are {(— 1, 0). At t = 2z the circle 
is complete (the angle is also 2z), and x= cos 2x = 1, y = sin 2x =0. 


vertical 


speed | ~ velocity 


vertical 
distance 


Fig. 1.46 Circular motion with speed |, angle r, height sin ¢. upward velocity cos ¢. 
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Important point: The distance around the circle (its circumference) is 2ar = 27, 
because the radius is 1. The ball travels a distance 22 in a time 22. The speed equals 
1. It remains to find the velocity, which involves not only speed but direction. 


Degrees vs. radians A full circle is 360 degrces and 27 radians. Therefore 
1 radian = 360/2n degrees ~ 57.3 degrees 
1 degree = 27/360 radians ~ .01745 radians 


Radians were invented to avoid those numbers! The speed is exactly 1, reaching ¢ 
radians at time t. The speed would be .01745, if the ball only reached t degrees. The 
ball would complete the circle at time T= 360. We cannot accept the division of the 
circle into 360 pieces (by whom?}, which produces these numbers. 

To check degree mode vs. radian mode, verify that sin [° = .017 and sin 1 ~ .84. 


VELOCITY OF THE BALL 


At time t, which direction is the ball going? Calculus watches the motion between ¢ 
and t+. For a ball on a string, we don’t need calculus—just let go. The direction 
of motion is tangent to the circle. With no force to keep it on the circle, the ball goes 
off on a tangent. If the ball is the moon, the force is gravity. If it is a hammer swinging 
around on a chain, the force is from the center. When the thrower lets go, the hammer 
takes off—and it is an art to pick the right moment. (I once saw a friend hit by a 
hammer at MIT. He survived, but the thrower quit track.) Calculus wili find that 
same tangent direction, when the points at t and ¢ + A come close. 

The “velocity triangle” is in Figure 1.16b. Jt is the same as the position triangle, 
but rotated through 90°. The hypotenuse is tangent to the circle, in the direction the 
ball is moving. Its length equals 1 (the speed). The angle ¢ still appears, but now it is 
the angle with the vertical. The upward component of velocity is cos t, when the upward 
component of position is sint. That is our common sense calculation, based on a 
figure rather than a formula. The rest of this section depends on it—and we check 
v= cost at special points. 

At the starting time t = 0, the movement is all upward. The height is sin 0 = 0 and 
the upward velocity is cos 0 = 1. At time 2/2, the ball reaches the top. The height is 
sin n/2=1 and the upward velocity is cos x/2=0. At that instant the ball is not 
moving up or down. 

The horizontal velocity contains a minus sign. At first the ball travels to the left. 
The value of x is cos t, but the speed in the x direction is — sin t. Haif of trigonometry 
is in that figure (the good half), and you see how sin?¢+cos*¢=1 is so basic. 
That equation applies to position and velocity, at every time. 


Application of plane geometry: The right triangles in Figure 1.16 are the same size 
and shape. They look congruent and they are—the angle ¢ above the ball equals the 
angle t at the center. That is because the three angles at the bail add to 180°. 


OSCILLATION: UP AND DOWN MOTION 


We now use circular motion to study straight-line motion. That line will be the y axis. 
Instead of a ball going around a circle, a mass will move up and down. It oscillates 
between y= 1 and y= — 1. The mass is the ‘‘shadow of the ball,” as we explain in a 
moment. 


4.4 Circular Motion 


There is a jumpy oscillation that we do not want, with p=1 and v= —1. That 
“bang-bang” velocity is like a billiard ball, bouncing between two walls without 
slowing down. If the distance between the walls is 2, then at t = 4 the ball is back to 
the start. The distance graph is a zigzag (or sawtooth) from Section 1.2. 

We prefer a smoother motion. Instead of velocities that jump between +1 and —1, 
a real oscillation slows down to zero and gradually builds up speed again. The mass 
is on a spring, whicb pulls it back. The velocity drops to zero as the spring is fully 
stretched. Then v is negative, as the mass goes the same distance in the opposite 
direction. Simple harmonic motion is the most important back and forth motion, 
while f= vt and f= 4at? are the most important one-way motions. 
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Aig. 4.47 Circular motion of the ball and harmonic motion of the mass (ils shadow). 


How do we describe this oscillation? The best way is to match it with the ball on 
the circle. The height of the ball will be the height of the mass. The “shadow of the 
hail” goes up and down, level with the bali. As the ball passes the top of the 
circle, the mass stops at the top and starts down. As the ball goes around the bottom, 
the mass stops and turns back up the y axis. Halfway up (or down), the speed is 1. 

Figure t.17a shows the mass at a typical time t. The height is y= f(t) =sin ¢, level 
with the ball. This height oscillates between f= 1 and f= —1. But the mass does not 
move with constant speed. The speed of the mass is changing although the speed of 
the ball is always 1. The time for a full cycle is still 2x, but within that cycle the mass 
speeds up and slows down. The problem is to find the changing velocity v. Since the 
distance is f= sin t, the velocity will be the slope of the sine curve. 


THE SLOPE OF THE SINE CURVE 


At the top and bottom (f= 2/2 and t = 3x/2) the ball changes direction and p= 0. 
The slope at the top and bottom of the sine curve is zero.t At time zero, when the ball 
is going straight up, the slope of the sine curve is y= 1. At t=, when the ball and 
mass and f-graph are going down, the velocity is y= —1. The mass goes fastest at 
the center. The mass goes slowest (in fact it stops) when the height reaches a maximum 
or minimum, The velocity triangle yields » at every time ¢. 

To find the upward velocity of the mass, look at the upward velocity of the ball. 
Those velocities are the same! The mass and ball stay level, and we know v from 
circular motion: The upward velocity is v = cos t. 


t+That looks easy but you will see later that it is extremely important. At a maximum or 
minimum the slope is zero. The curve levels off. 
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Figure 1.18 shows the result we want. On the right, f= sin t gives the height. On 
the left is the velocity » = cos t. That velocity is the slope of the f-curve. The height 
and velocity (red lines) are oscillating together, but they are out of phase—just as 
the position triangle and velocity triangle were at right angles. This is absolutely 
fantastic, that in calculus the two most famous functions of trigonometry form a pair: 
The slope of the sine curve is given by the cosine curve. 


When the distance is {(t) = sin t, the velocity is v(t)= cos t. 


Admission of guilt: The slope of sint was not computed in the standard way. 
Previously we compared (t + h)? with t?, and divided that distance by h. This average 
velocity approached the slope 2t as h became small. Fer sin t we could have done the 
same: 
change in sint _ sin(t+h)—sint 


average velocity = : 
e y change in t h 


(1) 
This is where we need the formula for sin (t + h), coming soon. Somehow the ratio in 


(1) should approach cos't as h — 0. (It does.) The sine and cosine fit the same pattern 
as ¢? and 2:—our shortcut was to watch the shadow of motion around a circle. 


Fig. 4.148 v=cost when f=sin? {red}; y= —sint when f=cos t (black). 


Question 4 What if the bail goes twice as fast, to reach angle 2t at time 1? 


Answer The speed is now 2. The time for a fuil circle is only x. The bail’s position 
is x = cos 2t and y = sin 2t. The velocity is still tangent to the circle—but the tangent 
is at angle 2t where the ball is. Therefore cos 2t enters the upward velocity and 
—sin 2t enters the horizontal velocity. The difference is that the velocity triangle is 
twice as big. The upward velocity is not cos 2t but 2 cos 2#. The horizontal velocity 
is —2 sin 2t. Notice these 2’s! 


Question 2 What is the area under the cosine curve from 1 = 0 to t = 71/2? 


You can answer that, if you accept the Fundamental Theorem of Calculus— 
computing areas is the opposite of computing slopes. The slope of sin f is cos t, so the 
area under cos ¢ is the increase in sin t. No reason to believe that yet, but we use it 
anyway. 

From sin 0= 0 to sin x/2 = 1, the increase is t. Please realize the power of calculus. 
No other method could compute the area under a cosine curve so fast. 


4.4 Circular Motion 
THE SLOPE OF THE COSINE CURVE 


I cannot resist uncovering another distance and velocity (another f-v pair) with no 
extra work. This time f is the cosine. The time clock starts af the top of the circle. 
The old time ¢ = 2/2 is now t= 0. The dotted lines in Figure 1.18 show the new start. 
But the shadow has exactly the same motion—the hall keeps going around the circle, 
and the mass follows it up and down. The f-graph and v-graph are still correct, both 
with a time shift of 7/2. 

The new f-graph is the cosine. The new v-graph is munus the sine. The slope of the 
cosine curve follows the negative of the sine curve. That is another famous pair, twins 
of the first: 


When the distance is f(t)= cos t, the velocity is v(t) = — sin ¢. 


You could see that coming, by watching the ball go left and right (instead of up and 
down). Its distance across is f= cos t. Its velocity across is y = — sin t. That twin pair 
completes the calculus in Chapter 1 (trigonometry to come). We review the ideas: 


v is the vefocity 
the slope of the distance curve 
the Himit of average velocity over a short time 
the derivative of f. 

f is the distance 
the area under the velocity curve 
the dimit of total distance over many short times 
the integral of v. 


Differential cafculus: Compute v from f. Integral caiculus: Compute f from v. 


With constant velocity, f equals vt. With constant acceleration, v= at and f= 4ar?. 
In harmonic motion, p= cost and f= sin £. One part of our goal is to extend that 
list—for which we need the tools of calculus. Another and more important part is 
to put these ideas to use. 


Before the chapter ends, may I add a note about the book and the course? The 
book is more personal than usual, and I hope readers will approve. What I write is 
very close to what I would say, if you were in this room. The sentences are spoken 
before they are written.t Calculus is alive and moving forward —it needs to be taught 
that way. 

One new part of the subject has come with the computer. It works with a finite 
step h, not an “infinitesimal” limit. What it can do, it does quickly-—even if it cannot 
find exact slopes or areas. The result is an overwhelming growth in the range of 
problems that can be solved. We landed on the moon because f and » were so 
accurate. (The moon's orbit has sines and cosines, the spacecraft starts with pv = at 
and f=4at?. Only the computer can account for the atmosphere and the sun’s gravity 
and the changing mass of the spacecraft.) Modern mathematics is a combination of 
exact formulas and approximate computations. Neither part can be ignored, and I 
hope you will see numerically what we derive algebraically. The exercises are to help 
you master both parts. 


tOn television you know immediately when the words are live. The same with writing, 
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The course has made a quick start—not with an abstract discussion of sets or 
functions or limits, but with the concrete questions that led to those ideas. You have 
seen a distance function f and a limit v of average velocities, We will meet more 
functions and more limits (and their definitions!) but it is crucial to study important 
examples early. There is a lot to do, but the course has definitely begun. 


4.4 EXERCISES 


Read-through questions 


A ball at angle ¢ on the unit circle has coordinates x= __a 

and y=__b__ [tcompletesa fullcircleatt = __c __. Its speed 
is _d__. Its velocity points in the direction of the _e_, 
whichis__f _ to the radius coming out from the center. The 


upward velocity is__g___ and the horizontal velocity is__h 


A mass going up and down level with the ball has height ’ 


f(t)=__i_. Thisiscalled simple J motion. The velocity 


isv(t)=__& . When t=2/2 the height is f=__} and the 
velocity is y=__m_. If a speeded-up mass reaches f= sin 2¢ 
at time ¢, its velocity isp =__n__. A shadow traveling under 
the ball has f=cos? and »=__o _. When f is distance = 


area = integral. vis _po=_q = _¢ 


1 For a ball going around a unit circle with speed 1, 
(a) how long does it take for 5 revolutions? 
(b) at time ¢ = 3x/2 where is the ball? 
(c) at ¢ = 22 where is the ball (approximately)? 
2 For the same motion find the exact x and » coordinates 


at ¢=2n/3. At what time would the ball hit the x axis, if it 
goes off on the tangent at 1 = 2n/3? 


3 A ball goes around a circle of radius 4. At time ¢ (when it 
reaches angle f) find 
(a) its x and y coordinates 
(b) the speed and the distance traveled 
{c} the vertical and horizonta! velocity. 
4 On a circle of radius R find the x and y coordinates at 


time ¢ (and angle ¢), Draw the velocity trrangle and find the 
x and y velocities. 


5 A ball travels around a unit circle (raaius 1} with speed 3, 
starting from angle zero. At time 2, 
(a) what angle does it reach? 
(b} what are its x and y coordinates? 
(c} what are its x and y velocities? This part ts harder. 
6 If another ball stays 2/2 radians ahead of the ball with 


speed 4, find its angle, its x and y coordinates, and its vertical 
velocity at time t. 


7 A mass moves on the x axis under or over the original 
ball (on the unit circle with speed [). What is the position 
x =/(t)? Find x and v at t= 2/4. Plot x and » up tot=a. 


8 Does the new mass (under or over the ball) meet the old 
mass (level with the ball)? What is the distance between 
the masses at time t? 


9 Draw graphs of f (t}=cos 3t and cos 2xt and 27 cost, 
marking the time axes. How long until each f repeats? 


10 Draw graphs of f=sin(t+2) and »=cos(t+ 2). This 
oscillation stays level with what ball? 


11 Draw graphs of f=sin(x/2—14) and v= —cos(x/2 — 8). 
This oscillation stays level with a ball going which way start- 
ing where? 


12 Draw a graph of f(t) =sin ¢ + cos 7, Estimate its greatest 
height (maximum f) and the time it reaches that height. By 
computing f? check your estimate, 


13 How fast should you run across the circle to meet the ball 
again? It travels at speed 1. 


14 A mass falls from the top of the unit circle when the ball 
of speed | passes by. What acceleration a is necessary to meet 
the ball at the bottom? 


Find the area under v = cos t from the change in f= sin: 
15 from r=Otoi=n2 16 from :=O0tot=2/6 
17 from t=O to f=2n 18 from t =2/2 to ¢ = 3x2. 


I9 The distance curve f=sin 4t yields the velocity curve 
v=4cos 4¢. Explain both 4's. 


20 The distance curve f=2 cos 32 yields the velocity curve 
v=—6sin 3. Explain the —6. 


21 The velocity curve v=cos 4i yields the distance curve 
f=%sin 4¢. Explain the . 


22 The velocity » = 5 sin 5¢ yields what distance? 
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23 Find the slope of the sine curve at t = 7/3 from v =cost. 
Then find an average slope by dividing sin x/2 —sin 1/3 by 
the time difference x/2 — 1/3. 

24 The slope of f=sin¢t at t=0 is cosO=1. Compute 
average slopes (sin ¢)/t for ¢= 1, .1, .01, .00L. 


The ball at x=cost, y=sin ¢ circles (1) counterclockwise 
(2) with radius | (3) starting from x= 1, y=0 (4) at speed 1. 
Find {1}(2)(3)(4) for the motions 25—30. 


25 x=cos 3, y=—sin 3t 

26 x =3cos 4t, y=3sin 4t 

27 x =Ssin 21, y=Scos 2t 
28 x=1+cost, y=sint 

29 x=cos(t+ 1), y=sin(t+ 1) 


The oscillation x = 0, y = sin ¢ goes (1) up and down (2) between 
—1 aud 1 (3) starting from x=0, p=0 (4) at velocity 
v=cos t. Find (1}(2)(3)(4) for the oscillations 31-3. 

3k x=cost, y=0 32 x=0, y =sin St 
33 x=0, y=2 sin(t + 4) 


35 x=0, y=—2costr 


34 x=cost, p= cost 

36 x=cos*?t, y=sin*t 

37 If the ball on the unit circle reaches ¢ degrees at time t, 
find tts position and speed and upward velocity. 


38 Choose the number k so that x =cos kt, y=sin kt com- 
pletes a rotation at ¢ = 1. Find the speed and upward velocity. 


39 Ifa pitcher doesn’t pause before starting to throw, a balk 
is called. The American League decided mathematically that 
there is always a stop between backward and forward motion, 
even if the time is too short to see it. (Therefore no balk.) Is 


30 x =cos(—tf), y =sin{—t) that true? 


Re §=8=61.5 AReview of Trigonometry xy 


Trigonometry begins with a right triangle. The size of the triangle is not as important 
as the angles. We focus on one particular angle—call it @—and on the ratios between 
the three sides x, y,r. The ratios don’t change if the triangle is scaled to another 
size. Three sides give six ratios, which are the basic functions of trigonometry: 


x hear side r 1 
cos @ = — = ————_. sec @= —-= 
r hypotenuse x cos é 
‘ 
y ‘ oO ite sid 1 
Gi v=e eee ose @= 6 = —— 
r hypotenuse y sind 
x y _ opposite side x 1 
tan @= = = ——_—-— cot @=-= 
Fig. 1.49 x near side y tanég 


Of course those six ratios are not independent. The three on the right come directly 
from the three on the left. And the tangent is the sine divided by the cosine: 


os@ x/r x 


Note that “tangent of an angle” and “tangent to a circle” and “tangent line to a 
graph” are different uses of the same word. As the cosine of @ goes to zero, the tangent 
of @ goes to infinity. The side x becomes zero, 8 approaches 90°, and the triangle is 
infinitely steep. The sine of 90° is y/r = 1. 

Triangles have a serious limitation. They are excellent for angles up to 90°, and 
they are OK up to 180°, but after that they fail. We cannot put a 240° angle into a 
triangle. Therefore we change now to a circle. 
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Fig. 1.20 Trigonometry on a circle. Compare 2 sin @ with sin 29 and tan 6 (periods 2x, x, x). 


Angles are measured from the positive x axis (counterclockwise). Thus 90° is 
straight up, 180° is to the left, and 360° is in the same direction as 0°. (Then 450° is 
the same as 90°.) Each angle yields a point on the circle of radius r. The coordinates 
x and y of that point can be negative (but never r). As the point goes around the 
circle, the six ratios cos 0, sin 6, tan 8, .., trace out six graphs. The cosine waveform 
is the same as the sine waveform—just shifted by 90°. 

One more change comes with the move to a circle. Degrees are out. Radians are 
in. The distance around the whole circle is 2zr. The distance around to other points 
is Or. We measure the angle by that multiple @. For a half-circle the distance is rr, 
so the angle is x radians—which is 180°. A quarter-circle is 2/2 radians or 90°. 
The distance around to angle 6 is r times §. 

When r = 1 this is the ultimate in simplicity: The distance is 9. A 45° angle is } of 
a circle and 22/8 radians—and the length of the circular arc is 2/8. Similarly for 1°: 


360° = 2n radians 1° = 22/360 radians 1 radian = 360/2n degrees. 


An angle going clockwise is negative. The angle — 2/3 is — 60° and takes us 4 of the 
wrong way around the circle. What is the effect on the six functions? 

Certainly the radius r is not changed when we go to —@. Also x is not changed 
(see Figure 1.20a). But y reverses sign, because —@ is below the axis when +4 is 
above. This change in y affects y/r and y/x but not x/r: 


cos (~— 6) =cos 4 sin(— 0) = — sin 0 tan (~ @) = — tan @. 
The cosine is even (no change). The sine and tangent are odd (change sign). 

The same point is 2 of the right way around. Therefore 3 of 2 radians (or 300°) 
gives the same direction as —x/3 radians or —60°. A difference of 2n makes ne 
difference ta x, y,r. Thus sin @ and cos @ and the other four functions have period 27. 
We can go five times or a hundred times around the circle, adding 102 or 200z to 
the angle, and the six functions repeat themselves. 

EXAMPLE Evaluate the six trigonometric functions at § = 22/3 (or 6 = — 4n/3). 


This angle is shown in Figure 1.20a (where r= 1). The ratios are 
cos = x/r= —1/2 sin 0 = yr = ./3/2 tan G=y/x=-/3 
sec O= —2 csc 0 = 2/./3 cot @= -1/,/3 
Those numbers illustrate basic facts about the sizes of four functions: 
|cos 8] <1 |sin 8] <1 |sec 0] > 1 fese 0| > 1. 


The tangent and cotangent can fall anywhere, as long as cot @= 1/tan @. 


1.5 A Review of Trigonometry 


The numbers reveal more. The tangent — J/3 is the ratio of sine to cosine. The 
secant —2 is 1/cos @. Their squares are 3 and 4 (differing by 1). That may not seem 
remarkable, but it is. There are three relationships in the squares of those six numbers, 
and they are the key identities of trigonometry: 


cos? 6+ sin?9=1 1 + tan? 8 = sec? 9 cot? @+ 1 =csc? @ 


Everything flows from the Pythagoras formula x* + y? =r’. Dividing by 7r* gives 
{x/r)? + (y/r)? = 1. That is cos? @ + sin? @ = 1. Dividing by x? gives the second identity, 
which is 1 + (y/x)* =(r/x)*. Dividing by y? gives the third. All three will be needed 
throughout the book—and the first one has to be unforgettable. 


DISTANCES AND ADDITION FORMULAS 


To compute the distance between points we stay with Pythagoras. The points are in 
Figure 1.21a. They are known by their x and y coordinates, and d is the distance 
between them. The third point completes a right triangle. 

For the x distance along the bottom we don’t need help. It is x, — x, (or |x. — x,| 
since distances can’t be negative). The distance up the side is |y, — y,|. Pythagoras 
immediately gives the distance d: 


distance between points = d= «Ga — x1)? +(y2—y,)*- (1) 


Fig. 4.24 Distance between points and equal distances in two circles. 


By applying this distance formula in two identical circles, we discover the cosine 
of s — ¢. (Subtracting angles is important.) In Figure 1.21b, the distance squared is 


d? = (change in x)* + (change in y)? 
= (cos s— cos t)? + (sin s— sin ¢)*. (2) 


Figure !.21c shows the same circle and triangle (but rotated). The same distance 
squared is 
d? = (cos(s— £)— 1)? + (sin(s— 2))*. (3) 


Now multiply out the squares in equations (2) and (3). Whenever (cosine)* + (sine)* 
appears, replace it by 1. The distances are the same, so (2) = (3): 


(=1+1-2cosscost—2sins sint 
Q=14+ 1-2 cos{s—t). 


31 


x =cos{s—1} 
y = sin(s -¢} 
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After canceling 1 + 1 and then —2, we have the “addition formula” for cos(s— t): 


The cosine of s — ¢ equals cos s cos t+ sin s sin f. (4) 


The cosine of s+ ¢ equals cos s cos t— sins sin ¢. (5) 


The easiest is t= 0. Then cos t = 1 and sin t = 0. The equations reduce to cos s = cos s. 

To go from (4) to (5) in all cases, replace t by — t. No change in cos ¢, but a “minus” 
appears with the sine. In the special case s=t, we have cos(t+t)= 
(cos t)(cos t} — (sin t)(sin t). This is a much-used formula for cos 2t: 


Double angle: cos 2t = cos? t — sin? t= 2 cos? — 1 = 1 — 2 sin? ¢. {6) 


I am constantly using cos? t + sin? = I, to switch between sines and cosines. 

We also need addition formulas and double-angle formulas for the sine of s—t 
and s +t and 2r. For that we connect sine to cosine, rather than (sine)? to (cosine). 
The connection goes back to the ratio y/r in our original triangle. This is the sine of 
the angle @ and also the cosine of the complementary angle x/2 — 0: 


sin 6 = cos (z/2 — 6) and cos # = sin{x/2 — 8). (7) 


The complementary angle is x/2 — 8 because the two angles add to x/2 (a right angle). 
By making this connection in Problem 19, formulas (4-5-6) move from cosines to 
sines: 


sin(s — t)=sin s cos t— cos ssint (8) 
sin{s + ¢)= sin s cos t+cos ssint (9) 
sin 2t=sin{t+t)=2 sin t cost (10) 


I want to stop with these ten formulas, even if more are possible. Trigonometry is 
full of identities that connect its six functions—basically because al] those functions 
come from a single right triangle. The x, y, r ratios and the equation x? + y? =r? can 
be rewritten in many ways. But you have now seen the formulas that are needed by 
calculus.t They give derivatives in Chapter 2 and integrals in Chapter 5. And it is 
typical of our subject to add something of its own—a limit in which an angle 
approaches zero. The essence of calculus is in that limit. 


Review of the ten formulas Figure 1.22 shows d? =(0— 4)? + (1 - f3/2)?. 


cos ~ =cos — cos ~ +sin ~ sin ~ (s—t) sin = = sin ~ cos ~ — cos ~ sin = 
6 2 3 2 3 6 2 3 2 3 

cos a eet sin ~ sin = (s+r) iqiee Sai cn coe che 
6 2 3 2 3 6 2 3 2 3 
7 fd n n nt 1 
?-= : oes + 2 + Pe as al 

cos 3 cos 3 sin 3 (21) sin 25 2 sin; 008 5 
mK ae 2 See 

cos 63m 3 Jf 3/2 (5 0) sin ; cos 3 1/2 


tCalculus turns (6) around to cos? 7 = 3(1 + cos 2t) and sin? 1 = $(1 — cos 2r). 
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0, 1 1 NB q I 

d 9° 2 s=y s-te4 
= 2 6 1 
_ _ 5m ie, 

a) t= 3 S+f 6 

/\ tay 
ov \ 1,0 
Fig. 4.22 


4.5 EXERCISES 


Read-througb questions 


Starting witha __o@ __ triangle, the six basic functions are the 
_6b _ of the sides. Two ratios {the cosine x/r and the __¢ _) 
are below 1. Two ratios {the secant r/x and the _d__) are 
above 1, Two ratios (the __e and the__f__} can take any 
value. The six functions are defined for all angles #, by chang- 


ing from a triangle toa_g . 


The angle 6 is measuredin__h_. Afullcircleis@=__t _, 
when the distance around is 2xr. The distance to angle @ is 

|__. All six functions have period _k_. Going clockwise 
changes the signof@and__! and __m_. Since cos{(—@)= 
cos 9, the cosine is__n__. 


Coming from x?+y?=7? are the three identities 
sin?@4+cos*?*@=1and_o and_p_. (Divide by r? and 

a_ and r_.) The distance from (2, 5) to (3, 4) is 
d=_ 8 __. The distance from (1, 0} to (cos(s—t}, sin{s —#)}) 
leads to the addition formula cos(s—t}=__t__. Changing 
the sign of t gives cos(s+t)=_u__. Choosing s=t gives 
cos2t=_¥ or _w_. Therefore 4{1+cos2s}=_ x , 
a formula needed in calculus. 


1 In a 60-60-60 triangle show why sin 30° =}. 


2 Convert x, 3x, —nx/4 to degrees and 60°, 90°, 270° to 
radians. What angles between 0 and 2x correspond to 
0 = 480° and 6 = —1°? 


3 Draw graphs of tan @ and cot @ from 0 to 2x. What is their 
(shortest) period? 


4 Show that cos 28 and cos? @ have period and draw them 
on the same graph. 


5 At ?= 32/2 compute the six basic functions and check 
cos? 6 + sin? 6, sec? 6 — tan? 0, csc” 8 — cot? 8. 

6 Prepare a table showing the values of the six basic func- 
tions at @ = 0, 1/4, n/3, 2/2, x. 

7 The area of a circle is xr?. What is the area of the sector 
that has angle @7 It is a fraction of the whole area. 


8 Find the distance from (1, 0} to (0, 1) along {a} a straight 
line (b) a quarter-circle (c} a semicircle centered at (4, 4). 


9 Find the distance d from (1, 0) to (4, \/3/2) and show on 
a circle why 6d is less than 2n. 


10 In Figure 1.22 compute d? and {with calculator) 12d. Why 
is 12d close to and below 2x? 


11 Decide whether these equations are true or false: 


sin 9 1+ cos # 


(a) 1—cos6 sin 
secO+cesc@ 
(b} tnJlcte @+cos # 


(c) cos 6 —sec 9 =sin @ tan 8 
(d) sin(2x — 8}=sin @ 


12 Simplify sin {x — 0), cos (x — 9}, sin(x/2 + 0), cos (x/2 + 8). 


13 From the formula for cos(2t + t) find cos 3¢ in terms of 
cos ft. 


14 From the formula for sin(2# +2) find sin 3t in terms of 
sin t. 


15 By averaging cos(s — ft) and cos{s +t} in (4—5) find a for- 
mula for cos s cos ¢. Find a similar formula for sin s sin ¢. 


16 Show that (cos ¢ +i sin ¢)? =cos 2t +i sin 21, if i? = —1, 


17 Draw cos @ and sec @ on the same graph. Find all points 
where cos ? = sec 8. 


18 Find all angles s and ¢ between 0 and 2x where ain (s + t} = 
sin s + sin f. 


19 Camplementary angles have sin @ = cos(n/2 0). Write 
sin(st+if) as cos{n/2—s—t) and apply formula (4} 
with n/2—s instead of s. In this way derive the addition 
formula (9). 


20 If formula (9) is true, how do you prove {8}? 


21 Check the addition formulas (4-5) and (8-9) for 
=t=n7/4. 


22 Use (5) and (9) to find a formula for tan (s + ¢). 
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In 23-28 find every @ that satisfies the equation. (1) show that the side PQ has length 

23 sin O=—1 24 sec @=—2 d? =a? +b? —2ab cos 6 (law of cosines). 

25 sin 6 =cos 0 26 sin 8 =@ 32 Extend the same triangle to a parallelogram with its fourth 
27 sec? @ + csc? @=1 28 tan 6=0 corner at R =(a+ b cos 0, & sin 6). Find the length squared of 


the other diagonal OR. 
29 Rewrite cos 6+ sin @ as fd sin(@ + } by choosing the 
correct “phase angle” @. (Make the equation correct at Draw graphs for equations 33—36, and mark three points, 
@=0. Square both sides to check.) 


33 y=sin 2x 34 y=2 sin ax 
30 Match asin x + b cos x with A sin(x + $). From equation bat =e 
(9) show that a= A cos ¢ and b=A sin ¢. Square and add to 35 y=} cos 2nx 58 Yr St ROB 
find A = . Divide to find tan ¢ = b/a. 37 Which of the six trigonometric functions are infinite at 


? 
31 Draw the base of a triangle from the origin 0 = (0, 0} to what 'aneks! 


P =(a, 0). The third corner is at 0 =(b cos @, bsin 8). What 38 Draw rough graphs or compuler graphs of tsint and 
are the side lengths OP and OQ? From the distance formula sin 4¢ sin t from 0 to 2z. 


1.6 A Thousand Points of Light ‘RE 


The graphs on the back cover of the book show y=sinn. This is very different 
from y= sin x. The graph of sin x is one continuous curve. By the time it reaches 
x = 10,000, the curve has gone up and down 10,000/2x times. Those 1591 oscillations 
would be so crowded that you couldn’t see anything. The graph of sin » has picked 
10,000 points from the curve—and for some reason those points seem to lic on more 
than 40 separate sine curves. 

The second graph shows the first 1000 points. They don’t seem to lie on sine curves. 
Most people see hexagons. But they are the same thousand points! It is hard to believe 
that the graphs are the same, but I have learned what to do. Tilt the second graph 
and look from the side at a narrow angle. Now the first graph appears. You see 
“diamonds.” The narrow angle compresses the x axis—back to the scale of the first 
graph. 
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The effect of scale is something we don’t think of. We understand it for maps. 
Computers can zoom in or zoom out—those are changes of scale. What our eyes see 


4.6 A Thousand Points of Light 


depends on what is “‘close.”” We think we see sine curves in the 10,000 point graph, 
and they raise several questions: 


1. Which points are near (0, 0)? 
2. How many sine curves are there? 
3. Where does the middle curve, going upward from (0, 0), come back to zero? 


A point near (0, 0) really means that sin x is close to zero. That is certainly not true 
of sin 1 (1 is one radian!), In fact sin | is up the axis at .84, at the start of the seventh 
sine curve. Similarly sin 2 is .91 and sin 3 is .14. (The numbers 3 and .14 make us 
think of x. The sine of 3 equals the sine of x — 3. Then sin .14 is near .14.) Similarly 
sin 4, sin 5, ..., sin 21 are not especially close to zero. 

The first point to come close is sin 22. This is because 22/7 is near x. Then 22 is 
close to 72, whose sine is zero: 


sin 22 = sin(7a — 22) = sin(- .01) = — .01. 


That is the first point to the right of (0, 0) and slightly below. You can see it on 
graph |, and more clearly on graph 2. It begins a curve downward. 
The next point to come close is sin 44. This is because 44 is just past 147. 


44 = l4xn + .02 so sin 44 = sin .02 + .02. 


This point (44, sin 44) starts the middle sine curve. Next is (88. sin 88). 

Now we know something. There are 44 curves. They begin near the heights sin 0, 
sin 1, ..., sin 43. Of these 44 curves, 22 start upward and 22 start downward. I was 
confused at first, because I could only find 42 curves. The reason is that sin 11 equals 
— 0.99999 and sin 33 equals .9999. Those are so close to the bottom and top that you 
can’t see their curves. The sine of 11 is near —1 because sin 22 is near zero. It is 
almost impossible to follow a single curve past the top—coming back down it is not 
the curve you think it is. 

The points on the middle curve are at n = 0 and 44 and 88 and every number 44N. 
Where does that curve come back to zero? In other words, when does 44N come 
very close to a multiple of x? We know that 44 is 142+ .02. More exactly 44 is 
t4x + .0177. So we multiply .0177 until we reach z: 


if N=7nj.0177 then 44N=(14x+ 0177)N = l4nN +x, 


This gives N = 177.5. At that point 44N = 7810. This is half the period of the sine 
curve. The sine of 7810 is very near zero. 

Tf you follow the middle sine curve, you will see it come back to zero above 7810. 
The actual points on that curve have n= 44-177 and n= 44-178, with sines just 
above and below zero. Halfway between is n = 7810. The equation for the middle sine 
curve is y = sin(zx/7810). Its period is 15,620—-beyond our graph. 


Question The fourth point on that middle curve looks the same as the fourth point 

coming down from sin 3. What is this ‘‘double point?” 

Answer 4 times 44 is 176. On the curve going up, the point is (176, sin 176). On the 

curve coming down it is (179, sin 179). The sines of 176 and 179 differ only by 00003. 
The second graph spreads out this double point. Lock above 176 and 179, at the 

center of a hexagon. You can follow the sine curve all the way across graph 2. 


Only a little question remains. Why does graph 2 have hexagons? I don’t know. 
The problem is with your eyes, To understand the hexagons, Doug Hardin plotted 
points on straight lines as well as sine curves. Graph 3 shows y = fractional part of 
n/2x. Then he made a second copy, turned it over, and placed it on top. That 
produced graph 4—with hexagons. Graphs 3 and 4 are on the next page. 
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This is called a Moiré pattern. If you can get a transparent copy of graph 3, and 
turn it slowly over the original, you will see fantastic hexagons. They come from 
interference between periodic patterns—in our case 44/7 and 25/4 and 19/3 are near 
2x. This interference is an enemy of printers, when color screens don’t line up. It can 
cause vertical lines on a TY, Also in making cloth, operators get dizzy from seeing 
Moire patterns move. There are good applications in engineering and optics —but 
we have to get back to calculus. 


Rs 861.72 Computingin Calculus iy 


Software is available for calculus courses—a lot of it. The packages keep getting 
better. Which program to use (if any) depends on cost and convenience and purpose. 
How to usc it is a much harder question. These pages identify some of the goals, and 
also particular packages and calculators. Then we make a beginning (this is still 
Chapter 1) on the connection of computing to calculus. 

The discussion wil] be informal. [t makes no sense to copy the manua). Our aim 
is to Support, with examples and information, the effort to use computing to help 
Icarning. 

For calculus, the greatest advantage of the computer is to offer graphics, You see 
the function, not just the formula. As you watch, f(x) reaches a maximum or a 
minimum or zero. A separate graph shows its derivative. Those statements are not 
100% true, as everybody learns right away as soon as a few functions are typed in, 
But the power to see this subject is enormous, because it is adjustable. If we don’t 
like the picture we change to a new viewing window. 


This is computer-based graphics. It combines mumericaf computation with 
graphical computation. You get pictures as well as numbers—a powerful combination. 
The computer offers the experience of actually working with a function. The domain 
and range are not just abstract ideas. You choose them. May I give a few examples. 


EXAMPLE 4 Certainly x? equals 3° when x = 3. Do those graphs ever meet again’ 
At this point we don’t know the full meaning of 3*, except when x is a nice number. 
(Neither does the computer.) Checking at x = 2 and 4, the function x? is smaller 
both times: 27 is below 3? and 4° = 64 is below 3*= 81. If x? is always less than 3” 
we ought to know—these are among the basic functions of mathematics. 
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The computer will answer numerically or graphically. At our command, it solves 
x? = 3*. At another command, it plots both functions—this shows more. The screen 
proves a point of logic (or mathematics) that escaped us. If the graphs cross once, 
they must cross again— because 3* is higher at 2 and 4. A crossing point near 2.5 is 
seen by zooming in. I am less interested in the exact number than its position—it 
comes before x = 3 rather than after. 

A few conclusions from such a basic example: 


1. A supercomputer is not necessary. 
2. High-level programming is not necessary. 
3. We can do mathematics without completely understanding it. 


The third point doesn’t sound so good. Write it differently: We can learn mathematics 
while doing it. The hardest part of teaching calculus is to turn it from a spectator 
sport into a workout. The computer makes that possible. 


EXAMPLE 2 (mental computer) Compare x? with 2*. The functions meet at x = 2. 
Where do they meet again? Is it before or after 2? 


That is mental computing because the answer happens to be a whole number (4). 
Now we are on a different track. Does an accident like 2* = 4? ever happen again? 
Can the machine tell us about integers? Perhaps it can plot the solutions of x° = 6*. 
l asked Mathematica for a formula, hoping to discover x as a function of b—~but the 
program just gave back the equation. For once the machine typed HELP irstead of 
the user. 

Well, mathematics is not helpless. I am proud of calculus. There is a new exercise 
at the end of Section 6,4, to show that we never see whole numbers again. 


EXAMPLE 3 Find the number b for which x® = b* has only one solution(at x = 5). 


When 6 is 3, the second solution is below 3. When 6 is 2, the second solution (4) is 
above 2. If we move b from 2 to 3, there must be a special “double point’’—where 
the graphs barely touch but don’t cross. For that particular )—and only for that 
one value—the curve x never goes above 6”. 

This special point 6 can be found with computer-based graphics. In many ways it 
is the “center point of calculus.” Since the curves touch but don't cross, they are 
tangent. They have the same slope at the double point. Calculus was created to work 
with slopes, and we already know the slope of x”. Soon comes x’. Eventually we 
discover the slope of 6”, and identify the most important number in calculus. 

The point is that this number can be discovered first by experiment. 


EXAMPLE 4 Graph y(x) = e* — x*. Locate its minimum. 


The next example was proposed by Don Small. Solve x* — 11x° + 5x — 2=0. The 
first tool is algebra—try to factor the polynomial. That succeeds for quadratics, and 
then gets extremely hard. Even if the computer can do algebra better than we can, 
factoring is seldom the way to go. In reality we have two good choices: 


1. (Mathematics) Use the derivative. Solve by Newton’s method. 
2. (Graphics) Plot the function and zoom in. 


Both will be done by the computer. Both have potential problems! Newton’s method 
is fast, but that means it can fail fast. (It is usually terrific.) Plotting the graph is also 
fast—but solutions can be outside the viewing window. This particular function is 
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zero only once, in the standard window from —10 to 10. The graph seems to be 
leaving zero, but mathematics again predicts a second crossing point. So we zoom 
out before we zoom in. 

The use of the zoom is the best part of graphing. Not only do we choose the domain 
and range, we change them. The viewing window is controlled hy four numbers. They 
can be the limits Ag x<B and C<y<D. They can be the coordinates of two 
opposite corners: (A, C) and (B, D). They can be the center position (a, b) and the 
scale factors c and d. Clicking on opposite corners of the zoom box is the fastest way, 
unless the center is unchanged and we only need to give scale factors. (Even faster: 
Use the default factors.) Section 3.4 discusses the centering transform and zoom 
transform—a change of picture on the screen and a change of variable within the 
function. 


EXAMPLE 5 Find all real solutions to x* — 11x3 + 5x —2=0. 


EXAMPLE 6 Zoom out and in on the graphs of y=cos40x and y= x sin(1/x). 
Describe what you see. 


EXAMPLE 7 What does y =(tan x — sin x)/x*° become at x=0? For small x the 
machine eventually can’t separate tan x from sin x. It may give y= 0. Can you get 
close enough to see the limit of y? 


For these examples, and for most computer exercises in this book, a menu-driven 
system is entirely adequate. There is a list of commands to choose from. The user 
provides a formula for (x), and many functions are built in. A calculus supplement 
can be very useful— MicroCale or True BASIC or Exploring Calculus or MPP (in 
the public domain). Specific to graphics are Surface Plotter and Master Grapher and 
Gyrographics (animated). The best software for linear algebra is MATLAB. 

Powerful packages are increasing in convenience and decreasing in cost. They are 
capable of symdolic computation— which opens up a third avenue of computing in 
calculus. 


SYMBOLIC COMPUTATION 


In symbolic computation, answers can be formulas as well as numbers and graphs. 
The derivative of y = x? is seen as “2x.” The derivative of sin ¢ is “cos ¢.” The slope 
of b* is known to the program. The computer does more than substitute numhers 
into formulas—it operates directly on the formulas. We need to think where this fits 
with learning calculus. 

In a way, symbolic computing is close to what we ourselves do. Maybe too close— 
there is some danger that symbolic manipulation is al! we do. With a higher-level 
language and enough power, a computer can print the derivative of sin(x?). So why 
learn the chain rule? Because mathematics goes deeper than “algebra with formulas.” 
We deal with ideas. 

I want to say clearly: Mathematics is not formulas, or computations, or even proofs, 
but ideas. The symbols and pictures are the language. The book and the professor 
and the computer can join in teaching it. The computer should be non-threatening 
(like this book and your professor)—you can work at your own pace. Your part is 
to learn by doing. 


EXAMPLES A computer algebra system quickly finds 100 factorial. This is 100! = 
(100)(99)(98) ... {1}. The number has 158 digits (not written out here). The last 24 
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digits are zeros. For 10! = 3628800 there are seven digits and two zeros. Between 10 
and 100, and beyond, are simple questions that need ideas: 


1, How many digits (approximately) are in the number N!? 
2. How many zeros (exactly) are at the end of N!? 


For Question 1, the computer shows more than N digits when N = 100. It will never 
show more than N? digits, because none of the N terms can have more than N digits. 
A much tighter bound would be 2N, but is it true? Does N! always have fewer than 
2N digits? 

For Question 2, the zeros in 10! can be explained. One comes from 10, the other 
from 5 times 2. (10 is also 5 times 2.) Can you explain the 24 zeros in 100!? An idea 
from the card game blackjack applies here too: Count the fives. 

Hard question: How many zeros at the end of 200!? 


The outstanding package for full-scale symbolic computation is Mathematica. It 
was used to draw graphs for this book, including y= sin n on the back cover. The 
complete command was ListPlot[Table [Sin[n], {n, 10000}]]. This system has rewards 
and also drawbacks, including the price. Its original purpose, like MathCAD and 
MACSYMA and REDUCE, was not to teach calculus—but it can. The computer 
algebra system MAPLE is good. 

As I write in 1990, DERIVE is becoming well established for the PC. For the 
Macintosh, Caiculus T/L is a “sleeper” that deserves to be widely known. It builds 
on MAPLE and is much more accessible for calculus. An important alternative is 
Theorist. These are menu-driven (therefore easier at the start) and not expensive. 

I strongly recommend that students share terminals and work together. Two at a 
terminal and 3-5 in a working group seems to be optimal. Mathematics can be 
learned by talking and writing—it is a human activity. Our goal is not to test but to 
teach and learn. 


Writing in Calculus May | emphasize the importance of writing? We totally miss it, 
when the answer is just a number. A one-page report is harder on instructors as weil 
as students—but much more valuable. A word processor keeps it neat. You can’t 
write sentences without being forced to organize ideas—and part of yourself goes 
into it. 

I will propose a writing exercise with options. If you have computer-based graph- 
ing, follow through on Examples 1-4 above and report. Without a computer, pick a 
paragraph from this book that should be clearer and make it clearer. Rewrite it with 
examples. Identify the key idea at the start, explain it, and come back to express it 
differently at the end. Ideas are like surfaces—they can be seen many ways. 


Every reader will understand that in software there is no last word. New packages 
keep coming {Analyzer and EPIC among them). The biggest challenges at this 
moment are three-dimensional graphics and calculus workbooks. In 3D, the problem 
is the position of the eye—since the screen is only 2D. In workbooks, the problem is 
to get past symbol manipulation and reach ideas. Every teacher, including this one, 
knows how hard that is and hopes to help. 


GRAPHING CALCULATORS 


The most valuable feature for calculus—computer-based graphing—is available on 
hand calculators. With trace and zoom their graphs are quite readable. By creating 
the graphs you subconsciously learn about functions. These are genuinely personal 
computers, and the following pages aim to support and encourage their use. 
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Programs for a hand-held machine tend to be simple and short. We don't count 
the zeros in 100 factorial (probably we could). A calculator finds crossing points and 
maximum points to good accuracy. Most of all it allows you to explore calculus by 
yourself. You set the viewing window and define the function. Then you see it. 


There is a choice of calculators—which one to buy? For this book there was also 
a choice—which one to describe? To provide you with listings for useful programs, 
we had to choose. Fortunately the logic is so clear that you can translate the instruc- 
tions into any language—for a computer as well as a calculator. The programs given 
here are the “greatest common denominator” of computing in calculus, 

The range of choices starts with the Casio {x 7000G—the first and simplest, with 
very limited memory but a good screen. The Casio 7500, 8000, and 8500 have increas- 
ing memory and extra features. The Sharp F£f.-5200 (or 9000 in Canada and Europe} 
is comparable to the Casio 8000. These machines have algebraic entry—the normal 
order as in y= x+ 3, They are inexpensive and good. More expensive and much 
more powerful are the Hewlett-Packard calculators—the HP-28S and HP-48SX. 
They have large memories and extensive menus (and symbolic algebra}. They use 
reverse Polish notation—numbers first in the stack, then commands. They require 
extra time and effort, and other books do justice to their amazing capabilities. [t is 
estimated that those calculators could get 95 on a typical calculus exam. 


While this book was being written, Texas Instruments produced a new graphing 
calculator: the T/-§1. It is closer to the Casio and Sharp (emphasis on graphing, easy 
to learn, no symbolic algebra, moderate price). With earlier machines as a starting 
point, many improvements were added. There is some risk in a choice that is available 
only Ar before this textbook is published, and we hope that the experts we asked are 
right. Anyway, our programs are for the T1-81. It is impressive. 


These few pages arc no substitute for (he manual that comes with a calculator. A 
valuable supplement is a guide directed especially at calculus-—my absolute favorites 
are Calculus Activities for Graphic Calculators by Dennis Pence (P WS-Kent, 1990 for 
the Casio and Sharp and HP-28S, 199t for the T/-81). A series of Calculator Enhance- 
ments, using HP's, is being published by Harcourt Brace Jovanovich. What follows 
is an introduction to one part of a calculus laboratory. Later in the book, we supply 
TI-81 programs close to the mathematics and the exercises that they are prepared 
for. 


A few words to start: To select from a menu, press the item number and ENTER. 
Edit a command line using DEL(ete) and INS(ert). Every line ends with 
ENTER. For calculus select radians on the MODE screen. For powers use ©. For 
special powers choose x*, x°', \/x. Multiplication has priority, so (-)3+ 2 2 
produces |. Use keys for SIN, LF, 1S, ... When you press letters, I multipiies $. 

If a program says 3 > C, type 3 STO C ENTER. Storage locations are A to Z 
or Greek @. 


Functions A graphing calculator helps you (forces you?) to understand the concept 
of a function. It also helps you to understand specific functions—especially when 
changing the viewing window. 

To evaluate » = x? — 2x just once, use the home sercen. To define y(x) for repeated 
use, move to the function edit screen: Press MODE. choose Function, and press 
Y=. Then type in the formula. Important tip: for X on the TI-81, the key X|T is faster 
than two steps Alpha X. The Y= edit screen is the same place where the formula 
is needed for graphing. 
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Example ¥1=X2-2X ENTER onthe Y= screen. 4 STO X ENTER on the home 
screen. Y1 ENTER on the Y-VARS screen. The screen shows 8, which is ¥(4). The 
formula remains when the calculator is off. 


Graphing You specify the X range and Y range. {We should say X domain but we 
don’t.) The screen is a grid of 96 x 64 little rectangles called “pixels.” The first column 
of pixels represents Xmin and the last column is Xmax. Press RANGE to reset. 
With Xr es = 1 the function is evaluated 96 times as it is graphed. Xsc l and Yscl 
give the spaces between ticks on the axes. 


The ZOOM menu is a fast way to set ranges. Z00M Standard gives the default 
-10<x<10, -10<y<10. ZOOM Trig gives -2n<x<2n, —-3<y<3. 
The keystroke GRAPH shows the graphing screen with the current functions. 


Example Set the ranges (-)2<X <3 and (-)150< Y <50. Press Y= and store 
Y4 =X (in MATH)? — 28X? + 15X +36 ENTER. Press GRAPH. You won’t see 
much of the graph! Press RANGE and reset (—)10 < X < 30,(—)4000 < Y < 2300. 
Press GRAPH. See a cubic polynomial. 

“Smart Graph” recalls the graph instantly without redrawing it, if no settings have 
changed. The DRAW menu is for points, lines, and shaded regions. This is perfect for 
our piecewise linear functions—just connect the breakpoints with lines. In Section 3.6 
the lines show an iteration by its “cobweb.” 


Programming This book contains programs that you can type in once and save. 
We chose Autoscaling, Newton’s Method, Secant Method, Cobweb Iteration, and 
Numerical Integration. You will create others—to do calculations or to add features 
that are not available as single keystrokes. The caiculator is like a computer, with a 
fairly small set of instructions. One difference: Memory is too precious to store com- 
ments with the code. You have to sec the logic by rereading the program. 

To enter the world of programming, press PRGM. Each PRGM submenu lists ali 
programs by name—a digit, a letter, or @ (37 names). The program title has up to 
eight characters. Select the ED 1T submenu and press G for the edit screen. Type the 
title GRAPHS and press ENTER. Practice on this one: 


7"xX2+X" STO CY-VARS) Y1 ENTER 
:"X-1" STO CY-VARS) Yo ENTER 
> CPRGM) (1/0) DispGraph 


The menus to call are in parentheses. Leave the edit screen with QUIT (not 
CLEAR—that erases the line with the cursor). Set the default window by ZOOM 
Standard, 

To execute, press PRGM CEXEC) G ENTER. The program draws the graphs. It 
leaves Y] and Y? on the Y= screcn. To erase the program from the home screen, 
press (PRGM)(ERASE)G. Practice again by creating Prgm2:FUNC. Type 
a X STO Y and : (PRGM) (1/0)Disp Y. Move to the home screen, store 
X by 4 STO X ENTER, and execute by (PRGM) CEXEC) 2 ENTER. Also try 
X =— 1. When it fails to imagine i, select 1: Goto Error. 


Piecewise functions and Input (to a running program). The definition of a piecewise 
function includes the domain of each piece. Logical tests like "I F X 27" determine 
which domain the input value X fails into. An I F statement only affects the following 
line—which is executed when TES T = | (meaning true) and skipped when TEST =0. 
(meaning false). IF commands are in the PRGM (CTL) submenu; TEST calls the 
menu of inequalities. 
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An input value X =4 need not be stored in advance. Program P stops while 
running to request input. Execute with P ENTER after selecting the PRGM CEXEC) 
menu. Answer ? with 4 and ENTER. After completion, rerun by pressing ENTER 
again. The function is y= 14—-xifx<7,y=xifx2?. 


PrgmP: PIECES 


:Disp "X=" PGRM (1/0) Ask for input 
:Input X PGRM (1/0) Screen ? ENTER X 
714-X > Y First formula for all X 

:If 7<X PRGM (CTL) TEST 

:X-Y Overwrite if TEST =1 

Disp Y Display ¥(X) 


Overwriting is faster than checking both cnds A < X < B for each piece. Even faster: 
a whole formula (14 — X)(X < 7) + (X)(7 < X) can go on a single line using | and 0 
from the tests. Compute-store-display Y(X) as above, or define Y 7 on the edit screen. 


Exercise Define a third piece Y= 8+ X if X < 3. Rewrite P using Y4 =. A product 
of tests (3 <X) (X < 7) evaluates to 1 if all true and to 0 if any false. 


TRACE and ZOOM The best featurc ts graphing. But a whole graph can be like a 
whole book—too much at once. You want to focus on one part. A computer or 
calculator will trace along the graph, stop at a point, and zoom in. 

There is also Z00M OUT, to widen the ranges and see more. Our eyes work the 
same way—they put together information on different scales. Looking around the 
room uses an amazingly large part of the human brain. With a big enough computer 
we can try to tmitate the eyes—this is a key problem in artificial intelligence. With 
a small computer and a zoom feature, we can use our eyes to understand functions. 


Press TRACE to locate a point on the graph. A blinking cursor appears. Move 
left or right—the cursor stays on the graph. Its coordinates appear at the bottom of 
the screen. When x changes by a pixel, the calculator evaluates }(x). To solve y(x) = 0, 
read off x at the point when ¥ is nearest to zero. To minimize or maximize y{x), read 
off the smallest and largest y. In all these problems, zoom in for more accuracy. 

To blow up a figure we can choose new ranges. The fast way is to use a ZOOM 
command, For a preset range, use 200M Standard or ZOOM Trig. To shrink 
or stretch by XFact or YFact (default values 4), use ZOOM In or ZOOM Out. 
Choose the center point and press ENTER, The new graph appears. Change those 
scaling factors with ZOOM Set Factors. Best of all, create your own viewing 
window, Press ZOOM Box. 

To draw the box, move the cursor to one corner. Press ENTER and this point is 
a small square. The same keys move a second (blinking) square to the opposite 
corner—the box grows as you move. Press ENTER, and the box is the new viewing 
window. The graphs show the same function with a change of scale. Section 3.4 will 
discuss the mathematics—here we concentrate on the graphics. 


EXAMPLE9 Place :¥7=X sin (1/X) inthe Y= editscreen. Press ZOOMTrig 
for a first graph. Set XFact =land YFact =2.5. Press 200M In with center at 
(0, 0). To see a larger picture, use XFact =10and YFact=1. Then Zoom Out 
again. As X gets large, the function X sin(1/X} approaches 

Now return to ZOOM Trig. Zoom Ln with the factors set to 4 (default). Zoom 
again by pressing ENTER. With the center and the factors fixed, this is faster than 
drawing a zoom box. 
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EXAMPLE 10 Repeat for the more erratic function Y = sin{1/X). After Z00M Trig, 
create a box to see this function near X = .01. The Y range is now 


Scaling is crucial. For a new function it can be tedious. A formula for y(x) does 
not easily reveal the range of y’s, when A < x < Bis given. The following program is 
often more convenient than zooms. It samples the function L = 19 times across the 
x-range (every 5 pixels). The inputs Xmin, Xmax, Y, are previously stored on other 
screens. After sampling, the program sets the y-range from C = Ymin to D = Ymax 
and draws the graph. 

Notice the Joop with counter K. The loop ends with the command IS > (K,L), 
which increases K by 1 and skips a line if the new K exceeds L. Otherwise the 
command Goto 1 restarts the loop, The screen shows the short form on the left. 


Example: Y4 =x°+10x?-—7x+42 with range Xmin=—12 and Xmax = 10. 
Set tick spacing Xscl=4 and Yscl = 250. Execute with PRGM CEXEC) A 
ENTER. For this program we also list menu locations and comments. 


PrgmA :AUTOSCL Menu (Submenu) Comment 


:ALL-Off Y-VARS (OFF) Turn off functions 
iXmin7A VARS (RNG) Store Xmin using STO 
r193L Store number of evaluations (19) 
:(Xmax-A)}/LAH Spacing between evaluations 

sA>X Start at x= A 

r¥qoc Y-VARS CY) Evaluate the function 
:C-+0 Start C and D with this value 

21-K Initialize counter K = 1 

sLbL 1 PRGM (CTL? Mark loop start 
:A+KH>X Calculate next x 

:¥17Y Evaluate function at x 

IF Y<C PGRM (CTL) New minimum? 
7Y>C Update C 

IF O<Y PRGM (CTL) New maximum? 
:Y70 Update D 

:IS>(K,L) PRGM (CTL) Add | to K, skip Goto if >L 
:Goto 1 PRGM (CTL) Loop return to Lbl 1 
2¥47On Y-VARS (ON) Tum on Y1 
:€>Ymin STO VARS CRNG) Set Ymin=C 
:0 > Ymax STO VARS (RNG) Set Ymax =D 


:DispGraph PRGM (1/0) Generate graph 
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CHAPTER 2 


Derivatives 


Rs =§=62.1 The Derivative of a Function 


This chapter begins with the definition of the derivative. Two examples were in 
Chapter 1. When the distance is t?, the velocity is 2t. When f(t)=sin t we found 
v(t) = cos t. The velocity is now called the derivative of f(t). As we move to a more 
formal definition and new examples, we use new symbols f’ and df/dt for the 
derivative. 


2A At time ¢, the derivative f'(t) or df /dt or v(t) is 


f(t+ At -f) 
At : 


f= jim 


The ratio on the right is the average velocity over a short time At. The derivative, on 
the left side, is its limit as the step At (delta t) approaches zero. 

Go slowly and look at each piece. The distance at time t+ At is f(t + At). The 
distance at time t is f(t). Subtraction gives the change in distance, between those 
times. We often write Af for this difference: Af= f(t + At) — f(t). The average velocity 
is the ratio Af /At—change in distance divided by change in time. 

The limit of the average velocity is the derivative, if this limit exists: 


df... Af 
dt a At" 
This is the neat notation that Leibniz invented: Af/At approaches df /dt. Behind the 
innocent word “limit” is a process that this course will help you understand. 
Note that Af is not A times f! It is the change in f. Similarly At is not A times t. 
It is the time step, positive or negative and eventually small. To have a one-letter 
symbol we replace At by h. 


(2) 


The right sides of (1) and (2) contain average speeds. On the graph of f(t), the 
distance up is divided by the distance across. That gives the average slope Af/At. 
The left sides of (1) and (2) are instantaneous speeds df /dt. They give the slope at 
44 the instant t. This is the derivative df/dt (when At and Af shrink to zero). Look again 
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at the calculation for f(t) = t?: 


Af _ f(t+At)—f@)_ t?+2t At + (At)? —t? 

re V i =2t+ At. (3) 
Important point: Those steps are taken before At goes to zero. If we set At =0 too 
soon, we learn nothing. The ratio Af/At becomes 0/0 (which is meaningless). The 
numbers Af and At must approach zero together, not separately. Here their ratio is 
2t + At, the average speed. 

To repeat: Success came by writing out (t + At)? and subtracting t? and dividing 

by At. Then and only then can we approach At = 0. The limit is the derivative 21. 


There are several new things in formulas (1) and (2). Some are easy but important, 
others are more profound. The idea of a function we will come back to, and the 
definition of a limit. But the notations can be discussed right away. They are used 
constantly and you also need to know how to read them aloud: 


f(t)=“f of t” =the value of the function f at time t 
At = “delta ¢” = the time step forward or backward from t 
f(t+ At) =“‘f of t plus delta t” =the value of f at time t+ At 
Af= “delta f” =the change f(t + At) — f(t) 
Af/At = ‘delta f over delta t’’ = the average velocity 
S'®)=“f prime of t” = the value of the derivative at time t 
df /dt =“‘dfdt” =the same as f’ (the instantaneous velocity) 


lim = “limit as delta t goes to zero” = the process that starts with 
“1° numbers Af/At and produces the number df/dt. 


From those last words you see what lies behind the notation df/dt. The symbol At 
indicates a nonzero (usually short) length of time. The symbol dt indicates an 
infinitesimal (even shorter) length of time. Some mathematicians work separately 
with df and dt, and df/dt is their ratio. For us df/dt is a single notation (don’t 
cancel d and don’t cancel A). The derivative df/dt is the limit of Af/At. When that 
notation df /dt is awkward, use f’ or v. 


Remark The notation hides one thing we should mention. The time step can be 
negative just as easily as positive. We can compute the average Af/At over a time 
interval before the time t, instead of after. This ratio also approaches df/dt. 

The notation also hides another thing: The derivative might not exist. The averages 
Af/At might not approach a limit (it has to be the same limit going forward and 
backward from time t). In that case f’(t) is not defined. At that instant there is no 
clear reading on the speedometer. This will happen in Example 2. 


EXAMPLE 4 (Constant velocity V = 2) The distance f is V times t. The distance at 
time t + At is V times t + At. The difference Af is V times At: 

Af  VAt df 

rVemer vies so the limit is an 
The derivative of Vt is V. The derivative of 2t is 2. The averages Af/At are always 
V = 2, in this exceptional case of a constant velocity. 


45 


46 


2 Derivatives 


EXAMPLE 2 Constant velocity 2 up to time t = 3, then stop. 


For small times we still have f(t) = 2t. But after the stopping time, the distance is 
fixed at f(t)= 6. The graph is flat beyond time 3. Then f(t + At)=f(t) and Af=0 
and the derivative of a constant function is zero: 


Snes gas FEA HI) oe OF 
cee me Ce he “) 

In this example the derivative is not defined at the instant when t = 3. The velocity 
falls suddenly from 2 to zero. The ratio Af/At depends, at that special moment, on 
whether At is positive or negative. The average velocity after time t = 3 is zero. The 
average velocity before that time is 2. When the graph of f has a corner, the graph 
of v has a jump. It is a step function. 

One new part of that example is the notation (df/dt or f’ instead of v). Please look 
also at the third figure. It shows how the function takes t (on the left) to f(t). Especially 
it shows At and Af. At the start, Af/At is 2. After the stop at t = 3, all t’s go to the 
same f(t)= 6. So Af=0 and df/dt =0. 


time distance 
6 
v=df/dt=f At 
5 t=3 Af 
FQ) not defined At 
t 
3 


Fig. 2.4 The derivative is 2 then 0. It does not exist at t = 3. 


THE DERIVATIVE OF 4/t 


Here is a completely different slope, for the “demand function” f(t) = 1/t. The demand 
is 1/t when the price is t. A high price t means a low demand 1/t. Increasing the price 
reduces the demand. The calculus question is: How quickly does 1/t change when t 
changes? The ‘‘marginal demand” is the slope of the demand curve. 

The big thing is to find the derivative of 1/t once and for all. It is —1/t?. 


al = sf t—(tt+At)  —At 
EXAMPLE3 /(t)= ; has Af= nea This equals aay (+ An e+ At)’ 
Af eT 


df 

approaches Ae 

Line 1 is algebra, line 2 is calculus. The first step in line 1 subtracts f(t) from 
f(t + At). The difference is 1/(t + At) minus 1/t. The common denominator is t times 
t+ At—this makes the algebra possible. We can’t set At = 0 in line 2, until we have 
divided by At. 

The average is Af/At= —1/t(t+ At). Now set At=0. The derivative is —1/t?. 
Section 2.4 will discuss the first of many cases when substituting At = 0 is not possible, 
and the idea of a limit has to be made clearer. 


. . i. — 71 
Divide by At and let At > 0: At «e+Ad 
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3 
At 
2 
1 1 
1 Af 
t f= = 


Fig. 2.2 Average slope is —4, true slope is —}. Increase in t produces decrease in f. 


Check the algebra at t= 2 and t + At = 3. The demand 1/t drops from 1/2 to 1/3. 
The difference is Af= — 1/6, which agrees with —1/(2)(3) in line 1. As the steps Af 
and At get smaller, their ratio approaches —1/(2)(2) = —1/4. 


This derivative is negative. The function 1/t is decreasing, and Af is below zero. The 
graph is going downward in Figure 2.2, and its slope is negative: 


An increasing f(t) has positive slope. A decreasing f(t) has negative slope. 


The slope —1/t? is very negative for small t. A price increase severely cuts demand. 

The next figure makes a small but important point. There is nothing sacred about t. 
Other letters can be used—especially x. A quantity can depend on position instead 
of time. The height changes as we go west. The area of a square changes as the side 
changes. Those are not affected by the passage of time, and there is no reason to use 
t. You will often see y = f(x), with x across and y up—connected by a function f. 

Similarly, f is not the only possibility. Not every function is named f! That letter 
is useful because it stands for the word function—but we are perfectly entitled to 
write y(x) or y(t) instead of f(x) or f(t). The distance up is a function of the distance 
across. This relationship “y of x” is all-important to mathematics. 

The slope is also a function. Calculus is about two functions, y(x) and dy/dx. 


Question If we add 1 to y(x), what happens to the slope? Answer Nothing. 
Question If we add 1 to the slope, what happens to the height? Answer 
The symbols t and x represent independent variables—they take any value they 


want to (in the domain). Once they are set, f(t) and y(x) are determined. Thus f and 
y represent dependent variables—they depend on t and x. A change At produces a 


‘Af negative 
1 Ay negative 


Fig. 2.3 The derivative of 1/t is —1/t?. The slope of 1/x is —1/x?. 
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change Af. A change Ax produces Ay. The independent variable goes inside the 
parentheses in f(t) and y(x). It is not the letter that matters, it is the idea: 


independent variable t or x 
dependent variable f or g or y or z or u 
derivative df/dt or df /dx or dy/dx or --- 


The derivative dy/dx comes from [change in y] divided by [change in x]. The time 
step becomes a space step, forward or backward. The slope is the rate at which y 
changes with x. The derivative of a function is its “rate of change.” 

I mention that physics books use x(t) for distance. Darn it. 

To emphasize the definition of a derivative, here it is again with y and x: 


Ay _ y(x+Ax)— y(x) distance up dy F Ay _ 
Ax Ax distance across dx ax+0 Ax 


y'(x). 


The notation y'(x) pins down the point x where the slope is computed. In dy/dx that 
extra precision is omitted. This book will try for a reasonable compromise between 
logical perfection and ordinary simplicity. The notation dy/dx(x) is not good; y’'(x) is 
better; when x is understood it need not be written in parentheses. 

You are allowed to say that the function is y= x? and the derivative is y’ = 2x— 
even if the strict notation requires y(x) =x? and y’(x)= 2x. You can even say that 
the function is x? and its derivative is 2x and its second derivative is 2—provided 
everybody knows what you mean. 


Here is an example. It is a little early and optional but terrific. You get excellent 
practice with letters and symbols, and out come new derivatives. 


EXAMPLE 4 If u(x) has slope du/dx, what is the slope of f(x) = (u(x))?? 


From the derivative of x? this will give the derivative of x*. In that case u = x? and 
f= x‘. First point: The derivative of u? is not (du/dx)?. We do not square the derivative 
2x. To find the “square rule” we start as we have to—with Af=f(x + Ax) — f(x): 


Af = (u(x + Ax))? — (u(x))? = [u(x + Ax) + u(x)] [u(x + Ax) — u(x)]. 


This algebra puts Af in a convenient form. We factored a? — b? into [a+ b] times 
[a—b]. Notice that we don’t have (Au)?. We have Af, the change in u?. Now divide 
by Ax and take the limit: 


Af 


—— = [u(x + Ax) + u(x)] ae 


Ax 


du 
7 | approaches 2u(x) ae (5) 


This is the square rule: The derivative of (u(x))’ is 2u(x) times dujdx. From the 
derivatives of x? and 1/x and sin x (all known) the examples give new derivatives. 


EXAMPLE 5 (u =x”) The derivative of x* is 2u du/dx = 2(x?)(2x) = 4x°. 
EXAMPLE 6 (u = 1/x) The derivative of 1/x? is 2u du/dx = (2/x)(—1/x?) = —2/x?. 


EXAMPLE 7 (u=sin x, du/dx =cos x) The derivative of u? =sin?x is 2sin x cos x. 


Mathematics is really about ideas. The notation is created to express those ideas. 
Newton and Leibniz invented calculus independently, and Newton’s friends spent a 
lot of time proving that he was first. He was, but it was Leibniz who thought of 
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writing dy/dx—which caught on. It is the perfect way to suggest the limit of Ay/Ax. 
Newton was one of the great scientists of all time, and calculus was one of the great 
inventions of all time—but the notation must help. You now can write and speak 
about the derivative. What is needed is a longer list of functions and derivatives. 


2.4 EXERCISES 


Read-through questions 


The derivative is the __a 
Here Af equals __¢ 


of Af/Atas At approaches __6__. 
. The step At can be positive or_d_. 

The derivative is written vor_e or_f If f(x)=2x+4+3 

and Ax=4 then Af=_g_. If Ax=—1 then Af=__h 

If Ax=0 then Af=_!_. The slope is not 0/0 but 


dfjdx=_ |. 


The derivative does not exist where f(t)hasa__k and 


v(t) has a ___|__. For f(t)=1/t the derivative is _m_. The 
slope of y=4/x is dy/dx =__n__. A decreasing function has 
a__o_ derivative. The __p_ variable is t or x and the 


q__ variable is f or y. The slope of y? (is) (is not) (dy/dx)?. 
The slope of (u(x)) is__t _ by the square rule. The slope of 
(2x+3) is _s 


1 Which of the following numbers (as is) gives df/dt at time 
t? If in doubt test on f(t) =t?. 


f(t + At)—f() . f(t + 2h) —fi(t) 
(a) At (eu 2h 

—At)— ; At) — 
() lim, S(t oY f(t) @) lim Ae f@® 


2 Suppose f(x) =x?. Compute each ratio and set h =0: 


a) f(x +h) -f(x) f(x + 5h) —f(x) 


( (b) 


h Sh 
Lx +h) -fe—h) f(x + D—flx) 
a a te aa 


3 For f(x) = 3x and g(x) = 1 + 3x, find f(4 + h) and g(4 + h) 
and f’(4) and g’(4). Sketch the graphs of f and g—why do 
they have the same slope? 


4 Find three functions with the same slope as f(x) = x?. 


5 For f(x) = 1/x, sketch the graphs of f(x) + 1 and f(x + 1). 
Which one has the derivative —1/x?? 


6 Choose c so that the line y = x is tangent to the parabola 
y=x? +c. They have the same slope where they touch. 


7 Sketch the curve y(x)=1—.x? and compute its slope at 
x=3. 


8 If f(t) = 1/t, what is the average velocity between t = 4 and 
t= 2? What is the average between t=4 and t=1? What 
is the average (to one decimal place) between t=4 and 
t = 101/200? 


9 Find Ay/Ax for y(x) =x + x. Then find dy/dx. 
10 Find Ay/Ax and dy/dx for y(x) = 1+ 2x + 3x?, 


11 When f(t)=4/t, simplify the difference f(¢+ At) —f(), 
divide by At, and set At=0. The result is f“(t). 

12 Find the derivative of 1/t? from Af(t) = 1/(t + At)? — 1/t?. 
Write Af as a fraction with the denominator t?(¢ + At)’. 
Divide the numerator by At to find Af/At. Set At = 0. 


13 Suppose f(t)=7t to t=1. Afterwards f(t) = 7+ 9(t— 1). 
(a) Find df/dt at t=4 and t =3. 
(b) Why doesn’t f(t) have a derivative at t = 1? 


14 Find the derivative of the derivative (the second derivative) 
of y= 3x7. What is the third derivative? 


15 Find numbers A and B so that the straight line y = x fits 
smoothly with the curve Y= A+ Bx + x? at x = 1. Smoothly 
means that y= Y and dy/dx =dY/dx at x =1. 


16 Find numbers A and B so that the horizontal line y=4 
fits smoothly with the curve y= 4+ Bx+x* at the point 
x=2, 
17 True (with reason) or false (with example): 

(a) If f(t) <0 then df /dt <0. 

(b) The derivative of (f(t))? is 2 df /dt. 

(c) The derivative of 2f(t) is 2 df /dt. 


(d) The derivative is the limit of Af divided by the limit 
of At. 


18 For f(x) = 1/x the centered difference f(x +h) —f(x —h)is 
1/(x + h) — 1/(x — h). Subtract by using the common denomi- 
nator (x + h)(x —h). Then divide by 2h and set h=0. Why 
divide by 2h to obtain the correct derivative? 


19 Suppose y=mx +b for negative x and y= Mx + B for 
x20. The graphs meet if . The two slopes are 
. The slope at x = 0 is (what is possible?). 


20 The slope of y=1/x at x= 1/4 is y) =—1/x? =—16. At 
h= 1/12, which of these ratios is closest to —16? 
Wx+h—yx) — vx)—yx—-h) Wx th)—yx—h) 
h h 2h 


21 Find the average slope of y=x? between x= x, and 
xX =X. What does this average approach as x, approaches x, ? 


22 Redraw Figure2.t when f(t)=3-—2¢ for t<2 and 
f()=—1 for t>2. Include df/dt. 
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23 Redraw Figure 2.3 for the function y(x)=1—(1/x). 
Include dy/dx. 


24 The limit of 0/At as At > 0 is not 0/0. Explain. 


25 Guess the limits by an informal working rule. Set At = 0.1 
and —0.1 and imagine At becoming smaller: 


1+At |At| 
a) 2+At (b) At 

At + (At)? t+At 
(©) At — (At)? () t—At 


*26 Suppose f(x)/x +7 as x 30. Deduce that f(0)=0 and 
f'(0) =7. Give an example other than f(x) = 7x. 


£B+x)—-f@) 
x 


27 What is lim if it exists? What if x > 1? 
Problems 28-31 use the square rule: d(u*)/dx = 2u(du/dx). 


28 Take u =x and find the derivative of x? (a new way). 


29 Take u=x* and find the derivative of x® (using 
du/dx = 4x). 
30 If w= 1 then u? = 1. Then d1/dx is 2 times d1/dx. How is 
this possible? 


31 Take u= wes The derivative of u? = x is 1 = 2u(du/dx). So 
what is du/dx, the derivative of Vx? 


32 The left figure shows f(t) = t?. Indicate distances f(t + At) 
and At and Af. Draw lines that have slope Af/At and f’(t). 


t t+At 
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33 The right figure shows f(x) and Ax. Find Af/Ax and f’(2). 
34 Draw f(x) and Ax so that Af/Ax =0 but f’(x) 40. 


35 If f=u? then df/dx=2udu/dx. If g=f? then 
dg/dx =2f df/dx. Together those give g=u* and dg/dx = 


36 True or false, assuming f(0)=0: 
(a) If f(x) <x for all x, then df/dx < 1. 
(b) If df/dx <1 for all x, then f(x) <x. 
37 The graphs show Af and Af /h for f(x) = x?. Why is 2x +h 


the equation for Af/h? If h is cut in half, draw in the new 
graphs. 


Vt careh 


he 


38 Draw the corresponding graphs for f(x) = 4x. 


39 Draw 1/x and 1/(x +h) and Af/h—either by hand with 
h =} or by computer to show h- 0. 


40 For y=e*, show on computer graphs that dy/dx = y. 


41 Explain the derivative in your own words. 


This section has two main goals. One is to find the derivatives of f(x) =x? and x* 
and x5 (and more generally f(x) =x"). The power or exponent n is at first a positive 
integer. Later we allow x* and x”? and every x”. 

The other goal is different. While computing these derivatives, we look ahead to 
their applications. In using calculus, we meet equations with derivatives in them— 
“differential equations.” It is too early to solve those equations. But it is not too early 
to see the purpose of what we are doing. Our examples come from economics and 


biology. 


2.2 Powers and Polynomials 


With n= 2, the derivative of x? is 2x. With n= — 1, the slope of x~! is —1x~?. 
Those are two pieces in a beautiful pattern, which it will be a pleasure to discover. 
We begin with x? and its derivative 3x”, before jumping to x". 


EXAMPLE 4 If f(x) = x° then Af=(x + h)? — x3 =(x? + 3x7h + 3xh? + h3)— x9. 
Step 1: Cancel x°. Step 2: Divide by h. Step 3: h goes to zero. 


a = 3x?+3xh+h? approaches a = 3x7, 


That is straightforward, and you see the crucial step. The power (x + h)° yields four 
separate terms x° + 3x7h + 3xh? + h°. (Notice 1, 3, 3, 1.) After x° is subtracted, we 
can divide by h. At the limit (h = 0) we have 3x?. 


For f(x)= x" the plan is the same. A step of size h leads to f(x + h)=(x+ A)’. 
One reason for algebra is to calculate powers like (x + h)", and if you have forgotten 
the binomial formula we can recapture its main point. Start with n= 4: 


(xt h\(xth\(xt+h\(x+h)=xt+ 992 +44, (1) 


Multiplying the four x’s gives x*. Multiplying the four h’s gives h*. These are the easy 
terms, but not the crucial ones. The subtraction (x + h)* — x* will remove x*, and the 
limiting step h > 0 will wipe out h* (even after division by h). The products that matter 
are those with exactly one h. In Example 1 with (x + h)?, this key term was 3x7h. 
Division by h left 3x?. 

With only one h, there are n places it can come from. Equation (1) has four h’s in 
parentheses, and four ways to produce x*h. Therefore the key term is 4x*h. (Division 
by h leaves 4x*.) In general there are n parentheses and n ways to produce x"~*h, so 
the binomial formula contains nx"~*h: 


(x + hy =x" tnx" tht -. +h" (2) 


Subtract x" from (2). Divide by h. The key term is nx"~!. The rest disappears as h > 0: 


n-1 


AP Oa So en Gf _ i. 


Ax h h si ax 


The terms replaced by the dots involve h? and h? and higher powers. After dividing 
by h, they still have at least one factor h. All those terms vanish as h approaches zero. 


EXAMPLE 2 (x + h)* = x*+ 4x3h+ 6x7h? + 4xh3 + h*. This is n= 4 in detail. 


Subtract x*, divide by h, let h > 0. The derivative is 4x3. The coefficients 1, 4, 6, 4, 1 
are in Pascal’s triangle below. For (x + h)* the next row is 1, 5, 10, ?. 


Remark The missing terms in the binomial formula (replaced by the dots) contain 
all the products x""/h/. An x or an h comes from each parenthesis. The binomial 
coefficient “‘n choose j” is the number of ways to choose j h’s out of n parentheses. It 
involves n factorial, which is n(n — 1) --- (1). Thus 5! = 5+4+3+2+1= 120. 


on 
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These are numbers that gamblers know and love: 


1 Pascal’s 
eee a) n! 11 triangle 
“n choose f° =| | | =- 121 
ae aL 1331  n=3 


14641 n=4 


In the last row, the coefficient of xh is 4!/1!3!=4+3+2-1/1°3-2-1=4. For 
the x*h? term, with j = 2, there are 4-3-2+1/2:1+2+1=6 ways to choose two f’s. 
Notice that 1+4+6+4+ 1 equals 16, which is 2*. Each row of Pascal’s triangle 
adds to a power of 2. 

Choosing 6 numbers out of 49 in a lottery, the odds are 49 - 48-47: 46°+45- 44/6! 
to 1. That number is N = ‘‘49 choose 6” = 13,983,816. It is the coefficient of x*3h® 
in (x + h)*°. If 4 times N tickets are bought, the expected number of winners is 4. The 
chance of no winner is e~*. The chance of one winner is 4e~*. See Section 8.4. 

Florida’s lottery in September 1990 (these rules) had six winners out of 109,163,978 
tickets. 


DERIVATIVES OF POLYNOMIALS 


Now we have an infinite list of functions and their derivatives: 
xx? x3 xt xP 1 2x 3x? 4x3 5x4... 


The derivative of x" is n times the next lower power x" |. That rule extends beyond 
these integers 1, 2,3, 4, 5 to all powers: 


f=i/x has f'=—1/x?: Example 3 of Section 2.1. (n= —1) 
f=1/x? has f’=—2/x*: Example 6 of Section 2.1. (n= —2) 
— ae has f'’=4x71??: true but not yet checked (n= 4) 

Remember that x~* means I/x? and x71? means 1 |./x. Negative powers lead to 
decreasing functions, approaching zero as x gets large. Their slopes have minus signs. 


Question What are the derivatives of x!° and x”? and x7 1/7? 

Answer 10x? and 2.2x!? and —4x~7. Maybe (x +h)? is a little unusual. 
Pascal’s triangle can’t deal with this fractional power, but the formula stays firm: 
After x?? comes 2.2x'h. The complete binomial formula is in Section 10.5. 


That list is a good start, but plenty of functions are left. What comes next is really 
simple. A tremendous number of new functions are “linear combinations” like 


f(x)=6x2 or 6x2 +4x? or 6x? —4x?. 


What are their derivatives? The answers are known for x° and x?, and we want to 
multiply by 6 or divide by 2 or add or subtract. Do the same to the derivatives: 


f(x) = 18x? or) «18x? +x or 18x*—x. 


2C The derivative of c times f(x) is c times f’(x). 


2D The derivative of f(x) + g(x) is f(x) + 9(x). 


The number c can be any constant. We can add (or subtract) any functions. The rules 
allow any combination of f and g: The derivative of 9f (x) — 7g(x) is 9f’(x) — 7g'(x). 


2.2 Powers and Polynomials 


The reasoning is direct. When f(x) is multiplied by c, so is f(x + h). The difference 
Af is also multiplied by c. All averages Af /h contain c, so their limit is cf’. The only 
incomplete step is the last one (the limit). We still have to say what “limit” means. 

Rule 2D is similar. Adding f+ g means adding Af+ Ag. Now divide by h. In the 
limit as h +0 we reach f’ + g’—because a limit of sums is a sum of limits. Any 
example is easy and so is the proof—it is the definition of limit that needs care 
(Section 2.6). 


You can now find the derivative of every polynomial. A “polynomial” is a combina- 
tion of 1, x, x”, ..., x" for example 9 + 2x — x°. That particular polynomial has slope 
2— 5x*. Note that the derivative of 9 is zero! A constant just raises or lowers the 
graph, without changing its slope. It alters the mileage before starting the car. 

The disappearance of constants is one of the nice things in differential calculus. 
The reappearance of those constants is one of the headaches in integral calculus. 
When you find v from f, the starting mileage doesn’t matter. The constant in f has 
no effect on v. (Af is measured by a trip meter; At comes from a stopwatch.) To find 
distance from velocity, you need to know the mileage at the start. 


A LOOK AT DIFFERENTIAL EQUATIONS (FIND y FROM dy/dx) 


We know that y = x? has the derivative dy/dx = 3x”. Starting with the function, we 

found its slope. Now reverse that process. Start with the slope and find the function. 

This is what science does all the time—and it seems only reasonable to say so. 
Begin with dy/dx = 3x”. The slope is given, the function y is not given. 


Question Can you go backward to reach y = x3? 

Answer Almost but not quite. You are only entitled to say that y=x°>+C. The 
constant C is the starting value of y (when x =0). Then the differential equation 
dy/dx = 3x? is solved. 


Every time you find a derivative, you can go backward to solve a differential 
equation. The function y = x? + x has the slope dy/dx = 2x + 1. In reverse, the slope 
2x + 1 produces x? + x—and all the other functions x? + x + C, shifted up and down. 
After going from distance f to velocity v, we return to f+ C. But there is a lot more 
to differential equations. Here are two crucial points: 


1. We reach dy/dx by way of Ay/Ax, but we have no system to go backward. With 
dy/dx = (sin x)/x we are lost. What function has this derivative? 


2. Many equations have the same solution y = x. Economics has dy/dx = 3y/x. 
Geometry has dy/dx = 3y?/?. These equations involve y as well as dy/dx. Func- 
tion and slope are mixed together! This is typical of differential equations. 


To summarize: Chapters 2—4 compute and use derivatives. Chapter 5 goes in reverse. 
Integral calculus discovers the function from its slope. Given dy/dx we find y(x). Then 
Chapter 6 solves the differential equation dy/dt = y, function mixed with slope. 
Calculus moves from derivatives to integrals to differential equations. 


This discussion of the purpose of calculus should mention a specific example. 
Differential equations are applied to an epidemic (like AIDS). In most epidemics the 
number of cases grows exponentially. The peak is quickly reached by e’, and the 
epidemic dies down. Amazingly, exponential growth is not happening with AIDS— 
the best fit to the data through 1988 is a cubic polynomial (Los Alamos Sciehce, 1989): 


The number of cases fits a cubic within 2%: y = 174.6(t — 1981.2)° + 340. 
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This is dramatically different from other epidemics. Instead of dy/dt = y we have 
dy/dt = 3y/t. Before this book is printed, we may know what has been preventing e’ 
(fortunately). Eventually the curve will turn away from a cubic—I hope that 
mathematical models will lead to knowledge that saves lives. 


Added in proof: In 1989 the curve for the U.S. dropped from ¢? to t?. 
MARGINAL COST AND ELASTICITY IN ECONOMICS 


First point about economics: The marginal cost and marginal income are crucially 
important. The average cost of making automobiles may be $10,000. But it is the 
$8000 cost of the next car that decides whether Ford makes it.’ The average describes 
the past, the marginal predicts the future.”’ For bank deposits or work hours or wheat, 
which come in smaller units, the amounts are continuous variables. Then the word 
“marginal” says one thing: Take the derivative.t 

The average pay over all the hours we ever worked may be low. We wouldn’t work 
another hour for that! This average is rising, but the pay for each additional hour 
rises faster—possibly it jumps. When $10/hour increases to $15/hour after a 40-hour 
week, a 50-hour week pays $550. The average income is $11/hour. The marginal 
income is $15/hour—the overtime rate. 


Concentrate next on cost. Let y(x) be the cost of producing x tons of steel. The 
cost of x + Ax tons is (x + Ax). The extra cost is the difference Ay. Divide by Ax, 
the number of extra tons. The ratio Ay/Ax is the average cost per extra ton. When 
Ax is an ounce instead of a ton, we are near the marginal cost dy/dx. 

Example: When the cost is x”, the average cost is x?/x = x. The marginal cost is 
2x. Figure 2.4 has increasing slope—an example of ‘‘diminishing returns to scale.” 


marginal 
cost 2x 


average 
cost x 


- E=0 
any price 


any supply 7 
fixed price ~~ 


eo 


x quantity equilibrium price price 


Fig. 2.4 Marginal exceeds average. Constant elasticity E = +1. Perfectly elastic to perfectly 
inelastic ([' curve). 


This raises another point about economics. The units are arbitrary. In yen per 
kilogram the numbers look different. The way to correct for arbitrary units is to work 
with percentage change or relative change. An increase of Ax tons is a relative increase 
of Ax/x. A cost increase Ay is a relative increase of Ay/y. Those are dimensionless, the 
same in tons/tons or dollars/dollars or yen/yen. 


A third example is the demand y at price x. Now dy/dx is negative. But again the 
units are arbitrary. The demand is in liters or gallons, the price is in dollars or pesos. 


{These paragraphs show how calculus applies to economics. You do not have to be an 
economist to understand them. Certainly the author is not, probably the instructor is not, 
possibly the student is not. We can all use dy/dx. 


2.2 Powers and Polynomials 


Relative changes are better. When the price goes up by 10%, the demand may drop 
by 5%. If that ratio stays the same for small increases, the elasticity of demand is 4. 

Actually this number should be — $. The price rose, the demand dropped. In our 
definition, the elasticity will be — 4. In conversation between economists the minus 
sign is left out (I hope not forgotten). 


DEFINITION The elasticity of the demand function y(x) is 


— tm Avy _ ayldx 
F(x) an Ax/x y[x ¢ 


(3) 


Elasticity is “marginal” divided by “average.” E(x) is also relative change in y divided 
by relative change in x. Sometimes E(x) is the same at all prices—this important case 
is discussed below. 


EXAMPLE 4 Suppose the demand is y=c/x when the price is x. The derivative 
dy/dx = — c/x? comes from calculus. The division y/x = c/x? is only algebra. The ratio 
isE=-1: 

For the demand y = c/x, the elasticity is (— c/x?)/(c/x”) = —1. 


All demand curves are compared with this one. The demand is inelastic when |E| < 1. 
It is elastic when |E|>1. The demand 20/./x is inelastic (E = — 4), while x~3 is 
elastic (E = — 3). The power y= cx", whose derivative we know, is the function with 
constant elasticity n: 


if y=cx" then dy/dx=cnx""! and E=cnx"~1'/(cx"/x) =n. 


It is because y = cx" sets the standard that we could come so early to economics. 
In the special case when y= c/x, consumers spend the same at all prices. Price x 
times quantity y remains constant at xy =c. 


EXAMPLE 2 The supply curve has E >0—supply increases with price. Now the 
baseline case is y= cx. The slope is c and the average is y/x=c. The elasticity is 
E=c/c=1. 


Compare E = 1 with E=0 and E= oo. A constant supply is “‘perfectly inelastic.” 
The power n is zero and the slope is zero: y=c. No more is available when the 
harvest is over. Whatever the price, the farmer cannot suddenly grow more wheat. 
Lack of elasticity makes farm economics difficult. 

The other extreme E = 00 is “perfectly elastic.” The supply is unlimited at a fixed 
price x. Once this seemed true of water and timber. In reality the steep curve 
x=constant is leveling off to a flat curve y= constant. Fixed price is changing to 
fixed supply, E = 00 is becoming E = 0, and the supply of water follows a “gamma 
curve”’ shaped like I. 


EXAMPLE 3 Demand is an increasing function of income—more income, more 
demand. The income elasticity is E(I) = (dy/d!)/(y/I). A luxury has E> 1 (elastic). 
Doubling your income more than doubles the demand for caviar. A necessity has 
E <1 (inelastic). The demand for bread does not double. Please recognize how the 
central ideas of calculus provide a language for the central ideas of economics. 


Important note on supply = demand This is the basic equation of microeconomics. 
Where the supply curve meets the demand curve, the economy finds the equilibrium 
price. Supply = demand assumes perfect competition. With many suppliers, no one can 
raise the price. If someone tries, the customers go elsewhere. 
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The opposite case is a monopoly—no competition. Instead of many small producers 
of wheat, there is one producer of electricity. An airport is a monopolist (and maybe 
the National Football League). If the price is raised, some demand remains. 

Price fixing occurs when several producers act like a monopoly—which antitrust 
laws try to prevent. The price is not set by supply = demand. The calculus problem 
is different—to maximize profit. Section 3.2 locates the maximum where the marginal 
profit (the slope!) is zero. 


Question on income elasticity From an income of $10,000 you save $500. The 
income elasticity of savings is E = 2. Out of the next dollar what fraction do you 
save? 

Answer The savings is y= cx? because E = 2. The number c must give 500= 
c(10,000)*, so c is 5+ 10~°. Then the slope dy/dx is 2cx = 10° 107° 104 = +. This is 
the marginal savings, ten cents on the dollar. Average savings is 5%, marginal savings 
is 10%, and E = 2. 


Read-through questions 


The derivative of f=x* is f’=_a_. That comes from 
expanding (x + h)* into the five terms __b 


. Subtracting x* 
and dividing by h leaves the four terms _¢ __. This is Af /h, 
and its limit is __d 


The derivative of f=x"is f’=_e . Now(x +h)" comes 
from the __t _ theorem. The terms to look for are x"~'h, 
containing only one __g__. There are __h___ of those terms, 
so (x+h)"=x"+_i__+---. After subtracting i and 
dividing by h, the limit of Af/h is ___k . The coefficient of 
x" Fh, not needed here, is ‘‘n choose 7” =__|_, where n! 
means __m 


2 2 


The derivative of x”? is _m__. The derivative of x!’? is 

o__. The derivative of 3x +(1/x) is__p _, which uses the 
following rules: The derivative of 3f(x)is__a__ and the deriv- 
ative of f(x) + g(x) is__t__. Integral calculus recovers __s 
from dy/dx. If dy/dx = x* then y(x) = __1 


1 Starting with f= x°, write down f’ and then f”. (This is 
“f double prime,” the derivative of f’.) After deriva- 
tives of x® you reach a constant. What constant? 


2 Find a function that has x° as its derivative. 


Find the derivatives of the functions in 3-10. Even if n is nega- 
tive or a fraction, the derivative of x” is nx"~*. 


3 x7 47x45 4 14(7/x)4+(5/x?) 
5S l4xtx2¢x3 4x4 6 (x? +1) 
Tx"+x" 8 x"/n! 


1 1 1 2) gts ad 
lext-x? 4-134 —x4 = 3/2 4 5 5/2 
9 Ge ta tog aa 10 5% am 


11 Name two functions with df/dx = 1/x?. 


12 Find the mistake: x* is x +x+ --- +x (with x terms). Its 
derivative is 1+1+ --- +1 (also x terms). So the derivative 
of x? seems to be x. 


13 What are the derivatives of 3x13 and —3x7'? and 
(3x43)719 


14 The slope of x + (1/x) is zero when x = 
does the graph do at that point? 


. What 


15 Draw a graph of y=x?— x. Where is the slope zero? 


16 If df/dx is negative, is f(x) always negative? Is f(x) nega- 
tive for large x? If you think otherwise, give examples. 
17 A rock thrown upward with velocity 16 ft/sec reaches 
height f= 16t — 16t? at time t. 

(a) Find its average speed Af /At from t=0 to t=. 

(b) Find its average speed Af/At from t=} to t=1. 

(c) What is df /dt at t= 4? 
18 When f is in feet and ¢ is in seconds, what are the units 


of f’ and its derivative f”? In f= 16t — 16t?, the first 16 is 
ft/sec but the second 16 is 


19 Graph y =x° + x? —x from x = — 2 to x = 2 and estimate 
where it is decreasing. Check the transition points by solving 
dy/dx =0. 


20 At a point where dy/dx =0, what is special about the 
graph of y(x)? Test case: y = x?. 


21 Find the slope of y= We by algebra (then h — 0): 
Ay _ J/xt+h~/x Jxth—/x Jxtht+s/x 
ho h - h Jxth+/x 


22 Imitate Problem 21 to find the slope of y = 1/./x. 
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23 Complete Pascal’s triangle for n =5 and n= 6. Why do 
the numbers across each row add to 2”? 


24 Complete (x +h? =x5+ . What are the bino- 


: : 5 5 5 
9 
mial coefficients ( i and ( ) and ( z) ? 


25 Compute (x +h)? —(x —h)°, divide by 2h, and set h=0. 
Why divide by 2h to find this slope? 


26 Solve the differential equation y" =x to find y(x). 


27 For f(x) =x? + x3, write out f(x + Ax) and Af/Ax. What 
is the limit at Ax = 0 and what rule about sums is confirmed? 


28 The derivative of (u(x))? is 
this rule on u =x". : 


from Section 2.1. Test 


29 What are the derivatives of x’ + 1 and (x + 1)7? Shift the 
graph of x’. 


30 If df/dx is v(x), what functions have these derivatives? 
(a) 4v(x) (b) v(x) +1 
(c) o(x + 1) (d) v(x) + v'(x). 
31 What function f(x) has fourth derivative equal to 1? 
32 What function f(x) has nth derivative equal to 1? 
33 Suppose df/dx=1+x+x?4+x°. Find f(x). 
34 Suppose df /dx =x~? — x73. Find f(x). 


35 f(x) can be its own derivative. In the infinite polynomial 
f=ltxt4hx? + hx3 4+ , what numbers multiply x* 
and x° if df/dx equals f? 


36 Write down a differential equation dy/dx = that 
is solved by y = x”, Make the right side involve y (not just 2x). 
37 True or false: (a) The derivative of x™ is zx". 

(b) The derivative of ax”/bx" is a/b. 

(c) If df/dx =x* and dg/dx = x* then f(x) = g(x). 

(d) (f(x) —f(a))/(x — a) approaches f'(a) as x > a. 

(ce) The slope of y=(x —1)° is y = 3(x— 1)’. 


Problems 38—44 are about calculus in economics. 


38 When the cost is y = yo + cx, find E(x) = (dy/dx)/(y/x). It 
approaches for large x. 


39 From an income of x = $10,000 you spend y = $1200 on 
your car. If E=4, what fraction of your next dollar will be 


spent on the car? Compare dy/dx (marginal) with y/x 
(average). 
40 Name a product whose price elasticity is 

(a) high (b) low (c) negative (?) 
41 The demand y= c/x has dy/dx = — y/x. Show that Ay/Ax 


is not —y/x. (Use numbers or algebra.) Finite steps miss the 
special feature of infinitesimal steps. 


42 The demand y=x" has E= 
(price times demand) has elasticity E = 


. The revenue xy 


43 y=2x +3 grows with marginal cost 2 from the fixed cost 
3. Draw the graph of E(x). 


44 From an income I we save S(I). The marginal propensity 
to save is . Elasticity is not needed because S$ and [ 
have the same . Applied to the whole economy this 
is (microeconomics) (macroeconomics). 


45 2' is doubled when t increases by . t? is doubled 
when t increases to t. The doubling time for AIDS 
is proportional to ¢. 


46 Biology also leads to dy/y =n dx/x, for the relative growth 
of the head (dy/y) and the body (dx/x). Isn>1orn<1 fora 
child? 


47 What functions have df/dx = x° and df/dx =x"? Why 
does n= —1 give trouble? 


48 The slope of y = x° comes from this identity: 


(x +h)? — x3 
h 


(a) Check the algebra. Find dy/dx as h +0. 
(b) Write a similar identity for y = x‘. 


=(x +h)? +(x +h)x + x?. 


49 (Computer graphing) Find all the points where y= 


x* + 2x3 — 7x? +3 =0 and where dy/dx =0. 


50 The graphs of y,(x) =x* +x? and y.(x) = 7x — 5 touch at 
the point where y3(x) = = 0. Plot y3(x) to see what is 
special. What does the graph of y(x) do at a point where 
y=y=0? 

51 In the Massachusetts lottery you choose 6 numbers out 
of 36. What is your chance to win? 


52 In what circumstances would it pay to buy a lottery ticket 
for every possible combination, so one of the tickets would 
win? 
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Chapter | started with straight line graphs. The velocity was constant (at least piece- 
wise). The distance function was linear. Now we are facing polynomials like x* — 2 
or x* — x? + 3, with other functions to come soon. Their graphs are definitely curved. 
Most functions are not close to linear—except if you focus all your attention near a 
single point. That is what we will do. 

Over a very short range a curve looks straight. Look through a microscope, or zoom 
in with a computer, and there is no doubt. The graph of distance versus time becomes 
nearly linear. Its slope is the velocity at that moment. We want to find the line that 
the graph stays closest to—the “tangent line’’—before it curves away. 

The tangent line is easy to describe. We are at a particular point on the graph of 
y=f(x). At that point x equals a and y equals f(a) and the slope equals f’(a). 
The tangent line goes through that point x =a, y=f(a) with that slope m= f'(a). 
Figure 2.5 shows the line more clearly than any equation, but we have to turn the 
geometry into algebra. We need the equation of the line. 


EXAMPLE 14 Suppose y= x*— x? + 3. At the point x = a= 1, the height is y = f(a) = 3. 
The slope is dy/dx = 4x> — 2x. At x = 1 the slope is 4 — 2=2. That is f’(a): 


The numbers x = 1, y = 3, dy/dx = 2 determine the tangent line. 
The equation of the tangent line is y — 3 = 2(x — 1), and this section explains why. 


y=3.1 
y=2v+l 
to (4,9) 
; ZOOM 
fla) =3 BOX 
tangent 
y=29 x 
x=0.9 


Fig. 2.5 ‘The tangent line has the same slope 2 as the curve (especially after zoom). 


THE EQUATION OF A LINE 


A straight line is determined by two conditions. We know the line if we know two 
of its points. (We still have to write down the equation.) Also, if we know one point 
and the slope, the line is set. That is the situation for the tangent line, which has a 
known slope at a known point: 


1. The equation of a line has the form y= mx +b 
2. The number m is the slope of the line, because dy/dx =m 
3. The number b adjusts the line to go through the required point. 


I will take those one at a time—first y= mx + b, then m, then b. 


1. The graph of y = mx + bis not curved. How do we know? For the specific example 
y= 2x +1, take two points whose coordinates x, y satisfy the equation: 


x=0,y=1 and x=4, y=9 both satisfy y=2x+1. 


\ 
y=f) 
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Those points (0, 1) and (4, 9) lie on the graph. The point halfway between has x = 2 
and y= 5. That point also satisfies y= 2x + 1. The halfway point is on the graph. If 
we subdivide again, the midpoint between (0, 1) and (2, 5) is (1, 3). This also has 
y= 2x +1. The graph contains all halfway points and must be straight. 


2. What is the correct slope m for the tangent line? In our example it is m =/'(a) = 2. 
The curve and its tangent line have the same slope at the crucial point: dy/dx = 2. 
Allow me to say in another way why the line y= mx + b has slope m. At x = 0 its 
height is y= b. At x = 1 its height is y= m+ b. The graph has gone one unit across 
(0 to 1) and m units up (b to m+ b). The whole idea is 


distance up — m 
distance across 1— 


slope = 


(1) 
Each unit across means m units up, to 2m+b or 3m+ b. A straight line keeps a 
constant slope, whereas the slope of y= x* — x? + 3 equals 2 only at x= 1. 


3. Finally we decide on b. The tangent line y = 2x + b must go through x = 1, y = 3. 
Therefore b = 1. With letters instead of numbers, y = mx + b leads to f(a)=ma+ b. 
So we know b: 


2E The equation of the tangent line has b = f(a) — ma: 


y=mx + f(a) — ma or y — f(a) = m(x — a). 


That last form is the best. You see immediately what happens at x =a. The factor 
x ~ ais zero. Therefore y = f(a) as required. This is the point-slope form of the equa- 
tion, and we use it constantly: 

—3 dist 
y-3=Ax-1) a. y _ distance up 


x-—1 distance across 


= slope 2. 


EXAMPLE 2 The curve y=x*>—2 goes through y=6 when x=2. At that point 
dy/dx = 3x? = 12. The point-slope equation of the tangent line uses 2 and 6 and 12: 


y—6=12(x-2), whichis also y=12x— 18. 


There is another important line. It is perpendicular to the tangent line and perpen- 
dicular to the curve. This is the normal line in Figure 2.6. Its new feature is its slope. 
When the tangent line has slope m, the normal line has slope —1/m. (Rule: Slopes of 


tangent line: 
slope m= dy distance 
dx 
: VT S: 
., normal line: 77 [Waiting * catch up 
slope _ 1 Car ra T= 8, V=12 
m od 
your speed is V 
time 
Xg=a 
: ; . 1 ; 
Fig. 2.6 Tangent line y — yo = m(x — Xo). Normal line y — yp = — i (x — Xo). Leaving a roller- 


coaster and catching up to a car. 
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perpendicular lines multiply to give —1.) Example 2 has m= 12, so the normal line 
has slope — 1/12: 


tangent line: y — 6 = 12(x ~ 2) normal line: y — 6 = — 2s(x — 2). 


Light rays travel in the normal direction. So do brush fires—they move perpendicular 
to the fire line. Use the point-slope form! The tangent is y = 12x — 18, the normal is 
not y= —qex- 18. 


EXAMPLE 3 You are on a roller-coaster whose track follows y= x* +4. You see a 
friend at (0, 0) and want to get there quickly. Where do you step off? 


Solution Your path will be the tangent line (at high speed). The problem is to cheose 
xX =a so the tangent line passes through x = 0, y= 0. When you step off at x =a, 


the height is y = a? + 4 and the slope is 2a 
the equation of the tangent line is y — (a? + 4) = 2a(x — a) 
this line goes through (0, 0) if — (a? + 4) = — 2a? or a= +2. 


The same problem is solved by spacecraft controllers and baseball pitchers. Releasing 
a ball at the right time to hit a target 60 feet away is an amazing display of calculus. 
Quarterbacks with a moving target should read Chapter 4 on related rates. 


Here is a better example than a roller-coaster. Stopping at a red light wastes gas. 
It is smarter to slow down early, and then accelerate. When a car is waiting in front 
of you, the timing needs calculus: 


EXAMPLE 4 How much must you slow down when a red light is 72 meters away? 
In 4 seconds it will be green. The waiting car will accelerate at 3 meters/sec?. You 
cannot pass the car. 


Strategy Slow down immediately to the speed V at which you will just catch that 
car. (If you wait and brake later, your speed will have to go below V.) At the catch- 
up time T, the cars have the same speed and same distance. Two conditions, so the 
distance functions in Figure 2.6d are tangent. 


Solution At time T, the other car’s speed is 3(T— 4). That shows the delay of 4 
seconds. Speeds are equal when 3(T — 4)= V or T= 3V +4. Now require equal dis- 
tances. Your distance is V times T. The other car’s distance is 72 + $at*: 


72+4-3(T-4" =VT becomes 72+43:4V?=V(4V +4). 
2 


The solution is V = 12 meters/second. This is 43 km/hr or 27 miles per hour. 
Without the other car, you only slow down to V = 72/4 = 18 meters/second. As 
the light turns green, you go through at 65 km/hr or 40 miles per hour. Try it. 


THE SECANT LINE CONNECTING TWO POINTS ON A CURVE 


Instead of the tangent line through one point, consider the secant line through two 
points. For the tangent line the points came together. Now spread them apart. The 
point-slope form of a linear equation is replaced by the two-point form. 

The equation of the curve is still y = f(x). The first point remains at x = a, y = f(a). 
The other point is at x =c, y=f{(c). The secant line goes between them, and we want 
its equation. This time we don’t start with the slope—but mm is easy to find. 
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EXAMPLES The curve y= x°—2 goes through x= 2, y= 6. It also goes through 
x = 3, y= 25. The slope between those points is 


_changeiny 25-6 _ 
changeinx 3-2 


The point-slope form (at the first point) is y— 6 = 19(x — 2). This line automatically 
goes through the second point (3, 25). Check: 25— 6 equals 19(3— 2). The secant 
has the right slope 19 to reach the second point. It is the average slope Ay/Ax. 


A look ahead The second point is going to approach the first point. The secant 
slope Ay/Ax will approach the tangent slope dy/dx. We discover the derivative (in 
the limit). That is the main point now—but not forever. 

Soon you will be fast at derivatives. The exact dy/dx will be much easier than 
Ay/Ax. The situation is turned around as soon as you know that x° has slope 
9x®, Near x=1, the distance up is about 9 times the distance across. To find 
Ay = 1.0019 — 1°, just multiply Ax = .001 by 9. The quick approximation is .009, the 
calculator gives Ay = .009036. It is easier to follow the tangent line than the curve. 


Come back to the secant line, and change numbers to letters. What line connects 
x=a, y=f(a) tox=c, y=f(c)? A mathematician puts formulas ahead of numbers, 
and reasoning ahead of formulas, and ideas ahead of reasoning: 


distance up __f(c) — f(a) 
distance across. c—a 
(2) The height is y=f(a) at x=a 
(3) The height is y=f(c) atx =c (automatic with correct slope). 


(1) The slope is m= 


2F The two-point form uses the slope between the points: 


= (B=L) og oe 


| secant ine. © y- y-f@= 


At x =a the right side is zero. So y=f(a) on the left side. At x = c the right side has 
two factors c — a. They cancel to leave y = f(c). With equation (2) for the tangent line 
and equation (3) for the secant line, we are ready for the moment of truth. 


THE SECANT LINE APPROACHES THE TANGENT LINE 


What comes now is pretty basic. It matches what we did with velocities: 


A distance _ f(t + At)— So 


average velocity = A tims A 


The limit is df/dt. We now do exactly the same thing with slopes. The secant line 
turns into the tangent line as c approaches a: 


_ Af _fo-f@ 
slope of secant line: a pa 
af 
d 


Af 


slope of tangent line: = limit of Ay 
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There stands the fundamental idea of differential calculus! You have to imagine more 
secant lines than I can draw in Figure 2.7, as c comes close to a. Everybody recognizes 
c—aas Ax. Do you recognize f(c) —f(a) as f(x + Ax) — f(x)? It is Af, the change 
in height. All lines go through x = a, y=f(a). Their limit is the tangent line. 


Ax secant 


fO 
He soe secant y—f(a)= fof a 
fla) tangent tangent y —f(a)=f"(a)(x — a) 


acce 


Fig. 2.7 Secants approach tangent as their 
slopes Af/Ax approach df /dx. 


Intuitively, the limit is pretty clear. The two points come together, and the tangent 
line touches the curve at one point. (It could touch again at faraway points.) Mathe- 
matically this limit can be tricky—it takes us from algebra to calculus. Algebra stays 
away from 0/0, but calculus gets as close as it can. 

The new limit for df /dx looks different, but it is the same as before: 


fixie. 


ca c—a 


(4) 


EXAMPLE 6 Find the secant lines and tangent line for y= f(x) =sin x at x=0. 


The starting point is x = 0, y = sin 0. This is the origin (0, 0). The ratio of distance up 
to distance across is (sin c)/c: 


. sin c ; 
secant equation y = ras x tangent equation y = 1x. 


As c approaches zero, the secant line becomes the tangent line. The limit of (sin c)/c 
is not 0/0, which is meaningless, but 1, which is dy/dx. 


EXAMPLE 7 The gold you own will be worth Jt million dollars in t years. When 
does the rate of increase drop to 10% of the current value, so you should sell the 
gold and buy a bond? At t = 25, how far does that put you ahead of ft = 5? 


Solution The rate of increase is the derivative of ,/t, which is 1/2,/t. That is 10% 
of the current value Jt when 1/2,/t = Jt 10. Therefore 2t = 10 or t= 5. At that time 
you sell the gold, leave the curve, and go onto the tangent line: 


y-J3=-Ye-5) becomes y-/5=2 5 at t=25. 


With straight interest on the bond, not compounded, you have reached 
y= 3/5 = 6.7 million dollars. The gold is worth a measly five million. 


2.3 EXERCISES 
Read-through questions of the __¢_. The point-slope form of the tangent equation 


i — = d 
A straight line is determined by __@ __ points, or one point isy—f(j=_¢_. 


and the __b __. The slope of the tangent line equals the slope The tangent line to y=x3+xatx=I1hasslope_e_. Its 
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equation is __! . It crosses the y axis at _ 9 


and the 
x axis at _ . The normal line at this point (1, 2) has 


slope __|_. Its equation is y-2=__J__. The secant line 
from (1,2) to (2,__k ) has slope __| . Its equation is 
ae ee 


The point (c, f(c)) is on the line y—f(a) = m(x — a) pro- 
vided m=____. Asc approaches a, the slope m approaches 
°o__. The secant line approaches the _ P __line. 


1 (a) Find the slope of y = 12/x. 
(b) Find the equation of the tangent line at (2, 6). 
(c) Find the equation of the normal line at (2, 6). 
(d) Find the equation of the secant line to (4, 3). 


2 For y=x? + x find equations for 
(a) the tangent line and normal line at (1, 2); 
(b) the secant line tox =1+ h, y=(1+A)? +(1 +h). 


3 A line goes through (1, —1) and (4, 8). Write its equation 
in point-slope form. Then write it as y = mx + b. 


4 The tangent line to y=x?+6x at the origin is 
y= . Does it cross the curve again? 


5 The tangent line to y=x?—3x?4-x at the origin is 
y= . It is also the secant line to the point 


6 Find the tangent line to x = y? at x = 4, y=2. 


7 For y=x? the secant line from (a, a”) to (c, c?) has the 
equation . Do the division by c — a to find the tan- 
gent line as c approaches a. 


8 Construct a function that has the same slope at x = 1 and 
x = 2. Then find two points where y = x* — 2x? has the same 
tangent line (draw the graph). 


9 Find a curve that is tangent to y= 2x —3 at x =5. Find 
the normal line to that curve at (5, 7). 


10 For y= 1/x the secant line from (a, 1/a) to (c, 1/c) has the 
equation . Simplify its slope and find the limit as c 
approaches a. 


11 What are the equations of the tangent line and normal 
line to y=sin x at x = 2/2? 

12 If c and a both approach an in-between value x = b, then 
the secant slope (f(c) —f(a))/(c — a) approaches 


13 At x =a on the graph of y = 1/x, compute 
(a) the equation of the tangent line 
(b) the points where that line crosses the axes. 


The triangle between the tangent line and the axes always has 
area 


14 Suppose g(x) =f(x) +7. The tangent lines to f and g at 
x =4are . True or false: The distance between those 
lines is 7. 


15 Choose c so that y= 4x is tangent to y=x? +c. Match 
heights as well as slopes. 


16 Choose c so that y = 5x —7 is tangent to y= x? + cx. 


17 For y=x° + 4x? — 3x +1, find all points where the tan- 
gent is horizontal. 


18 y =4x can’t be tangent to y=cx?. Try to match heights 
and slopes, or draw the curves. 


19 Determine c so that the straight line joining (0, 3) and 
(5, —2) is tangent to the curve y =c/(x + 1). 


20 Choose b,c, d so that the two parabolas y=x?+bx+c 
and y = dx — x” are tangent to each other at x = 1, y=0. 


21 The graph of f(x) =x? goes through (1, 1). 
(a) Another point is x =c=1+h, y=f(c)= 
(b) The change in f is Af= 
(c) The slope of the secant is m= 
(d) As h goes to zero, m approaches 


22 Construct a function y =f(x) whose tangent line at x =1 
is the same as the secant that meets the curve again at x = 3. 


23 Draw two curves bending away from each other. Mark 
the points P and Q where the curves are closest. At those 
points, the tangent lines are and the normal lines 
are 


*24 If the parabolas y = x? + 1 and y=x — x? come closest at 
(a, a? + 1) and (c, ¢ — c?), set up two equations for a and c. 
25 A light ray comes down the line x = a. It hits the parabolic 
reflector y = x? at P =(a, a’). 

(a) Find the tangent line at P. Locate the point Q where 
that line crosses the y axis. 

(b) Check that P and Q are the same distance from the 
focus at F = (0, 3). 

(c) Show from (b) that the figure has equal angles. 


(d) What law of physics makes every ray reflect off the 
parabola to the focus at F? 


vertical ray 


26 In a bad reflector y =2/x, a ray down one special line 
x =a is reflected horizontally. What is a? 
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27 For the parabola 4py = x”, where is the slope equal to 1? 
At that point a vertical ray will reflect horizontally. So the 
focus is at (0, ). 


28 Why are these statements wrong? Make them right. 
(a) If y= 2x is the tangent line at (1, 2) then y = —4x is 
the normal line. 
(b) As c approaches a, the secant slope (f(c) —f(a))/(c — a) 
approaches (f(a) —f(a))/(a — a). 
(c) The line through (2, 3) with slope 4 is y — 2 = 4(x — 3). 


29 A ball goes around a circle: x = cos t, y=sint. Att = 32/4 
the ball flies off on the tangent line. Find the equation of that 
line and the point where the ball hits the ground (y = 0). 


30 If the tangent line to y=f(x) at x =a is the same as the 
tangent line to y = g(x) at x = b, find two equations that must 
be satisfied by a and b. 


31 Draw a circle of radius 1 resting in the parabola y= x?. 
At the touching point (a, a”), the equation of the normal line 
is . That line has x = 0 when y = . The dis- 
tance to (a, a”) equals the radius 1 when a = . This 
locates the touching point. 


32 Follow Problem 31 for the flatter parabola y=4x? and 
explain where the circle rests. 


33 You are applying for a $1000 scholarship and your time 
is worth $10 a hour. If the chance of success is 1 — (1/x) from 
x hours of writing, when should you stop? 


34 Suppose |f(c) —f(a)| <|c —a| for every pair of points a 
and c. Prove that |df/dx|< 1. 


35 From which point x = a does the tangent line to y = 1/x? 
hit the x axis at x = 3? 
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36 If u(x)/v(x) =7 find u’(x)/v'(x). Also find (u(x)/u(x)y’. 


37 Find f(c)=1.001'° in two ways—by calculator and by 
S() —f(@) &f'(a\(c — a). Choose a = 1 and f(x) = x?®. 


38 At a distance Ax from x = 1, how far is the curve y = 1/x 
above its tangent line? 


39 At a distance Ax from x = 2, how far is the curve y =x? 
above its tangent line? 


40 Based on Problem 38 or 39, the distance between curve 
and tangent line grows like what power (Ax)?? 


41 The tangent line to f(x)=x?—1 at x9=2 crosses the 
x axis at x; = . The tangent line at x, crosses the 
xX axis at x2= . Draw the curve and the two 
lines, which are the beginning of Newton’s method to solve 


f(x) =0. 


42 (Puzzle) The equation y= mx +b requires two numbers, 
the point-slope form y — f(a) =f'(a)(x — a) requires three, and 
the two-point form requires four: a, f(a), c, f(c). How can 
this be? 


43 Find the time T at the tangent point in Example 4, when 
you catch the car in front. 


44 If the waiting car only accelerates at 2 meters/sec”, what 
speed V must you slow down to? 


45 A thief 40 meters away runs toward you at 8 meters 
per second. What is the smallest acceleration so that v = at 
keeps you in front? 


46 With 8 meters to go in a relay race, you slow down badly 
(f=-—8 + 6t —4t?). How fast should the next runner start 
(choose v in f= vt) so you can just pass the baton? 


This section does two things. One is to compute the derivatives of sin x and cos x. 
The other is to explain why these functions are so important. They describe oscillation, 
which will be expressed in words and equations. You will see a ‘‘differential equation.” 
It involves the derivative of an unknown function y(x). 

The differential equation will say that the second derivative—the derivative of the 
derivative—is equal and opposite to y. In symbols this is y” = — y. Distance in one 
direction leads to acceleration in the other direction. That makes y and y’ and y’ all 
oscillate. The solutions to y” = — y are sin x and cos x and all their combinations. 


We begin with the slope. The derivative of y = sin x is y’ = cos x. There is no reason 
for that to be a mystery, but I still find it beautiful. Chapter 1 followed a bali around 
a circle; the shadow went up and down. Its height was sin ¢t and its velocity was cos t. 


1.00001... ; 
.995 
995 cosh 
1 sinh 
.100. . 
Fig. 2.8 
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We now find that derivative by the standard method of limits, when y(x)= sin x: 
Ay _ lim sin(x + h) — sin x 


: (1) 


dy_... 

7 limit of Ao 
The sine is harder to work with than x? or x°. Where we had (x + h)? or (x + h)®, we 
now have sin(x + h). This calls for one of the basic “addition formulas” from trigo- 
nometry, reviewed in Section 1.5: 


sin(x + h) = sin x cos h+ cos x sin h (2) 
cos(x + h)=cos x cos h— sin x sin h. (3) 
Equation (2) puts Ay = sin(x + h)— sin x in a new form: 
’ a ee a4 inh 
Ay _ sin x cos h+ cos x sin h sin * | nx cos h ns sin 4 
Ax h h h 


The ratio splits into two simpler pieces on the right. Algebra and trigonometry got 
us this far, and now comes the calculus problem. What happens as h — 0? It is no 
longer easy to divide by h. (I will not even mention the unspeakable crime of writing 
(sin h)/h= sin.) There are two critically important limits—the first is zero and the 
second is one: 


. cosh—1 . sinh _ 
a ee (5) 
The careful reader will object that limits have not been defined! You may further 
object to computing these limits separately, before combining them into equation (4). 
Nevertheless—following the principle of ideas now, rigor later—I would like to pro- 
ceed. It is entirely true that the limit of (4) comes from the two limits in (5): 

dy 


a (sin x)(first limit) + (cos x)(second limit) = 0+ cos x. (6) 


The secant slope Ay/Ax has approached the tangent slope dy/dx. 


We cannot pass over the crucial step—the two limits in (5). They contain the real 
ideas. Both ratios become 0/0 if we just substitute h= 0. Remember that the cosine of 
a zero angle is 1, and the sine of a zero angle is 0. Figure 2.8a shows a small angle h 
(as near to zero as we could reasonably draw). The edge of length sin h is close to 
zero, and the edge of length cos h is near 1. Figure 2.8b shows how the ratio of sin h 
to h (both headed for zero) gives the slope of the sine curve at the start. 

When two functions approach zero, their ratio might do anything. We might have 

bP h Jh 
A —>0 or ,7! or h — 00. 


No clue comes from 0/0. What matters is whether the top or bottom goes to zero 
more quickly. Roughly speaking, we want to show that (cos h — 1)/h is like h?/h and 
(sin h)/h is like h/h. 


Time out The graph of sin x is in Figure 2.9 (in black). The graph of sin(x + Ax) 
sits just beside it (in red). The height difference is Af when the shift distance is Ax. 
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Af _sin (x + h)—sinx 
ho h 


Fig. 2.9 sin(x + A) with h = 10° = 7/18 radians. Af/Ax is close to cos x. 


Now divide by that small number Ax (or h). The second figure shows Af/Ax. It is 
close to cos x. (Look how it starts—it is not quite cos x.) Mathematics will prove 
that the limit is cos x exactly, when Ax > 0. Curiously, the reasoning concentrates 
on only one point (x = 0). The slope at that point is cos 0= 1. 

We now prove this: sin Ax divided by Ax goes to 1. The sine curve starts with 
slope 1. By the addition formula for sin (x + A), this answer at one point will lead to 
the slope cos x at all points. 


Question Why does the graph of f(x + Ax) shift left from f(x) when Ax > 0? 
Answer When x = 0, the shifted graph is already showing f(Ax). In Figure 2.9a, the 
red graph is shifted left from the black graph. The red graph shows sin h when the 
black graph shows sin 0. 


THE LIMIT OF (sin A)/h IS 4 


There are several ways to find this limit. The direct approach is to let a computer 
draw a graph. Figure 2.10a is very convincing. The function (sin h)/h approaches 1 at 
the key point h=0. So does (tan h)/h. In practice, the only danger is that you might 
get a message like ‘‘undefined function’’ and no graph. (The machine may refuse to 
divide by zero at h = 0. Probably you can get around that.) Because of the importance 
of this limit, I want to give a mathematical proof that it equals 1. 


—n/2 h=0 m/2 
Fig. 2.40 (sin h)/h squeezed between cos x and 1; (tan h)/h decreases to 1. 


Figure 2.10b indicates, but still only graphically, that sin h stays below h. (The first 
graph shows that too; (sin h)/h is below 1.) We also see that tanh stays above h. 
Remember that the tangent is the ratio of sine to cosine. Dividing by the cosine is 
enough to push the tangent above h. The crucial inequalities (to be proved when h 
is small and positive) are 


sinh<h and tan h>h. (7) 
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Since tan h = (sin h)/(cos h), those are the same as 


ae <1 and ae > cos h. (8) 
h h 
What happens as h goes to zero? The ratio (sin h)/h is squeezed between cos h and 1. 
But cosh is approaching 1! The squeeze as h— 0 leaves only one possibility for 
(sin h)/h, which is caught in between: The ratio (sin h)/h approaches 1. 

Figure 2.10 shows that “squeeze play.” If two functions approach the same limit, so 
does any function caught in between. This is proved at the end of Section 2.6. 

For negative values of h, which are absolutely allowed, the result is the same. To 
the left of zero, h reverses sign and sin h reverses sign. The ratio (sin h)/h is unchanged. 
(The sine is an odd function: sin (— h) = — sin h.) The ratio is an even function, sym- 
metric around zero and approaching 1 from both sides. 

The proof depends on sin h < h < tan h, which is displayed by the graph but not 
explained. We go back to right triangles. 


1 1 
area-h < area > tan h 


Fig. 2.44 Line shorter than arc: 2 sin h < 2h. Areas give h < tan h. 


Figure 2.1la shows why sinh<h. The straight line PQ has length 2 sin h. The 
circular arc must be longer, because the shortest distance between two points is a 
straight line.t The arc PQ has length 2h. (Important: When the radius is 1, the arc 
length equals the angle. The full circumference is 2x and the full angle is also 27.) 
The straight distance 2 sin h is less than the circular distance 2h, so sin h < h. 

Figure 2.11b shows why h < tan h. This time we look at areas. The triangular area 
is }(base)(height) = $(1)(tan h). Inside that triangle is the shaded sector of the circle. 
Its area is h/2z times the area of the whole circle (because the angle is that fraction 
of the whole angle). The circle has area xr? = z, so multiplication by h/27 gives +h 
for the area of the sector. Comparing with the triangle around it, 4 tan h> $h. 

The inequalities sinh<h<tanh are now proved. The squeeze in equation (8) 
produces (sin h)/h + 1. Q.E.D. Problem 13 shows how to prove sin h < h from areas. 


Note All angles x and h are being measured in radians. In degrees, cos x is not the 
derivative of sin x. A degree is much less than a radian, and dy/dx is reduced by the 
factor 27/360. 


THE LIMIT OF (cos h — 4)/h IS 0 


This second limit is different. We will show that 1 — cos h shrinks to zero more quickly 
than h. Cosines are connected to sines by (sin h)? + (cos h)? = 1. We start from the 


tIf we try to prove that, we will be here all night. Accept it as true. 
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known fact sin h < h and work it into a form involving cosines: 
(1 —cos h)(1 + cos h) = 1 — (cos h)? = (sin h)? < h?. (9) 
Note that everything is positive. Divide through by h and also by 1 + cosh: 
1—cosh h 
< f 
h 1+cosh 


Our ratio is caught in the middle. The right side goes to zero because h > 0. This is 
another “squeeze”’—there is no escape. Our ratio goes to zero. 

For cos h — 1 or for negative h, the signs change but minus zero is still zero. This 
confirms equation (6). The slope of sin x is cos x. 


0 


(10) 


Remark Equation (10) also shows that 1 — cos h is approximately +h?. The 2 comes 
from 1+ cos h. This is a basic purpose of calculus—to find simple approximations 
like $h?. A “tangent parabola” 1 — +h? is close to the top of the cosine curve. 


THE DERIVATIVE OF THE COSINE 


This will be easy. The quick way to differentiate cos x is to shift the sine curve by 
n/2.That yields the cosine curve (solid line in Figure 2.12b). The derivative also shifts 
by 2/2 (dotted line). The derivative of cos x is — sin x. 

Notice how the dotted line (the slope) goes below zero when the solid line turns 
downward. The slope equals zero when the solid line is level. Increasing functions 
have positive slopes. Decreasing functions have negative slopes. That is important, and 
we return to it. 

There is more information in dy/dx than “function rising” or “‘function falling.” 
The slope tells how quickly the function goes up or down. It gives the rate of change. 
The slope of y=cos x can be computed in the normal way, as the limit of Ay/Ax: 


Ay — cos({x +h) — cos x (= he *) . (5 *) 
= =cos x | ————_] ~ sin x 


Ax h h h 
4Y = (cos x)(0) ~ (Gin x)(1) = — sin x 11) 
ae (cos x)(0) — (sin x)(1) = — sin x. ( 


The first line came from formula (3) for cos(x +h). The second line took limits, 
reaching 0 and 1 as before. This confirms the graphical proof that the slope of cos x 
is —sin x. 


y = sin x is increasing y=cos.x is decreasing 


‘Vy’ = cos x is positive 


y'=— sin v is negative 


7. - 


y" =—sin fis negative 


Fig. 2.42 (x) increases where }” is positive. )(x) bends up where \” is positive. 


THE SECOND DERIVATIVES OF THE SINE AND COSINE 


We now introduce the derivative of the derivative. That is the second derivative of the 
original function. It tells how fast the slope is changing, not how fast y itself is 
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changing. The second derivative is the “rate of change of the velocity.” A straight line 
has constant slope (constant velocity), so its second derivative is zero: 


f(t)}=5t has df/dt=5 and df /dt? =0. 
The parabola y = x? has slope 2x (linear) which has slope 2 (constant). Similarly 
f(t)=tat? has df/dt=at and d?f/dt? =a. 


There stands the notation df /dt? (or d? y/dx”) for the second derivative. A short 
form is f” or y”. (This is pronounced f double prime or y double prime). Example: 
The second derivative of y = x? is y” = 6x. 


In the distance-velocity problem, f” is acceleration. It tells how fast v is changing, 
while v tells how fast f is changing. Where df/dt was distance/time, the second 
derivative is distance/(time)?. The acceleration due to gravity is about 32 ft/sec? or 
9.8 m/sec”, which means that v increases by 32 ft/sec in one second. It does not mean 
that the distance increases by 32 feet! 

The graph of y = sin t increases at the start. Its derivative cos t is positive. However 
the second derivative is —sin t. The curve is bending down while going up. The arch 
is ‘concave down’ because y” = — sin t is negative. 

At t= 7 the curve reaches zero and goes negative. The second derivative becomes 
positive. Now the curve bends upward. The lower arch is “concave up.” 


y” > 0 means that y’ increases so y bends upward (concave up) 


y” <0 means that y’ decreases so y bends down (concave down). 


Chapter 3 studies these things properly——here we get an advance look for sin t. 

The remarkable fact about the sine and cosine is that y” = — y. That is unusual 
and special: acceleration = — distance. The greater the distance, the greater the force 
pulling back: 


y=sint has dy/dt=+cost and d*y/dt? =—sint =— y. 
y=cost has dy/dt=—sint and d?y/dt?=—cost=— y. 


Question Does d?y/dt? <0 mean that the distance y(t) is decreasing? 
Answer No. Absolutely not! It means that dy/dt is decreasing, not necessarily y. 
At the start of the sine curve, y is still increasing but y” <0. 

Sines and cosines give simple harmonic motion—up and down, forward and back, 
out and in, tension and compression. Stretch a spring, and the restoring force pulls 
it back. Push a swing up, and gravity brings it down. These motions are controlled 
by a differential equation: 

2 


crane. (12) 


a 


All solutions are combinations of the sine and cosine: y= A sin t + Bcos t. 

This is not a course on differential equations. But you have to see the purpose of 
calculus. It models events by equations. It models oscillation by equation (12). Your 
heart fills and empties. Balls bounce. Current alternates. The economy goes up and 
down: 


high prices - high production > low prices — --- 


We can’t live without oscillations (or differential equations). 
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2.4 EXERCISES 


Read-through questions 


The derivative of y= sin x is y’=__a__. The second deriva- 
tive (the __b of the derivative) is y’=__¢__. The fourth 
derivative is y’’=_ 4 

differential equations y”=__e 


Thus y=sinx satisfies the 
. So does 
y =cos x, whose second derivativeis_9 . 


and yy" = t 


All these derivatives come from one basic limit: (sin h)/h 
approaches _h_ . The sine of .01 radians is very close 
to i. So is the _ i of .01. The cosine of .01 is 
not .99, because [—cosh is much _k _ than h. The ratio 
(1 —cos h)/h? approaches __!__. Therefore cos h is close to 
1—4h? and cos .01 ~__m_. We can replace h by x. 


The differential equation y” = — y leadsto__n__. When y 
is positive, y’ is__o _. Therefore y’ is _p__. Eventually y 
goes below zero and y" becomes _q_. Then yis_o.t 


Examples of oscillation in real life are__s and__t 


1 Which of these ratios approach 1 as h > 0? 


sin? h sin h 


so h) 
(b) yee () Sin 2h (d) 


(a) > 


sin i. 


2 (Calculator) Find (sin h)/h at h=0.5 and 0.1 and .01. 
Where does (sin h)/h go above .99? 


3 Find the limits as h > 0 of 


sin? h sin a 


(a) i (b) 


4 Where does tan h = 1.01h? Where does tan h = h? 


5 y=sinx has period 2x, which means that sinx= 
. The limit of (sin (2x + h) — sin ie is 1 because 
. This gives dy/dx at x = 


6 Draw cos(x + Ax) next to cos x. Mark the height differ- 
ence Ay. Then draw Ay/Ax as in Figure 2.9. 


7 The key to trigonometry is cos?@=1-—sin?6. Set 
sinO~0 to find cos*@~%1—67. The square root is 
cos 0 1—40?. Reason: Squaring gives cos?@~ 
and the correction term is very small near 0 =0. 

8 (Calculator) Compare cos @ with 1 — 46? for 

(a) 0=0.1 (b)0=05 (c) @=30° (d) 0=3°. 

9 Trigonometry gives cos 6 = 1 — 2 sin? 46. The approxima- 

tion sin $6 ~ leads directly to cos 0 x 1 — 467. 


10 Find the limits as h — 0: 


1—cosh 1—cos*h 
(a) ee (b) = 
1—cos?h 1—cos 2h 
© Sah aes ae 


11 Find by calculator or calculus: 


G@ iim sin 3h (b) lim 1 


ho sin 2h 40 


—cos 2h 
1—cosh’ 


12 Compute the slope at x = 0 directly from limits: 
(a) y=tan x (b) y=sin(—x) 
13 The unmarked points in Figure 2.11 are P and S. Find the 


height PS and the area of triangle OPR. Prove by areas that 
sinh <h. 


14 The slopes of cos x and 1 —4x? are —sin x and 
The slopes of sin x and are cos x and 1 —4x?. 
15 Chapter 10 gives an infinite series for sin x: 


x3 x 


x 
cea es ar on 


From the derivative find the series for cos x. Then take its 
derivative to get back to —sin x. 


16 A centered difference for f(x) =sin x is 


fxt+th—f(x—h) sin +h)—sin(x—h) 
2h ~ 2h 


Use the addition formula (2). Then let h > 0. 


=? 


17 Repeat Problem 16 to find the slope of cos x. Use formula 
(3) to simplify cos (x + h) — cos(x — h). 


18 Find the tangent line to y =sin x at 
(a) x=0 (b) x=2 (c) x=7/4 


19 Where does y = sin x + cos x have zero slope? 


‘20 Find the derivative of sin (x + 1) in two ways: 


(a) Expand to sin x cos 1 + cos x sin 1. Compute dy/dx. 
(b) Divide Ay = sin(x + 1 + Ax) —sin(x + 1) by Ax. Write 
X instead of x + 1. Let Ax go to zero. 


21 Show that (tan h)/h is cas between 1 and 1/cos h. As 
h — 0 the limit is 


22 For y =sin 2x, the ratio Ay/h is 
sin 2(x+h)—sin 2x _ sin 2x(cos 2h — 1) + cos 2x sin 2h 
h 7 h : 
Explain why the limit dy/dx is 2 cos 2x. 


23 Draw the graph of y = sin 4x. State its slope at x =0, 2/2, 
x, and 2x. Does 4 sin x have the same slopes? 


3 cos x. Its maximum 
. The slope at that point 


24 Draw the graph of y=sinx+ 
value is y = atx= 

is 

25 By combining sin x and cos x, find a combination that 


starts at x = 0 from y = 2 with slope 1. This combination also 
solves y” = 
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26 True or false, with reason: 29 If his measured in degrees, find lim,-., (sin h)/h. You could 


ms : : set your calculator in degree mode. 
(a) The derivative of sin? x is cos? x . 


(b) The derivative of cos (—x) is sin x 30 Write down a ratio that approaches dy/dx at x =x. For 


(c) A positive function has a negative second derivative. y=sin x and Ax = .01 compute that ratio. 


(d) If y’ is increasing then y" is positive. 31 By the square rule, the derivative of (u(x))? is 2u du/dx. 
Take the derivative of each term in sin? x + cos? x = 1. 


27 Find solutions to dy/dx = sin 3x and dy/dx = cos 3x. 32 Give an example of oscillation that does not come from 


28 If y=sin 5x then y’ =5cos 5x and y’=—25sin 5x. So physics. Is it simple harmonic motion (one frequency only)? 


this function satisfies the differential equation y’ = ; 33 Explain the second derivative in your own words. 


= 2.5 The Product and Quotient and Power Rules [x 


What are the derivatives of x + sin x and x sin x and 1/sin x and x/sin x and sin"x? 
Those are made up from the familiar pieces x and sin x, but we need new rules. 
Fortunately they are rules that apply to every function, so they can be established 
once and for all. If we know the separate derivatives of two functions u and v, then 
the derivatives of u+v and uv and 1/v and u/v and u" are immediately available. 

This is a straightforward section, with those five rules to learn. It is also an impor- 
tant section, containing most of the working tools of differential calculus. But I am 
afraid that five rules and thirteen examples (which we need—the eyes glaze over with 
formulas alone) make a long list. At least the easiest rule comes first. When we add 
Junctions, we add their derivatives. 


Sum Rule 
eee _ d du dv 
The derivative of the sum u(x) + v(x) is —(ut+v)=—+—-. 
dx dx dx 


(1) 
EXAMPLE 4 The derivative of x + sin x is 1+ cos x. That is tremendously simple, 
but it is fundamental. The interpretation for distances may be more confusing (and 
more interesting) than the rule itself: 


Suppose a train moves with velocity 1. The distance at time ¢ is t. On the train 
a professor paces back and forth (in simple harmonic motion). His distance from 
his seat is sin t. Then the total distance from his starting point is t+ sin t, and 
his velocity (train speed plus walking speed) is 1 + cost. 


If you add distances, you add velocities. Actually that example is ridiculous, because 
the professor’s maximum speed equals the train speed (= 1). He is running like mad, 
not pacing. Occasionally he is standing still with respect to the ground. 

The sum rule is a special case of a bigger rule called “finearity.” It applies when 
we add or subtract functions and multiply them by constants—as in 3x — 4 sin x. By 
linearity the derivative is 3— 4.cos x. The rule works for all functions u(x) and v(x). 
A linear combination is y(x) = au(x) + bu(x), where a and b are any real numbers. 
Then Ay/Ax is 


au(x + Ax) + bo(x + Ax) — au(x)— bv(x) — _ u(x + Ax) — u(x) +h v(x + Ax) — v(x) 
Ax ~ Ax Ax : 
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The limit on the left is dy/dx. The limit on the right is a du/dx + b dv/dx. We are 
allowed to take limits separately and add. The result is what we hope for: 


Rule of Linearity 


dt _ a 
The derivative of au(x) + bv(x) is om (au + bv)=a a +b <. (2) 
The product rule comes next. It can’t be so simple—products are not linear. The 
sum rule is what you would have done anyway, but products give something new. 
The derivative of u times v is not du/dx times dv/dx. Example: The derivative of x* 
is 5x*. Don’t multiply the derivatives of x*> and x?. (3x? times 2x is not 5x*.) 
For a product of two functions, the derivative has two terms. 


Product Rule (the key to this section) 


The derivative of u(x)v(x) is cs (uv) =u—t+v—. (3) 
x x x 


EXAMPLE 2 u=x° times v = x? is uv = x°. The product rule leads to 5x*: 


dv du 
3 av au 
7 dx 


= x3(2x) + x2(3x7) = 2x* + 3x* = 5x4. 
EXAMPLE 3 In the slope of x sin x, I don’t write dx/dx = 1 but it’s there: 


d : : 
—=— (x sin x) = x cos x + sin x. 
dx 


EXAMPLE 4 If u= sin x and v=sin x then uv =sin? x. We get two equal terms: 
sin x La (sin x) + sin x # (sin x) = 2 sin x cos x 
dx dx , 


This confirms the “square rule” 2u du/dx, when u is the same as v. Similarly the slope 
of cos? x is —2 cos x sin x (minus sign from the slope of the cosine). 


Question Those answers for sin? x and cos? x have opposite signs, so the derivative 
of sin? x + cos? x is zero (sum rule). How do you see that more quickly? 


EXAMPLE5 The derivative of uvw is uvw' + uv'w + u’vw—one derivative at a time. 
The derivative of xxx is xx +.xx + xx. 


u(x + h) 
u Au pv _Av AD ay 
NL ee 
product 
be ee VO)! uve) 
sumutu Au+Av rar 
u(x) Au Au 


Fig. 2.43 Change in length = Au + Av. Change in area = u Av + v Au + Au Av. 
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After those examples we prove the product rule. Figure 2.13 explains it best. The 
area of the big rectangle is uv. The important changes in area are the two strips u Av 
and v Au. The corner area Au Av is much smaller. When we divide by Ax, the strips 
give u Av/Ax and v Au/Ax. The corner gives Au Av/Ax, which approaches zero. 

Notice how the sum rule is in one dimension and the product rule is in two 
dimensions. The rule for www would be in three dimensions. 

The extra area comes from the whole top strip plus the side strip. By algebra, 


u(x + h)v(x + h) — u(xc)o(x) = u(x + h) L(x + A) — v(x) + v(x) [u(x +h) — u(x) ]. (4) 


This increase is u(x + h)Av + v(x)Au—top plus side. Now divide by h (or Ax) and let 
h > 0. The left side of equation (4) becomes the derivative of u(x)v(x). The right side 
becomes u(x) times dv/dx—-we can multiply the two limits—plus v(x) times du/dx. 
That proves the product rule—definitely useful. 

We could go immediately to the quotient rule for u(x)/v(x). But start with u= 1. 
The derivative of 1/x is —1/x? (known). What is the derivative of 1/v(x)? 


Reciprocal Rule 
The derivative of at is aie (5) 
v(x) v 


The proof starts with (v)\(1/v) = 1. The derivative of 1 is 0. Apply the product rule: 


d {1 1 dv d fl — dvjdx 
—(-—]+-—=0 sothat —|{-]= 6 
” dx (;) v dx dx (5) v? (6) 
It is worth checking the units—in the reciprocal rule and others. A test of dimen- 
sions is automatic in science and engineering, and a good idea in mathematics. The 


test ignores constants and plus or minus signs, but it prevents bad errors. If v is in 
dollars and x is in hours, dv/dx is in dollars per hour. Then dimensions agree: 


d f1)\_ (1/dollars) faa ice ~dvjdx _ dollars/hour 
dx\v/~ hour 2 (dollars)? 


From this test, the derivative of 1/v cannot be 1/(dv/dx). A similar test shows that 
Einstein’s formula e = mc? is dimensionally possible. The theory of relativity might 
be correct! Both sides have the dimension of (mass)(distance)*/(time)’, when mass 
is converted to energy.t 


EXAMPLE 6 The derivatives of x~1, x77, x7" are —1x7*, ~2x73, —nx7"7}. 


Those come from the reciprocal rule with v = x and x? and any x": 


The beautiful thing is that this answer —nx~"~! fits into the same pattern as x”. 
Multiply by the exponent and reduce it by one. 


For negative and positive exponents the derivative of x" is nx"~’. (7) 


. 


*But only Einstein knew that the constant is 1. 
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Reci i 1-30? ae Au 
; ecproca’ y+Av 0 v(v + Av) i 
Rent u+Au_u_vAu—udv 
Av v Quopen vt+Av v v(v + Av) Av v 


Fig. 2.44 Reciprocal rule from (— Av)/v?. Quotient rule from (v Au —u Av)/v?. 


1 1 +sin x —COS x 
and —— are ——3z—_ and —~ 5. 
cos x sin x cos* x sin* x 


EXAMPLE 7 The derivatives of 


Those come directly from the reciprocal rule. In trigonometry, 1/cos x is the secant 
of the angle x, and 1/sin x is the cosecant of x. Now we have their derivatives: 


d sin x 1 sinx 

—— (sec x) = —5— = —— — =sec x tan x. (8) 
dx cos*x cos x cos x 

d cos x 1 cosx 

a (SOX) Ee OX OFX, (9) 
dx sin? x sin x sin x 


Those formulas are often seen in calculus. If you have a good memory they are worth 
storing. Like most mathematicians, I have to check them every time before using 
them (maybe once a year). It is really the rules that are basic, not the formulas. 


The next rule applies to the quotient u(x)/v(x). That is u times 1/v. Combining the 
product rule and reciprocal rule gives something new and important: 
Quotient Rule 


The derivative of at is Ce u ee — 2 dujdx ae du/dx ; 
v(x) v dx v 7 


You must memorize that last formula. The v? is familiar. The rest is new, but not very 
new. If v=1 the result is du/dx (of course). For u=1 we have the reciprocal 
rule. Figure 2.14b shows the difference (u + Au)/(v + Av) — (u/v). The denominator 
v(v + Av) is responsible for v?. 


EXAMPLE 8 (only practice) If u/v = x°/x> (which is x”) the quotient rule gives 2x: 
d (5) _ x3(5x4) — x9(3x?) 5x7 — 3x7 _ 


BOS yg 
dx \ x? xé x® 


EXAMPLE 9 (important) For u=sin x and v=cos x, the quotient is sin x/cos x = 
tan x. The derivative of tan x is sec? x. Use the quotient rule and cos? x + sin? x = 1: 


d /sinx cos x(cos x)— sin x(— sin x 1 
ae (= =) = cos x(cos x) — sin x(—sin x) = SS = sec? x. (11) 


cos? x cos? x 
Again to memorize: (tan x)’ = sec? x. At x =0, this slope is 1. The graphs of sin x 
and x and tan x all start with this slope (then they separate). At x= 2/2 the sine 
curve is flat (cos x = 0) and the tangent curve is vertical (sec? x = 00). 
The slope generally blows up faster than the function. We divide by cos x, once 
for the tangent and twice for its slope. The slope of 1/x is —1/x?. The slope is more 
sensitive than the function, because of the square in the denominator. 


EXAMPLE 410 


dx\ x 2 


d f(sinx\ x cos x—sin x 
; 


2.5 The Product and Quotient and Power Rules 75 


That one I hesitate to touch at x = 0. Formally it becomes 0/0. In reality it is more 
like 07/02, and the true derivative is zero. Figure 2.10 showed graphically that (sin x)/x 
is flat at the center point. The function is even (symmetric across the y axis) so its 
derivative can only be zero. 


This section is full of rules, and I hope you will allow one more. It goes beyond x” 
to (u(x))". A power of x changes to a power of u(x)—as in (sin x)® or (tan x)’ or 
(x? + 1)®. The derivative contains nu"~! (copying nx"~'), but there is an extra factor 
du/dx. Watch that factor in 6(sin x)° cos x and 7(tan x)® sec? x and 8(x? + 1)7(2x): 


Power Rule 


n—-1 du 


ge (12) 


The derivative of [ ue) |” is n| u(x) | 


For n= 1 this reduces to du/dx = du/dx. For n=2 we get the square rule 2u du/dx. 
Next comes u3. The best approach is to use mathematical induction, which goes from 
each n to the next power n+ 1 by the product rule: 


et)= < ty) = ve +u (me 2) =(n+ pw. 
That is exactly equation (12) for the power n + 1. We get all positive powers this way, 
going up from n = 1— then the negative powers come from the reciprocal rule. 

Figure 2.15 shows the power rule for n=1,2,3. The cube makes the point 
best. The three thin slabs are u by u by Au. The change in volume is essentially 
3u? Au. From multiplying out (u+Au)?, the exact change in volume is 
3u? Au + 3u(Au)? + (Au)>—which also accounts for three narrow boxes and a midget 
cube in the corner. This is the binomial formula in a picture. 


(Av)? 
1 cube 


u(Au)* 
3 bricks 


u2 Au 
3 slabs 


Au u Au 
Fig. 2.45 Length change = Au; area change = 2u Au; volume change ~ 3u? Au. 


EXAMPLE 44 (sin x)" = n(sin x)"~! cos x. The extra factor cos x is du/dx. 


Our last step finally escapes from a very undesirable restriction—that n must be 
a whole number. We want to allow fractional powers n= p/q, and keep the same 
formula. The derivative of x" is still nx"~'. 

To deal with square roots I can write (/x)? = x. Its derivative is 2./x(./x)' =1. 
Therefore (./xy! is 1/2,/x, which fits the formula when n= 3. Now try n= p/q: 
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Fractional powers Write u = x?!* as u4 = x”. Take derivatives, assuming they exist: 


d 
qui} x =px?-* (power rule on both sides) 
—_—=— {| (cancel x? with u‘) 


ot at! (replace p/q by n and u by x") 


EXAMPLE 12 The slope of x!/ is 3x~ 7/3. The slope is infinite at x = 0 and zero at 
x= oo. But the curve in Figure 2.16 keeps climbing. It doesn’t stay below an 
“asymptote.” 


slope 1/3 


infinite slope 
1/8 1 
Fig. 2.16 Infinite slope of x” versus zero slope: the difference between 0<n<1 and n> tl. 


EXAMPLE 43 The slope of x*/? is $x'/3. The slope is zero at x =0 and infinite at 
x = 00. The graph climbs faster than a line and slower than a parabola ( is between 
1 and 2). Its slope follows the cube root curve (times 4). 


WE STOP NOW! I am sorry there were so many rules. A computer can memorize 
them all, but it doesn’t know what they mean and you do. Together with the chain 
rule that dominates Chapter 4, they achieve virtually all the derivatives ever computed 
by mankind. We list them in one place for convenience. 


Rule of Linearity (au + bv)! = au' + be’ 
Product Rule (uv = uv’ + ov! 


Reciprocal Rule (1/v) = — v'/v? 
Quotient Rule (u/v) = (vu! — uv’)/v? 
Power Rule (uy = nu" ha! 


The power rule applies when n is negative, or a fraction, or any real number. The 
derivative of x* is mx*~ 1, according to Chapter 6. The derivative of (sin x)* is 
And the derivatives of all six trigonometric functions are now established: 


(sin x! = cos x (tan x!) = sec? x (sec x) = sec x tan x 


(cos x)’ = — sin x (cot x)’ =—csc?x (csc x)’ = — esc x cot x. 
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2.5 EXERCISES 


Read-through questions 


The derivatives of sinx cosx and t/cosx and sin x/cos x 
and tan>x come from the __a__rule,__b rule, ¢ rule, 
and _qd__ rule. The product of sinx times cosx has 
(uv) =uv'+_e =_ ¢t _. The derivative of I/vis_9 , 
so the slope of secx is __h__. The derivative of u/vis_1 
so the slope of tanx is |. The derivative of tan*x is 

k__. The slope of x"is__| and the slope of (u(x))" is 

m_. With n=—1 the derivative of (cosx)"! is _n_ , 
which agrees with the rule for sec x. 


Even simpler is the rule of _o , which applies to 
au(x) + bv(x). The derivative is___p__. The slope of 3 sin x + 
4 cosx is _@_. The derivative of (3 sinx+4cos x)? is 

__. The derivative of _s _ is 4 sin? x cos x. 

Find the derivatives of the functions in 1-26. 

1 (x+ 1)(x-1) 2 (x? + 1)? - 1) 
a + cae 4 ee + =i 
1+x 1+sinx 1+x?  1—sinx 
5 (x —1)(x —2)(x—3) 6 (x — 1)*(x — 2) 


7 x? cos x + 2x sin x 8 x'/2(x+sin x) 


3 


x?+1  sinx 
x?—1 °° cosx 


x341  cosx 


9 : 
x+1  sinx 


10 


11 x1/? sin? x + (sin x)!/2 12 x3/? sin? x + (sin x)3/? 


14 Jx(/x + Ni/x +2) 


16 (x —6)'° + sin! x 


13 x*cos x +x cos* x 

15 4x? sin x—x cos x+sin x 

17 sec? x — tan? x 18 csc? x — cot? x 
4 4 

” ee (x— 578 +2 a8 (5- x)?/3 


21 (sin x cos x)? 
23 u(x)v(x)w(x)z(x) 


1 1 
tanx cotx 


sin x —cos x 
sin x + cos x 


22 x cos x csc x 


24 [u(x)]? [o(x) 7? 


+sin 2x 


26 x sin x +cos x 


27 A growing box has length t, width 1/(1 +1), and height 
COS ft. 

(a) What is the rate of change of the volume? 

(b) What is the rate of change of the surface area? 


28 With two applications of the product rule show that the 
derivative of uvw is uvw’ + uv’ w + u'vw. When a box with sides 
u, v, w grows by Au, Av, Aw, three slabs are added with volume 
uv Aw and and 


29 Find the velocity if the distance is f(t)= 


500 + 100,/t — 10 for t > 10. 


5t? for t < 10, 


i : 3/2 P t 
30 A cylinder has radius r= lap and height h = ae 


(a) What is the rate of change of its volume? 
(b) What is the rate of change of its surface area (including 
top and base)? 
31 The height of a model rocket is f(t) = t3/(1 +t). 
(a) What is the velocity v(t)? 
(b) What is the acceleration dv/dt? 
32 Apply the product rule to u(x)u?(x) to find the power rule 
for u3(x). 
33 Find the second derivative of the product u(x)v(x). Find 
the third derivative. Test your formulas on u=v =x. 
34 Find functions y(x) whose derivatives are 
(a) x? (b) 1/x?_ ©) ~(1— x)? 
35 Find the distances f(t), starting from f(0)=0, to match 
these velocities: 
(a) v(t)=cos t sin t 
(c) o(t}=/1+t 
36 Apply the quotient rule to (u(x))3/(u(x))? and —v'/v?. 
The latter gives the second derivative of 


(d) cos? x sin x. 


(b) v(t)=tan t sec? ¢ 


37 Draw a figure like 2.13 to explain the square rule. 

38 Give an example where u(x)/v(x) is increasing but du/dx = 

dujdx = 1. 

39 True or false, with a good reason: 
(a) The derivative of x?" is 2nx?"~ 
(b) By linearity the derivative of a(x)u(x) + b(x)v(x) is 
a(x) du/dx-+ b(x) dv/dx. 

(c) The derivative of |x|? is 3|x|?. 
(d) tan? x and sec? x have the same derivative. 
(e) (uv)' =u'v' is true when u(x) = 1. 


1 


40 The cost of u shares of stock at v dollars per share is uv 
dollars. Check dimensions of d(uv)/dt and u dv/dt and v du/dt. 


41 If u(x)/v(x) is a ratio of polynomials of degree n, what are 
the degrees for its derivative? 


42 For y = 5x + 3, is (dy/dx)? the same as d? y/dx?? 


43 If you change from f(t)=tcost to its tangent line at 
t = 1/2, find the two-part function df /dt. 


44 Explain in your own words why the derivative of u(x)v(x) 
has two terms. 


45 A plane starts its descent from height y=h at x=—L 
to land at (0,0). Choose a,b,c,d so its landing path 
y=ax? + bx? +cx +d is smooth. With dx/dt = V = constant, 
find dy/dt and d?y/dt? at x=0 and x=—L. (To keep 
d*y/dt? small, a coast-to-coast plane starts down L> 100 
miles from the airport.) 
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ee =—26 lini: 


You have seen enough limits to be ready for a definition. It is true that we have 
survived this far without one, and we could continue. But this seems a reasonable 
time to define limits more carefully. The goal is to achieve rigor without rigor mortis. 

First you should know that limits of Ay/Ax are by no means the only limits in 
mathematics. Here are five completely different examples. They involve n — 00, not 
Ax — 0: 


1. a, = (n— 3)(n+ 3) (for large n, ignore the 3’s and find a, > 1) 
2. a,=4a,-,+4 (start with any a, and always a, — 8) 

3. a, = probability of living to year n (unfortunately a, — 0) 

4. a, = fraction of zeros among the first n digits of x (a, > 75?) 


5. a, =.4, a, =.49, a, = .493, .... No matter what the remaining decimals are, the 
a’s converge to a limit. Possibly a, — .493000 ..., but not likely. 


The problem is to say what the limit symbol -— really means. 

A good starting point is to ask about convergence to zero. When does a sequence 
of positive numbers approach zero? What does it mean to write a, > 0? The numbers 
Q,, 42,43, ..., must become “small,” but that is too vague. We will propose four 
definitions of convergence to zero, and I hope the right one will be clear. 


1. All the numbers a, are below 107 1°. That may be enough for practical purposes, 
but it certainly doesn’t make the a, approach zero. 

2. The sequence is getting closer to zero—each a,,, is smaller than the preceding 
a,- This test is met by 1.1, 1.01, 1.001, ... which converges to 1 instead of 0. 

3. For any small number you think of, at least one of the a,’s is smaller. That pushes 
something toward zero, but not necessarily the whole sequence. The condition would 
be satisfied by 1,4, 1,4, 1,4, ..., which does not approach zero. 

4. For any small number you think of, the a,’s eventually go below that number and 
stay below. This is the correct definition. 


I want to repeat that. To test for convergence to zero, start with a small number— 
say 10-'°. The a,’s must go below that number. They may come back up and go 
below again—the first million terms make absolutely no difference. Neither do the 
next billion, but eventually all terms must go below 10~!°. After waiting longer 
(possibly a lot longer), all terms drop below 10~?°. The tail end of the sequence 
decides everything. 


Question 4 Does the sequence 107, 1077, 107°, 1075, 107°, 108, ... approach 0? 
Answer Yes. These up and down numbers eventually stay below any «. 


a, <eifn>3 a,<eifn>6 non-convergence 


Fig. 2.47 Convergence means: Only a finite number of a’s are outside any strip around L. 


+€ by: 


2.6 Limits 


Question 2 Does 10~*, 107°, 10~*, 1078, 10~*, 107 1°, ... approach zero? 
Answer No. This sequence goes below 10~* but does not stay below. 


There is a recognized symbol for ‘an arbitrarily small positive number.” By 
worldwide agreement, it is the Greek letter ¢ (epsilon). Convergence to zero means 
that the sequence eventually goes below « and stays there. The smaller the ¢, the tougher 
the test and the longer we wait. Think of ¢ as the tolerance, and keep reducing it. 

To emphasize that e comes from outside, Socrates can choose it. Whatever ¢ he 
proposes, the a’s must eventually be smaller. After some ay, all the a’s are below the 
tolerance s. Here is the exact statement: 


for any « there is an N such that a,<¢éifn>N. 


Once you see that idea, the rest is easy. Figure 2.17 has N = 3 and then N = 6. 


EXAMPLE 4 The sequence }, $,#, ... starts upward but goes to zero. Notice that 
1, 4,9, ..., 100, ... are squares, and 2, 4, 8, ..., 1024, ... are powers of 2. Eventually 2” 
grows faster than n?, as in a,9 = 100/1024. The ratio goes below any «. 


EXAMPLE 2 1,0,4,0,4,0,... approaches zero. These a’s do not decrease steadily 
(the mathematical word for steadily is “‘monotonically”’) but still their limit is zero. 
The choice e = 1/1000 produces the right response: Beyond az 99, all terms are below 
1/1000. So N = 2001 for that ¢. 


The sequence 1, 5,4, 4, 4, 4, ... is much slower—but it also converges to zero. 

Next we allow the numbers a, to be negative as well as positive. They can converge 
upward toward zero, or they can come in from both sides. The test still requires the 
a, to go inside any strip near zero (and stay there). But now the strip starts at —«. 

The distance from zero is the absolute value |a,|. Therefore a, — 0 means |a,|—> 0. 
The previous test can be applied to |a,|: 


for any « there is an N such that |a,| << if n> N. 


EXAMPLE3 1, —4,3, — 4, ... converges to zero because 1,4,4,4, ... converges to zero. 


It is a short step to limits other than zero. The limit is L if the numbers a, — L 
converge to zero. Our final test applies to the absolute value |a, — L|: 


for any ¢ there is an N such that |a,— L|<<eifn>N. 


This is the definition of convergence! Only a finite number of a’s are outside any strip 
around L (Figure 2.18). We write a, — L or lim a, = L or lim,.,,, a, = L. 


Fig. 2.48 a, -» 0in Example 3; a, > 1 in Example 4; a, + oo in Example 5 (but a, , — a, -» 0). 
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EXAMPLE 4 The numbers 3, 3, Z, ... converge to L=1. After subtracting 1 the 
differences 4, 4,4, ... converge to zero. Those difference are |a, — L]. 


EXAMPLES The sequence 1,1+4,1+4+4,1+4+4+4+4, ... fails to converge. 


The distance between terms is getting smaller. But those numbers a,, az, a3, a4, ... ZO 
past any proposed limit L. The second term is 14. The fourth term adds on 4+ 4, 
SO a4 goes past 2. The eighth term has four new fractions 4+34+4+4, totaling 
more thang +}$+4+4=4. Therefore ag exceeds 24. Eight more terms will add more 
than 8 times +, so a,¢ is beyond 3. The lines in Figure 2.18c are infinitely long, not 
stopping at any L. 

In the language of Chapter 10, the harmonic series 1 +4+4+ --: does not converge. 
The sum is infinite, because the “‘partial sums” a, go beyond every limit L (aso9q is 
past L=9). We will come back to infinite series, but this example makes a subtle 
point: The steps between the a, can go to zero while still a, > 00. 


Thus the condition a,,— a, 0 is not sufficient for convergence. However this 
condition is necessary. If we do have convergence, then a,., — 4, > 0. That is a good 
exercise in the logic of convergence, emphasizing the difference between “sufficient” 
and “necessary.” We discuss this logic below, after proving that [statement A] implies 
[statement B]: 


if (a, converges to L] then [a, 4,1, — a, converges to zero]. (1) 


Proof Because the a, converge, there is a number N beyond which |a, — L| <« and 
also Ja, , — L|<«. Since a,,, —a, is the sum of a,,, —L and L—a,, its absolute 
value cannot exceed ¢ + ¢ = 2s. Therefore a,,, — a, approaches zero. 

Objection by Socrates: We only got below 2¢ and he asked for ¢. Our reply: If he 
particularly wants |a,., —4,|< 1/10, we start with e = 1/20. Then 2¢ = 1/10. But this 
juggling is not necessary. To stay below 2s is just as convincing as to stay below «. 


THE LOGIC OF “IF” AND “ONLY IF” 


The following page is inserted to help with the language of mathematics. In ordinary 
language we might say “I will come if you call.” Or we might say “I will come only 
if you call.” That is different! A mathematician might even say “I will come if and 
only if you call.” Our goal is to think through the logic, because it is important and 
not so familiar. 

Statement A above implies statement B. Statement A is a, > L; statement B is 
@,+1— 4,7 0. Mathematics has at least five ways of writing down A= B, and I 
though you might like to see them together. It seems excessive to have so many 
expressions for the same idea, but authors get desperate for a little variety. Here are 
the five ways that come to mind: 


A=>B 
A implies B 
if A then B 
A is a sufficient condition for B 


B is true if A is true 


+Logical thinking is much more important than ¢ and 6. 


2.6 Limits 


EXAMPLES /f [positive numbers are decreasing] then [they converge to a limit]. 
If [sequences a, and b, converge] then [the sequence a, + b, converges]. 
If (f(x) is the integral of v(x)] then [v(x) is the derivative of f(x)]. 


Those are all true, but not proved. A is the hypothesis, B is the conclusion. 

Now we go in the other direction. (It is called the “converse,” not the inverse.) We 
exchange A and B. Of course stating the converse does not make it true! B might 
imply A, or it might not. In the first two examples the converse was false——the a, 
can converge without decreasing, and a,+b, can converge when the separate 
sequences do not. The converse of the third statement is true—and there are five 
more ways to state it: 

A+B 
A is implied by B 
if B then A 
A is a necessary condition for B 
B is true only if A is true 


Those words “necessary” and “sufficient” are not always easy to master. The same 
is true of the deceptively short phrase “‘if and only if.” The two statements A> B and 
A<=B are completely different and they both require proof. That means two separate 
proofs. But they can be stated together for convenience (when both are true): 


AB 
A implies B and B implies A 
A is equivalent to B 
A is a necessary and sufficient condition for B 


A is true éf and only if B is true 
EXAMPLES [a,->L] = [24,>2L] = [a,+1>7L+1] = [a,-—L-0]. 
RULES FOR LIMITS 


Calculus needs a definition of limits, to define dy/dx. That derivative contains two 
limits: Ax + 0 and Ay/Ax > dy/dx. Calculus also needs rules for limits, to prove the 
sum rule and product rule for derivatives. We started on the definition, and now we 
start on the rules. 

Given two convergent sequences, a, > Land b, + M, other sequences also converge: 


Addition: a,+b,-L+M — Subtraction: a,—b,-L—M 
Multiplication: a,b, > LM Division: a,/b,— L/M (provided M #0) 

We check the multiplication rule, which uses a convenient identity: 
a,b, — LM = (a, — L)(b, — M) + M(a, ~ L) + L(b, — M). (2) 


Suppose |a, — L| < e beyond some point N, and |b, — M| < beyond some other point 
N’. Then beyond the larger of N and N’, the right side of (2) is small. It is less than 
eve + Me+ Le. This proves that (2) gives a,b, ~ LM. 

An important special case is ca, cL. (The sequence of b’s is c, c, c, c, ....) Thus a 
constant can be brought “outside” the limit, to give lim ca, = c lim a,. 
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THE LIMIT OF f(x) AS x >a 


The final step is to replace sequences by functions. Instead of a,,a,,... there is a 
continuum of values f(x). The limit is taken as x approaches a specified point a 
(instead of n— oo). Example: As x approaches a=0, the function f(x) =4— x? 
approaches L= 4. As x approaches a = 2, the function 5x approaches L= 10. Those 
statements are fairly obvious, but we have to say what they mean. Somehow it must 
be this: 


if x is close to a then f(x) is close to L. 


If x —a is small, then f(x) — L should be small. As before, the word small does not 
say everything. We really mean “‘arbitrarily small,” or “below any ¢.” The difference 
f(x)— L must become as small as anyone wants, when x gets near a. In that case 
lim,..., f(x) = L. Or we write f(x) > L as x > a. 

The statement is awkward because it involves two limits. The limit x — a is forcing 
f(x) > L. (Previously n — co forced a, > L.) But it is wrong to expect the same ¢ in 
both limits. We do not and cannot require that |x — a] <e produces |f(x)— L|<. 
It may be necessary to push x extremely close to a (closer than e). We must guarantee 
that if x is close enough to a, then | f(x) — L| <e. 


We have come to the “‘epsilon-delta definition’’ of limits. First, Socrates chooses s. 
He has to be shown that f(x) is within ¢ of L, for every x near a. Then somebody 
else (maybe Plato) replies with a number 6. That gives the meaning of “‘near a.” 
Plato’s goal is to get f(x) within e of L, by keeping x within 6 ofa: 


if 0<|x-—al<d then ([f(x)—L\<e. (3) 


The input tolerance is 6 (delta), the output tolerance is ¢. When Plato can find a 6 
for every e, Socrates concedes that the limit is L. 


EXAMPLE Prove that lim 5x=10. In this case a=2 and L=10. 


Socrates asks for |5x — 10] <. Plato responds by requiring |x — 2| < 6. What 6 should 
he choose? In this case |5x — 10] is exactly 5 times |x — 2]. So Plato picks 6 below «/5 
(a smaller 6 is always OK). Whenever |x — 2| < /5, multiplication by 5 shows that 
|5x — 10] <«. 


Remark 1 In Figure 2.19, Socrates chooses the height of the box. It extends above 
and below L, by the small number ¢. Second, Plato chooses the width. He must make 
the box narrow enough for the graph to go out the sides. Then |f(x)— L|<e. 


fm Lasx-a limit Z is not f(a) f(s) = step function 
@ SE 
no limit L 
asx—a 
x x 
a-5 at+8 a 


Fig. 2.49 S chooses height 2, then P chooses width 26. Graph must go out the sides. 


2.6 Limits 


When f(x) has a jump, the box can’t hold it. A step function has no limit as x 
approaches the jump, because the graph goes through the top or bottom—no matter 
how thin the box. 


Remark 2 The second figure has f(x)— L, because in taking limits we ignore the 
final point x = a. The value f(a) can be anything, with no effect on L. The first figure 
has more: f(a) equals L. Then a special name applies—f is continuous. The left figure 
shows a continuous function, the other figures do not. 

We soon come back to continuous functions. 


Remark 3 In the example with f= 5x and 6 = é/5, the number 5 was the slope. That 
choice barely kept the graph in the box— it goes out the corners. A little narrower, 
say 6=e/10, and the graph goes safely out the sides. A reasonable choice is 
to divide ¢ by 2|f'(a)|. (We double the slope for safety.) I want to say why this 6 
works—even if the e—d test is seldom used in practice. 

The ratio of f(x)— L to x — a is distance up over distance across. This is Af/Ax, 
close to the slope f’(a). When the distance across is 6, the distance up or down is 
near 6|f'(a)|. That equals ¢/2 for our “reasonable choice” of 6—so we are safely 
below ¢. This choice solves most exercises. But Example 7 shows that a limit might 
exist even when the slope is infinite. 


EXAMPLE 7 lim, J/x—1=0 (a one-sided limit). 


Notice the plus sign in the symbol x + 1+. The number x approaches a= 1 only from 
above. An ordinary limit x — 1 requires us to accept x on both sides of 1 (the exact 
value x = | is not considered). Since negative numbers are not allowed by the square 
root, we have a one-sided limit. It is L=0. 

Suppose ¢ is 1/10. Then the response could be 6 = 1/100. A number below 1/100 
has a square root below 1/10. In this case the box must be made extremely 
narrow, 6 much smaller than ¢, because the square root starts with infinite slope. 


Those examples show the point of the e—6 definition. (Given ¢, look for 6. This 
came from Cauchy in France, not Socrates in Greece.) We also see its bad feature: 
The test is not convenient. Mathematicians do not go around proposing e’s and 
replying with 6’s. We may live a strange life, but not that strange. 

It is easier to establish once and for all that 5x approaches its obvious limit 5a. 
The same is true for other familiar functions: x"-+a" and sinx—sina and 
(1 —x)~1->(1—.a)"!—except at a=1. The correct limit L comes by substituting 
x = a into the function. This is exactly the property of a “continuous function.” Before 
the section on continuous functions, we prove the Squeeze Theorem using « and 6. 


Proof g(x) is squeezed between f(x) and h(x). After subtracting L, g(x) — Lis between 
F(x) — L and h(x) — L. Therefore 
lg(x)-Ll<e if |f(x)-—Ll<e and |h(x)—L)<e. 


For any se, the last two inequalities hold in some region 0 < |x — al <6. So the first 
one also holds. This proves that g(x) > L. Values at x = a are not involved—until 
we get to continuous functions. 
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2.6 EXERCISES 


Read-through questions 


The limit of a, = (sin n)/n is a. The limit of a, =n*/2" is 
b_. The limit of a, =(—1)"is__¢ _. The meaning ofa, > 0 

is; Only__d___ of the numbers |a,|can be__e__. The meaning 

of a, L is: For every _f there is an __g _ such that 
h__ifn>__i__. The sequence 1, 1 +4,1+444, ... isnot 
i__ because eventually those sums go past __k 


The limit of f(x)=sinx as x->ais |. The limit of 
f(x) =x/|x| as x -+—2 is _m_, but the limit as x — 0 does 


not _n_. This function only has __o _-sided limits. The 
meaning of lim,., f(x) = L is: For every ¢ there is a 6 such 
that | f(x) — L| <¢ whenever __p 


Two rules for limits, when a,2L and b,-M, are 
a, +b,>__@__ and a,b, __t__. The corresponding rules 
for functions, when f(x) L and g(x) > M as x4, are 

sand _+__. In all limits, ja, — L] or |f(x)— L| must 
eventually go below and __u__any positive __v 


A= B means that Aisa __w__ condition for B. Then B is 
true__x  Aistrue. 4<>Bmeansthat Aisa__y___ condition 
for B. Then Bistrue _z__ A is true. 


1 What is a, and what is the limit L? After which N is 
Ja, — L| < 75? (Calculator allowed) 
(a) —1, +h adi... 
(G) Se Bi ae 1 
(e) a,=*/n (f) a, = 
(g) $4+1,04+5)77,0449. ... 


2 Show by example that these statements are false: 
(a) Ifa, > L and b, + L then a,/b, > 1 
(b) a, > L if and only if a? > L? 
(c) Ifa,<0 and a,—- L then L<0 


(d) If infinitely many a,’s are inside every strip around 
zero then a, > 0. 


3 Which of these statements are equivalent to B > A? 
(a) If A is true so is B 
(b) A is true if and only if B is true 
(c) Bis a sufficient condition for A 
(d) A is a necessary condition for B. 


4 Decide whether A > B or B= A or neither or both: 
(a) A=[a,>1] B=[-a,>—-1] 
(b}) A=[a,-0] B=[a,—a,-, 70] 
(c) A=[a,<n] B=[a,=n] 
(d) A=[a,>0] B=[sina,->0] 
(ec) A=[a,->0] B=[l/a, fails to converge] 
(f) A=[a,<n] B=[a,/n converges] 


*5 If the sequence a,, a2, a3, ... approaches zero, prove that 
we can put those numbers in any order and the new sequence 
still approaches zero. 


*6 Suppose f(x) > L and g(x) > M as x + a. Prove from the 
definitions that f(x)+9(x) > L+M asx a. 


Find the limits 7—24 if they exist. An c—6 test is not required. 


. t+3 7 
7 lim ai 8 lim | == 
172 t*—2 92 t— 
h)— 
9 lim LOA TO) (careful) 10 lim fs ? AC 
x0 h AO 
anes 2 
11 lim sin noe h 1 eae 
h>0 h x70 «6Ssin x 
|x |x| 


x70 


13, lim, FT one: -sided) 14 lim — (one-sided) 
x0" X 


15 lim ~ 16 lim LO-F@ 
x7 x eva c—a 
74.25 2S 
17 lim oe 18 lim ait 
x75 X— xs) 


sn 


—1 
(test x = .01 
x72 /6 +X 


21 lim [f(x) —f(aI@) 


22 im, (sec x — tan x) 
xn! 


sin x sin us — 1) 
23 lim ~ 24 lim —~—— 
x0 sin x/2 xo 


x? 1 

25 Choose 6 so that | f(x)| <zd5 if0<x <6. 

f(ix=10x fae f(x)=sin 2x 

26 Which does the definition of a limit require? 
)Uf@)—-Lh<e => 0<|x—-al<6 

(2) f(x) -Lh<e = 0<|x-al<d 

(3) [f(x)—L]<e = 0<|x—al<d 

27 The definition of “f(x)-— L as x > x” is this: For any 


é there is an X such that <e if x>X. Give an 
example in which f(x) >4 asx > x. 


f(x) =x sin x 


” 


28 Give a correct definition of “f(x) > 0 as x >— 2x, 


29 The limit of f(x) =(sin x)/x as x > % is . For 
é=.01 find a point X beyond which | f(x)}| <«. 


30 The limit of f(x)=2x/(1 +x) as x >a is L=2. For 
é=.01 find a point X beyond which | f(x) — 2} <e. 


31 The limit of f(x 
why not. 


)=sin x asx > x does not exist. Explain 
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32 (Calculator) Estimate the limit of ( + :) as X > 00. 


33 For the polynomial f(x) = 2x — 5x” + 7x3 find 
(a) lim f(x) (b) lim f(x) 


(c) jim fo (d) im © 
34 For f(x) = 6x? + 1000x find 

(a) tim £2) (>) tim 2) 

(©) tim io (4) tim a 


Important rule As x > 00 the ratio of polynomials f(x)/g(x) 
has the same limit as the ratio of their leading terms. f(x) = 
x3—x-+2 has leading term x? and g(x)=5x°+x+1 has 
leading term 5x°. Therefore f(x)/g(x) behaves like x3/5x® — 0, 
2(x)/ f(x) behaves like 5x°/x3 > 00, (f(x))?/g(x) behaves like 
x9/5x® = 1/5. 


35 Find the limit as x > oo if it exists: 


3x? + 2x+1 x4 x? + 1000 
342x+x7 x34+x* x3— 1000 


— L|<e, why is it OK to 


x sin — 

x 
36 Ifa particular 6 achieves | f(x) 
choose a smaller 6? 


37 The sum of ft+r47?4+-- +r 1 is a,=(1—r)(l —7). 
What is the limit of a, as n > 00? For which r does the limit 
exist? 


2.7. Continuous Functions 


38 If a,— L prove that there is a number N with this prop- 
erty: Ifn>N and m> N then |a, — a,,|<2e. This is Cauchy’s 
test for convergence. 


39 No matter what decimals come later, a; =.4, a, =.49, 
a3 = .493, ... approaches a limit L. How do we know (when 
we can’t know L)? Cauchy's test is passed: the a’s get closer 
to each other. 

(a) From a, onwards we have |a, — a,,| < 

(b) After which ay is |a,,—a,|< 10772 


40 Choose decimals in Problem 39 so the limit is L= .494. 
Choose decimals so that your professor can’t find L. 


41 If every decimal in .abcde--- is picked at random from 
0, 1, ..., 9, what is the “average” limit L? 


42 If every decimal is 0 or 1 (at random), what is the average 
limit L? 


43 Suppose a,=4a,-,+4 and start from a, = 10. Find a, 
and a; and a connection between a, — 8 and a,-, — 8. Deduce 
that a, > 8. 


44 “For every 6 there is an « such that | f(x)| <e if |x| <6.” 
That test is twisted around. Find ¢ when f(x) =cos x, which 
does not converge to zero. 


45 Prove the Squeeze Theorem for sequences, using e: If 
a,— Land c, > L and a, <b, <c, for n> N, then b, > L. 


46 Explain in 110 words the difference between “we will get 
there if you hurry” and “we will get there only if you hurry” 
and “we will get there if and only if you hurry.” 


This will be a brief section. It was originally included with limits, but the combination 
was too long. We are still concerned with the limit of f(x) as x > a, but a new number 


is involved. That number is f(a), the value of f at x = a. For a “limit,” 


x approached 


a but never reached it—so f(a) was ignored. For a “continuous function,” this final 


number f(a) must be right. 


May I summarize the usual (good) situation as x approaches a? 


1. The number f(a) exists 
2. The limit of f(x) exists 
3. The limit L equals f(a) 


(f is defined at a) 
(it was called L) 
(f(a) is the right value) 


In such a case, f(x) is continuous at x = a. These requirements are often written in a 
single line: f(x) + f(a) as x > a. By way of contrast, start with four functions that are 


not continuous at x = 0. 
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_ 1 
ff) =sin 5 


-1 
Fig. 2.20 Four types of discontinuity (others are possible) at x = 0. 


In Figure 2.20, the first function would be continuous if it had f(0) = 0. But it has 
f(0) = 1. After changing (0) to the right value, the problem is gone. The discontinuity 
is removable. Examples 2, 3, 4 are more important and more serious. There is no 
“correct” value for f(0): 


2. f(x)=step function (jump from 0 to 1 at x =0) 
3. f(x)=1/x? (infinite limit as x — 0) 
4. f(x)=sin(1/x) (infinite oscillation as x > 0). 


The graphs show how the limit fails to exist. The step function has a jump discontinu- 
ity. It has one-sided limits, from the left and right. It does not have an ordinary (two- 
sided) limit. The limit from the left (x + 07) is 0. The limit from the right (x > 0*) 
is 1. Another step function is x/|x|, which jumps from —1 to 1. 

In the graph of 1/x?, the only reasonable limit is L= + oo. I cannot go on record 
as saying that this limit exists. Officially, it doesn’t—but we often write it anyway: 
1/x? + co as x +0. This means that 1/x? goes (and stays) above every L as x > 0. 

In the same unofficial way we write one-sided limits for f(x) = 1/x: 


From the left, lim ae oo. From the right, lim os + 00. (1) 
x70- X x70t X 


Remark 1/x has a “pole” at x=0. So has 1/x? (a double pole). The function 
1/(x? — x) has poles at x = 0 and x = 1. In each case the denominator goes to zero 
and the function goes to + 00 or — oo. Similarly 1/sin x has a pole at every multiple 
of x (where sin x is zero). Except for 1/x these poles are “‘simple””—the functions are 
completely smooth at x = 0 when we multiply them by x: 


(x) (*) =1 and (x) (s+) = = and (x) (=) are continuous at x= 0. 
1/x? has a double pole, since it needs multiplication by x? (not just x). A ratio of 
polynomials P(x)/Q(x) has poles where Q=0, provided any common factors like 
(x + 1)/(x + 1) are removed first. 

Jumps and poles are the most basic discontinuities, but others can occur. The 
fourth graph shows that sin(1/x) has no limit as x > 0. This function does not blow 
up; the sine never exceeds 1. At x =4 and } and yoo it equals sin 3 and sin 4 and 
sin 1000. Those numbers are positive and negative and (?). As x gets small and 1/x 
gets large, the sine oscillates faster and faster. Its graph won’t stay in a small box of 
height e, no matter how narrow the box. 


CONTINUOUS FUNCTIONS 


DEFINITION ff is “continuous at x =a’ if f(a) is defined and f(x) >f(a) as x > a. 
If f is continuous at every point where it is defined, it is a continuous function. 


2.7 Continuous Functions 


Objection The definition makes f(x) = 1/x a continuous function! It is not defined 
at x = 0, so its continuity can’t fail. The logic requires us to accept this, but we don’t 
have to like it. Certainly there is no f(0) that would make 1/x continuous at x = 0. 


It is amazing but true that the definition of “‘continuous function”’ is still debated 
(Mathematics Teacher, May 1989). You see the reason-—we speak about a discontinu- 
ity of 1/x, and at the same time call it a continuous function. The definition misses 
the difference between 1/x and (sin x)/x. The function f(x) = (sin x)/x can be made 
continuous at all x. Just set f(0) = 1. 

We call a function ‘‘continuable’’ if its definition can be extended to all x in a way 
that makes it continuous. Thus (sin x)/x and ./x are continuable. The functions 1/x 
and tan x are not continuable. This suggestion may not end the debate, but I hope 
it is helpful. 


EXAMPLE 1 sin x and cos x and all polynomials P(x) are continuous functions. 
EXAMPLE 2 The absolute value |x| is continuous. Its slope jumps (not continuable). 
EXAMPLE 3 Any rational function P(x)/Q(x) is continuous except where Q = 0. 


EXAMPLE 4 The function that jumps between 1 at fractions and 0 at non-fractions 
is discontinuous everywhere. There is a fraction between every pair of non-fractions 
and vice versa. (Somehow there are many more non-fractions.) 


EXAMPLES The function 0*’ is zero for every x, except that 0° is not defined. So 
define it as zero and this function is continuous. But see the next paragraph where 
0° has to be 1. 


We could fill the book with proofs of continuity, but usually the situation is clear. 
“A function is continuous if you can draw its graph without lifting up your pen.” 
At a jump, or an infinite limit, or an infinite oscillation, there is no way across the 
discontinuity except to start again on the other side. The function x” is continuous 
for n> 0. It is not continuable for n < 0. The function x° equals 1 for every x, except 
that 0° is not defined. This time continuity requires 0° = 1. 

The interesting examples are the close ones—we have seen two of them: 


EXAMPLE 6 —* and al are both continuable at x = 0. 


Those were crucial for the slope of sin x. The first approaches 1 and the second 
approaches 0. Strictly speaking we must give these functions the correct values 
(1 and 0) at the limiting point x = 0—which of course we do. 

It is important to know what happens when the denominators change to x7. 


EXAMPLE 7 — blows up but aaa has the limit ; atx=0. 


Since (sin x)/x approaches 1, dividing by another x gives a function like 1/x. There 
is a simple pole. It is an example of 0/0, in which the zero from x? is reached more 
quickly than the zero from sin x. The “race to zero” produces almost all interesting 
problems about limits. 
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For 1 — cos x and x? the race is almost even. Their ratio is 1 to 2: 


1-cosx  1-—cos?x _ sin?x 1 


2 ~ "3 ay has i 
x x*(1+ cos x) x 1+cosx 1+1 


asx 0. 


This answer 3 will be found again (more easily) by “I’H6pital’s rule.”’ Here I emphasize 
not the answer but the problem. A central question of differential calculus is to know 


how fast the limit is approached. The speed of approach is exactly the information in 
the derivative. 


These three examples are all continuous at x =0. The race is controlled by the 
slope—because f(x) —f(0) is nearly f’(0) times x: 


derivative of sin x is 1 <> sin x decreases like x 
derivative of sin?x is 0 © sin’ x decreases faster than x 


derivative of x! is co =«+ x"/3 decreases more slowly than x. 
DIFFERENTIABLE FUNCTIONS 


The absolute value |x| is continuous at x = 0 but has no derivative. The same is true 
for x13. Asking for a derivative is more than asking for continuity. The reason is 
fundamental, and carries us back to the key definitions: 


Continuous atx: f(x + Ax)—f(x) 70 as Ax 30 


flx + Ax) — fos) 
Ax 


Derivative at x: 


>f'(x) as Ax > 0. 


In the first case, Af goes to zero (maybe slowly). In the second case, Af goes to zero 
as fast as Ax (because Af/Ax has a limit). That requirement is stronger: 


21 At a point where f(x) has a derivative, the function must be continuous. 


But f(x) can be continuous with no derivative. 


Proof The limit of Af=(Ax)(Af/Ax) is (O)(df/dx)=0. So f(x+ Ax) —f(x) > 0. 


The continuous function x‘? has no derivative at x = 0, because 4x” */? blows up. 
The absolute value |x| has no derivative because its slope jumps. The remarkable 
function 4.cos 3x +4cos 9x + -:- is continuous at all points and has a derivative at 
no points. You can draw its graph without lifting your pen (but not easily—it turns 
at every point). To most people, it belongs with space-filling curves and unmeasurable 
areas—in a box of curiosities. Fractals used to go into the same box! They are 
beautiful shapes, with boundaries that have no tangents. The theory of fractals is 
very alive, for good mathematical reasons, and we touch on it in Section 3.7. 

I hope you have a clear idea of these basic definitions of calculus: 


1 Limit (n> x or x—a) 2 Continuity (at x =a) 3 Derivative (at x = a). 


Those go back to ¢ and 6, but it is seldom necessary to follow them so far. In the 
same way that economics describes many transactions, or history describes many 
events, a function comes from many values f(x). A few points may be special, like 
market crashes or wars or discontinuities. At other points df/dx is the best guide to 
the function. 


2.7 Continuous Functions 


This chapter ends with two essential facts about @ continuous function on a closed 
interval. The interval is a<x <b, written simply as [a, b].f At the endpoints a and 
b we require f(x) to approach f(a) and f(b). 


Extreme Value Property A continuous function on the finite interval [a, b] has a 
maximum value M and a minimum value m. There are points x,,,, and Xin in [a, 5] 
where it reaches those values: 


f@max)=M > f(x) > f(%min)=m for all x in [a, 5]. 


Intermediate Value Property If the number F is between f(a) and f(b), there is a 
point c between a and b where f(c)= F. Thus if F is between the minimum m and 
the maximum M, there is a point c between x,;, and X,,, where f(c) = F. 


Examples show why we require closed intervals and continuous functions. For 
0<x<1 the function f(x)=x never reaches its minimum (zero). If we close the 
interval by defining f(0) = 3 (discontinuous) the minimum is still not reached. Because 
of the jump, the intermediate value F = 2 is also not reached. The idea of continuity 
was inescapable, after Cauchy defined the idea of a limit. 


2.7 EXERCISES 

Read-through questions x<0 cx x<l1 
Continuity requires the 9 of f(x) to exist as x > a and 3 fe)= {= x20 4 flx)= . x21 
to agree with ©. The reason that x/|x| is not continuous 
atx=Ois_¢_. This function does have _4 ___ limits. The c+x x<0 x? xe 
reason that 1/cos x is discontinuous at_e _is__{ . The 5 f(x)= ea 2 6 f(xX)= 
reason that cos(1/x) is discontinuous at x=0 is _9 x20 Set ee 
The function f(x)=__h _ has a simple pole at x = 3, where 
f7hasa_l pole. 7 f= ee id 8 sy=\* a 

The power x" is continuous at all x provided nis__j__. It x+1 xe Damen 
has no derivative atx =Owhennis_k_. f(x)=sin(—x)/x 0 2 
approaches__! as x0, so this isa __m__ function pro- 9 f(x) oe xyxr x# 10 f(x)= A acy ey 
vided we define f(0)=__"_. A “continuous function” must x=0 1 x>e 
be continuous at all _o | A “continuable function” can be 
extended to every point x so that _ P x#4 c x<0 

ee : : Il f(x)= 12 f(x)= 

If f has a derivative at x =a then f is necessarily 9 at }x x . x x20 
x=a. The derivative controls the speed at which f(x) 
approaches __' __. Ona closed interval [a, b], a continuous 24 
fhasthe__s value property andthe __t value property. x# na x#e 
It reaches its. u Mandits_v___m, and it takes on every 13 f(x)= 14 f(xy=4 x6 
value __W 2e =x=c 
In Problems 1-20, find the numbers c that make (x) into 
(A) a continuous function and (B) a differentiable function. 15 f(x)= — xx x#0 16 f(x) = (is x<e 
In one case f(x) — f(a) at every point, in the other case Af/Ax 7 S Doge Se see 
has a limit at every point. 

sinx x<1l cos? x 
i r=} ; fo} x40 3 pun foreee x)ix x#0 aie teaa 
c xel c x= 0 x=0 


+The interval [a, b] is closed (endpoints included). The interval (a, b) is open (a and b left out). 


The infinite interval [0, 00) contains all x > 0. 
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(sin x —x)/x° x40 


20 fix) =|x? +c? 
‘ er fix) =| | 


19 f(x)= 
Construct your own (x) with these discontinuities at x = 1. 
21 Removable discontinuity 

22 Infinite oscillation 

23 Limit for x > 1*, no limit for x > 17 

24 A double pole 

25 lim f(x)=4+ lim, f(x) 

26 lim f(x) = 00 but lim (x — 1) f(x) =0 

27 lim (x — I f(x)=5 


28 The statement “3x — 7 as x > 1” is false. Choose an « for 
which no 6 can be found. The statement ‘3x — 3 as x > 1” is 
true. For e=4 choose a suitable 6. 


29 How many derivatives f’, f”, ... are continuable 


functions? 
(a) f=x3?? (b) f=x?? sin x (c) f=(sin x)5? 


30 Find one-sided limits at points where there is no two- 
sided limit. Give a 3-part formula for function (c). 


|x| 


(a) x 


31 Let f(1)=1 and f(—1)=1 and f(x) =(x? —x)/(x? — 1) 
otherwise. Decide whether f is continuous at 

(a) x=1- (b) x=0 

*32 Let f(x) =x? sin 1/x for x #0 and f(0)=0. If the limits 

exist, find 

(a) lim f(x) (c) lim f'(). 
33 If f(0)=0 and f’(0)=3, rank these functions from 
smallest to largest as x decreases to zero: 

f(x), x, xf), f()+2x, 2F)—x), (£0). 


34 Create a discontinuous function f(x) for which f?(x) is 
continuous. 


d 
(b) sin |x| Oa |x? -1| 


(c) x=—-1. 


(b) df/dx atx =0 


35 True or false, with an example to illustrate: 
(a) If f(x) is continuous at all x, it has a maximum 
value M. 


(b) If f(x) <7 for all x, then f reaches its maximum. 
(c) If f(1)=1 and f(2)= —2, then somewhere f(x) =0. 
(d) If f(1)=1 and f(2)=—2 and f is continuous on 
[1, 2], then somewhere on that interval f(x) =0. 

36 The functions cos x and 2x are continuous. Show from 


the property that cos x = 2x at some point between 
0 and 1. 


37 Show by example that these statements are false: 
(a) If a function reaches its maximum and minimum then 
the function is continuous. 
(b) If f(x) reaches its maximum and minimum and all 
values between (0) and f(1), it is continuous at x = 0. 


(c) (mostly for instructors) If f(x) has the intermediate 
value property between all points a and b, it must be 
continuous. 


38 Explain with words and a graph why f(x) =x sin (1/x) is 
continuous but has no derivative at x =0. Set (0) =0. 


39 Which of these functions are continuable, and why? 


sin I/x x<0 


filx)= 


cos I/x x>1l 


eos. : =x? 4.0” 
fas) = 5 When sinx 40 fa(x)=x°+0 


40 Explain the difference between a continuous function and 
a continuable function. Are continuous functions always con- 
tinuable? 


*41 f(x) is any continuous function with f(0) =/(1). 
(a) Draw a typical f(x). Mark where f(x) =f(x + 4). 
(b) Explain why g(x) =f(x +4) —f(x) has g(4) = — g(0). 
(c) Deduce from (b) that (a) is always possible: There must 
be a point where 2(x) =0 and f(x) =f(x +4). 
42 Create an f(x) that is continuous only at x =0. 
43 If f(x) is continuous and 0 <f(x) <1 for all x, then there 
is a point where f(x*) =x*. Explain with a graph and prove 
with the intermediate value theorem. 


44 In the e—6 definition of a limit, change 0 <|x —a| <6 to 
|x —a| <6. Why is f(x) now continuous at x =a? 


45 A function has a at x=0 if and only if 
(f(x) —f(0))/x is ___ at x = 0. 
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CHAPTER 3 


Applications of the Derivative 


Chapter 2 concentrated on computing derivatives. This chapter concentrates on using 
them. Our computations produced dy/dx for functions built from x” and sin x and 
cos x. Knowing the slope, and if necessary also the second derivative, we can answer 
the questions about y = f(x) that this subject was created for: 


1. How does y change when x changes? 
2. What is the maximum value of y? Or the minimum? 
3. How can you tell a maximum from a minimum, using derivatives? 


The information in dy/dx is entirely focal. It tells what is happening close to the point 
and nowhere else. In Chapter 2, Ax and Ay went to zero. Now we want to get them 
back. The local information explains the larger picture, because Ay is approximately 
dy/dx times Ax. 


The problem is to connect the finite to the infinitesimal—the average slope to the 
instantaneous slope. Those slopes are close, and occasionally they are equal. Points 
of equality are assured by the Mean Value Theorem—which is the local-global 
connection at the center of diflerential calculus. But we cannot predict where dy/dx 
equals Ay/Ax. Therefore we now find other ways to recover a function from its 
derivatives—or to estimate distance from velocity and acceleration. 

It may seem surprising that we learn about y from dy/dx. All our work has been 
going the other way! We struggled with y to squeeze out dy/dx. Now we use dy/dx 
to study yp. That’s life. Perhaps it really is life, to understand one generation from 
later generations. 


Re 83.14 Linear Approximation HE 


The book started with a straight line {= vt. The distance is linear when the velocity 
is constant. As soon as v begins to change, f= vt falls apart. Which velocity do we 
choose, when u(t} is not constant? The solution is to take very short time intervals, 91 
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in which v is nearly constant: 
f=vt is completely false 
Af=vAt is nearly true 
df= vdt is exactly true. 


For a brief moment the function f(t) is tinear—and stays near its tangent line. 

In Section 2.3 we found the tangent line to y = f(x). At x = a, the slope of the curve 
and the slope of the line are f’(a). For points on the line, start at y = f(a). Add the 
slope times the “increment” x — a: 


Y = fla) + f"{a}(x — a). (i) 


We write a capital Y for the line and a small y for the curve. The whole point of 
tangents is that they are close (provided we don’t move too far from a): 


ye Yor f(x) ® fla)+ faye — a). (2) 


That is the all-purpose finear approximation. Figure 3.1 shows the square root 
function y= Swe its tangent line at x=a= 100. At the point y=./100= 10, 
the slope is 1/2./x = 1/20. The table beside the figure compares p(x) with Y(x). 


{00 200 300 400 
Hg. 3.4 ¥{x) is the linear approximation to /x near x =a= 100. 


The accuracy gets worse as x departs from 100. The tangent line leaves the curve. 
The arrow points to a good approximation at 102, and at 101 it would be even better. 
In this example Y its larger than y—the straight line is above the curve. The slope of 
the line stays constant, and the slope of the curve is decreasing. Such a curve will 
soon be called “concave downward,” and its tangent lines are above it. 

Look again at x = 102, where the approximation is good. In Chapter 2, when we 
were approaching dy/dx, we started with Ap/Ax: 


_ /102— ,/100 


u slope = 102-100 (3) 
Now that is turned around! The slope is 1/20, What we don’t know is ./102: 
(102 = ./100 + (slope)(102 — 100). (4) 


You work with what you have. Earlier we didn’t know dy/dx, so we used (3). Now 
we are experts at dy/dx, and we use (4). After computing y'= 1/20 once and for 
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all, the tangent line stays near /x for every number near 100. When that nearby 
number is 100 + Ax, notice the error as the approximation is squared: 


1.¥ 1 
+— = 100 + Ax + —{Ax)*. 
( /i00 an) 100 + Ax (Ax) 


The desired answer is 100 + Ax, and we are off by the last term involving (Ax)?. The 
whole point of linear approximation is to ignore every term after Ax. 

There is nothing magic about x = 100, except that it has a nice square root. Other 
points and other functions allow y~ ¥. J would like to express this same idea in 
different symbois. Instead of starting from a and going to x, we start from x and go a 
distance Ax to x + Ax. The letters are different but the mathematics is identical. 


3A At any point x, and for any smooth function y= f(x), 


Sixt Ax) - fd 
Ax , 


For the approximation to f(x + Ax), multiply both sides by Ax and add f(x): 
fxc+ Ax) & fix) + (lope at x)(Ax). 6 


slope atx = 


(5) 


EXAMPLE 4 An important linear approximation: {1 + x)" ~1+nx for x near zero. 


EXAMPLE 2. 4 second important approximation: 1/(1 + x)’ ~ 1 —nx for x near zero. 


Discussion Those are really the same. By changing n to —n in Example 1, it becomes 
Example 2, These are linear approximations using the slopes n and ~—n at x =0: 


(1+x)" = 1+(slope at zero) times (x — 0)= 1+ nx. 
Here is the same thing with f(x})= x". The basepoint in equation (6) is now 1} or x: 
(1+ Axf ~ 1+ nAx (x + Ax)" x x" + nx" Ax, 
Better than that, here are numbers. For n= 3 and —1 and 100, take Ax = .0!: 


rT 1 109 
3 ~ ———— ———— a~ 
(1.01) 1.03 99 ( + a) m2 


Actually that last number is no good. The 100th power is too much. Linear approxi- 

mation gives 1 + 100Ax = 2, but a calculator gives {1.01)'°° = 2.7.... This is close to 

e, the all-important number in Chapter 6. The binomial formula shows why the 

approximation failed: 

(100)(99) 
{2)(1) 


Linear approximation forgets the (Ax)? term. For Ax = 1/100 that error is nearly 4. 
It is too big to overlook. The exact error is $(Ax)*f"(c), where the Mean Value 
Theorem in Section 3.8 places ¢ between x and x + Ax. You already see the point: 


(1 + Ax)!°° = 1+ 100 Ax + (Ax)? +. 


y — Y is of order (Ax)?. Linear approximation, quadratic error. 


DIFFERENTIALS 


There is one more notation for this linear approximation. It has to be presented, 
because it is often used. The notation is suggestive and confusing at the same time— 
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it keeps the same symbols dx and dy that appear in the derivative. Earlier we took 
great pains to emphasize that dy/dx is not an ordinary fraction.t Until this paragraph, 
dx and dy have had no independent meaning. Now they become separate variables, 
like x and y but with their own names. These quantities dx and dy are called 
differentials. 

The symbols dx and dy measure changes along the tangent line. They do for the 
approximation Y(x) exactly what Ax and Ay did for }{x). Thus dx and Ax both 
measure distance across. 

Figure 3.2 has Ax = dx. But the change in y does not equal the change in Y. One 
is Ay (exact for the function). The other is dy (exact for the tangent line). The 
differential dy is equal to AY, the change along the tangent line. Where Ay is the true 
change, dy is its linear approximation (dy/dx)dx. 

You often see dy written as f’(x)dx. 


ytAy yx) 
y + dy Y(x 
d Ay = change tm » (along curve} 
) dx | | dy = change in Y (along tangent) 
Ax Fig. 3.2 The linear approximation to Ay is 
ot +— 
X=@Q xX+dx=n+Ax dy =f'(x)dx. 


EXAMPLE 3 y= x” has dy/dx = 2x so dy= 2x dx. The table has basepoint x = 2. 
The prediction dy diflers from the true Ay by exactly {Ax)? = .01 and .04 and .09. 


dx dy Ax Ay 
y=x? 1 04 1 041 Ay = (2+ Ax)? — 2? 
dy = 4dx 2 08 2 0,84 Ay = 4Ax + (Ax)? 
3 12 3 1,29 


The differential dy = f'(x)dx is consistent with the derivative dy/dx =/‘(x). We 
finally have dy =(dy/dx)dx, but this is not as obvious as it seems! It looks like 
cancellation—it is really a definition. Entirely new symbols could be used, but dx 
and dy have two advantages: They suggest small steps and they satisfy dy =f’(x}dx. 
Here are three examples and three rules: 


d(x") = nx"~ dx d(f + g)=adf+ dg 
d(sin x)= cos x dx d(cf) = cdf 
d(tan x)= sec?x dx d( fg}=fdg+gdf 


Science and engineering and virtuaily all applications of mathematics depend on 
linear approximation. The true function is “fnearized,” using its slope v: 


Increasing the time by At increases the distance by = vAt 
Increasing the force by Af increases the deflection by ~ vAf 


Increasing the production by Ap increases its value by ~ vAp. 


tFraction or not, it is absolutely forbidden to cancel the d’s. 
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The goal of dynamics or statics or economics is to predict this multiplier »—the 
derivative that equals the slope of the tangent line. The multiplier gives a local 
prediction of the change in the function. The exact law is nonlinear—but Ohm’s law 
and Hooke’s law and Newton’s law are linear approximations. 


ABSOLUTE CHANGE, RELATIVE CHANGE, PERCENTAGE CHANGE 


The change Ay or Af can be measured in three ways. So can Ax: 


Absolute change Af Ax 
A A 
Relative change re = 
A 
Percentage change D x 100 = x 100 


Relative change is often more realistic than absolute change. If we know the distance 
to the moon within three miles, that is more impressive than knowing our own height 
within one inch. Absolutely, one inch is closer than three miles. Relatively, three miles 
is much closer: 
3 miles 1 inch 
ee 001% < 1.4%. 
300,000 miles ~ Finches % “Ol <!.4% 
EXAMPLE 4 The radius of the Earth is within 80 miles of r = 4000 miles. 
(a) Find the variation dV in the volume V = }xr’, using linear approximation. 
(b) Compute the relative variations dr/r and dV/V and AV/V. 


Solution The job of calculus is to produce the derivative. After dV/dr = 4nr’, its 


work is done. The variation in volume is dV = 472(4000)*(80) cubic miles. A 2% 
relative variation in r gives a 6% relative variation in V: 


OE ie ene dV _ 4n(4000)*(80) 
r 4000 ~” Ve. 4x(4000)3/3 


Without calculus we need the exact volume at r = 4000 + 80 (also at r = 3920): 
AV _ 4x(4080)7/3 — 4n(4000)7/3 
V 41(4000)3/3 


One comment on dV = 4nr?dr, This is (area of sphere) times (change in radius). It is 
the volume of a thin shell around the sphere. The shell is added when the radius 
grows by dr. The exact AV/V is 3917312/640000%, but calculus just calls it 6%. 


= 6%. 


6.1% 


3.4 EXERCISES 


Read-through questions In terms of x and Ax, linear approximation is 
: ee a ee +Ax)xf(x)+__i. The error is of order {Ax} or 

On the graph, a linear approximation is given by the __a fix : ie ; : 

line. At x =a, the equation for that line is Y=f(a)+__b (x Sint with p=_i_. phe: cients) @y, e10ale 

Near x=a=10, the linear approximation to y= x? is Y= times the differential __1__. Those movements are along the 


. Atx=11 the exact valueis(1th =a. The 


approximation is Y=__@_. In this case Ay=__f and 
dy=__g__. If we know sin x, then to estimate sin(x + Ax) we 


m__ line, where Ay is along the __n 


Find the linear approximation Y to y = f(x) near x =a: 


1 fix)=x4x4,2=0 2 fi) = I/x, a= 2 
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ix) =tan x a= 72/4 4 fix) =sin x, a= 2/2 


3f 

5 fix) =x sin x, a= 20 6 fix)=sin?x, a=0 
Compute 7-12 within .0! by deciding on f{x), choosing the 
basepoint a, and evaluating f(a) + /(e)(x—a). A calculator 
shows the error. 


7 {2.001)" 8 sin(.02} 
9 cosi(.03) 10 (15.99)! 
I] 1;.98 12 sin(3.14) 


Calculate the numerical error in these linear approximations 
and compare with 3(Ax)?/"(x): 


13 (f.01P = 1 + 30.01) 
15 (sin .O1)? = 0+ 0.01) 


17 (L + qa)! = 2 


14 cos(.01) = 1+ 4.01) 
16 (1.01) 41 -3¢01) 
18 8.99 &3 +4(—.01) 


Contirm the approximations 19-21 by computing /(0): 


2 


pha a> “ae Ix . —— : 
2 ett x? eet -_ (use f(t) = fe? +4, then put w= x°) 
22 Write down the differentials df for f(x)=cos x and 
tx + Ustx — I) and (x? + 1). 


3.2. Maximum and Minimum Problems 


In 23-27 find the linear change dV in the volume or @A in the 
surface area. 


23 dV if the sides of a cube change from 10 to 10.1, 
24 dA if the sides of a cube change from x to x +dx. 
25 dA if the radius of a sphere changes by dr. 


26 d@V if a circular cylinder with r = 2 changes height from 3 
to 3.05 (recall V = xr*h). 


27 dV if a cylinder of height 3 changes from r = 2 to r = 1.9. 
Extra credit: What is dV ifr and h both change (dr and dhP? 


28 In relativity the mass is moi /] —{eie?? at velocity v. By 
Problem 20 this is near mtg + for smail v. Show that 
the kinetic energy jmv? and the change in mass satisfy 
Einstein’s equation ¢ = (Amjc?. 


29 Enter 1.1 on your calculator. Press the square root key 5 
times (slowlyl. What happens each time to the number after 
the decimal point? This is because ./1 +1 = 


30 In Problem 29 the numbers you see_are less than 1.05, 
1.025, .... The second derivative of (1 + x is so the 
linear approximation is higher than the curve. 


31 Enter 0.9 on your calculator and press the square root 
key 4 times. Predict what will appear the fifth time and press 
again, You now have the root of 0.9. How many 
decimals agree with 1» 34(0.1)? 


Our goal is to learn about f(x) from dfidx. We begin with two quick questions. 
If dfidx is positive, what docs that say about f? [f the slope is negative. how is that 
reflected in the function? Then the third question is the critical one: 


How do you identify 4 maximum or minimunt 


Normal answer: The slope ts zero. 


This may be the most important application of calculus, to reach dfdx = 0. 


Take the casy questions first. Suppose dfidx is positire for every x between a and 4, 
All tangent lines slope upward. The function f(x) is inereasing as x goes from a to b. 


To define increasing and decreasing. look at any two points x < X. “Increasing” 
requires f(x} < f(X ). “Decreasing” requires f(x) > f{(X). 4 positive slape does not mean 
a positive function, The function itself can be positive or negative. 


EXAMPLE 1 /(x}= x" — 2x has slope 2x — 2. This slope is positive when x > 1 and 
negative when x <1. The function increases after x = 1 and decreases before x = 1. 


3.2 Maximum and Minimum Problems 


f min 


1) (e- 2) (v- 3) (W-4) 


x2—2y 


Fig. 3.3. Slopes are — +. Slope is + — + — + so f is up-down-up-down-up. 


We say that without computing f(x) at any point! The parabola in Figure 3.3 goes 
down to its minimum at x = | and up again. 


EXAMPLE 2 x? — 2x + 5 has the same sfope. Its graph is shifted up by 5, a number 
that disappears in dfidx. All functions with slope 2x — 2 are parabolas x? — 2x +C, 
shifted up or down according to C. Some parabolas cross the x axis (those crossings 
are solutions to f{x) = 0). Other parabolas stay above the axis. The solutions to 
x? — 2x +5=0 are complex numbers and we don’t see them. The special parabola 
x? -—2x+1={x— 1) grazes the axis at x= 1. It has a “‘double zero.” where f(x) = 
dfidx = 0. 


EXAMPLE 3 Suppose dfidx =(x — L)(x — 2)(x — 3)(x — 4). This slope is positive 
beyond »=4 and up to x=1 (dfidx = 24 at x =0). And dfidx is positive again 
between 2 and 3. At x = 1,2. 3,4, this slope is zero and f(x) changes direction. 

Here f(x) is a fifth-degree polynomial. because f‘(x) is fourth-degree. The graph of 
f goes up-down-up-down-up. It might cross the x axis five times. /t must cross 
at least once (like this one). When complex numbers are allowed, every fifth-degree 
polynomial has five roots. 


You may feel that “positire slope implies increasing function” 1s obvious—perhaps 
itis. But there is still something delicate. Starting from df/dx > Oat every single point, 
we have to deduce f{ X) > f(x) at pairs of points. That is a “local to global” question, 
to be handled by the Mean Value Theorem. It could also wait for the Fundamental 
Theorem of Calculus: The difference {(X)— fix) equals the area under the graph of 
djidx. That area is positive. so {( X) exceeds /{x). 


MAXIMA AND MINIMA 


Which x makes f(x) as large as possible? Where is the smallest f(x)? Without calculus 
we are reduced to computing values of f(x) and comparing. With calculus, the infor- 
mation is in dfjdx. 

Suppose the maximum or minimum is at a particular point x. [t is possible that 
the graph has a corner—and no derivative. Bur if dfidx exists, it must be zerv. The 
tangent line is level. The parabolas in Figure 3.3 change from decreasing to increasing. 
The slope changes from negative to positive. At this crucial point the slope is zero. 
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3 Applications of the Derivative 


3C Local Maximum or Minimum Suppose the maximum or minimum 


occurs at a point x inside an interval where f(x) and dff/dx are defined. Then 


Sx) = 0. 


The word “/ocal” allows the possibility that in other intervals, f(x) goes higher or 
lower. We only look near x, and we use the definition of df/dx. 

Start with f(x + Ax) — f(x). If f(x) is the maximum, this difference is negative or 
zero, The step Ax can be forward or backward: 


x+Ax)- i 
if Ax > 0: Js x) f(x) = aSERWIS <0 and in the limit a <0. 
Ax positive dx 
. + Ax) - ti 
Paco: LEST. Mea 0) And inthe mit 0. 
Ax negative dx 


Both arguments apply. Both conclusions dfjdx <0 and df/dx > 0 are correct. Thus 
dfjdx = 0, 


Maybe Richard Feynman said it best. He showed his friends a plastic curve that 
was made in a special way — “no matter how you turn it, the tangent at the lowest 
point is horizontal.” They checked it out. It was true. 

Surely You're Joking, Mr. Feynman! is a good book (but rough on mathematicians). 


EXAMPLE 3 (continued) Look back at Figure 3.3b. The points that stand out 
are not the “‘ups” or “downs”? but the “turns.” Those are stationary points, where 
dfjdx = 0. We see two maxima and two minima. None of them are absolute maxima 
or minima, because f(x) starts at ~ «© and ends at + x. 


EXAMPLE 4 f(x) = 4x? ~ 3x* has slope 12x? — 12x. That derivative is zero when 
x? equals x*, at the two points x= 0 and x= 1. To decide between minimum and 
maximum (local or absolute), the first step is to evaluate f(x) at these stationary points. 
We find f(0) = 0 and f(1) = 1. 

Now look at large x. The function goes down to — & in both directions. (You can 
mentally substitute x = 1000 and x = — 1000). For large x, — 3x* dominates 4x?. 


Conclusion {= 1 is an absolute maximum, /=0 is not a maximum or minimum 
(local or absolute). We have to recognize this exceptional possibility, that a curve (or 
a car}can pause for an instant ({’ = 0) and continue in the same direction. The reason 
is the ‘‘double zero” in 12x? — 12x3, from its double factor x”. 


fa=4x7-3x4 


-3 rough point 


1/3 1 2 3 
Fig. 3.4 The graphs of 4x° — 3x* and x ~x7~'. Check rough points and endpoints. 


$.2. Maximum and Minimum Probiems 


EXAMPLE 5 Define f(x)=x-+x~' for x > 0. Its derivative 1 — 1/x? is zero at x = 1. 
At that point f(1)=2 is the minimum value. Every combination like $+ 3 or $+4 
is larger than f_;, = 2. Figure 3.4 shows that the maximum of x + x~! is + 00.t 


Important The maximum always occurs at a stationary point (where df/dx = 0) or a 
rough point (no derivative) or an endpoint of the domain. These are the three types 
of critical points. All maxima and minima occur at critical points! At every other 
point df/dx > 0 or df/dx < 0. Here is the procedure: 


L. Solve df/dx = 0 to find the stationary points f{x). 
2. Compute f(x) at every critical point—stationary point, rough point, endpoint. 
3. Take the maximum and minimum of those critical values of f(x). 


EXAMPLE 6 (Absolute value f(x} = |x|) The minimum is zero at a rough point. The 
maximum is at an endpoint. There are no stationary points. 

The derivative of y= |x| is never zero. Figure 3.4 shows the maximum and mini- 
mum on the interval [—3, 2]. This is typical of piecewise linear functions. 


Question Could the minimum be zero when the function never reaches f(x) = 0? 
Answer Yes, f(x) = 1/(1 + x)? approaches but never reaches zero as x > 00, 


Remark} x— +o and f{x)— + are avoided when f is continuous on a closed 
interval a < x <b. Then f(x) reaches its maximum and its minimum (Extreme Value 
Theorem). But x -» 00 and f(x)— oo are too important to rule out. You test x — oo 
by considering large x. You recognize f(x} > oo by going above every finite value. 


Remark 2 Note the difference between critical points (specified by x) and critical 
values (specified by f{x)). The example x + x~' had the minimum point x = 1 and the 
minimum value f{1) = 2. 


MAXIMUM AND MINIMUM IN APPLICATIONS 


To find a maximum or minimum, solve f‘(x) = 0. The slope is zero at the top and 
bottom of the graph. The idea is clear—and then check rough points and endpoints. 
But to be honest, that is not where the problem starts. 

In a real ‘application, the first step (often the hardest) is to choose the unknown 
and find the function. It is we ourselves who decide on x and f(x). The equation 
df/dx =0 comes in the middle of the problem, not at the beginning. E will start on 
a new example, with a question instead of a function. 


EXAMPLE 7 Where should you get onto an expressway for minimum driving time, 
if the expressway speed is 60 mph and ordinary driving speed is 30 mph? 


I know this problem well—it comes up every moming. The Mass Pike goes to MIT 
and I have to join it somewhere. There is an entrance near Route !28 and another 
entrance further in. I used to take the second one, now I take the first. Mathematics 
should decide which is faster—some mornings | think they are maxima. 

Most models are simplified, to focus on the key idea. We will allow the expressway 
to be entered at any point x (Figure 3.5). Instead of two entrances (a discrete problem) 


tA good word is approach when f(x) — 00. Infinity is not reached. But I still say “the maximum 
is ac.” 
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3 Applications of the Derivative 


we have a continuous choice (a calculus problem). The trip has two parts, at speeds 
30 and 60: 
a distance ,/a? + x? up to the expressway, in \/a* + x?/30 hours 


a distance b x on the expressway, in (b — x)/60 hours 


| a—=— 
Problem Minimize f(x) = total time = 30¥ atx At — x). 


3 


We have the function f(x). Now comes calculus. The first term uses the power rule: 
The derivative of u'? is $u7'*dujdx. Here u = a? + x? has dujdx = 2x: 


{1 1 
” —_ 2 2y-1i2 
Xy=malatt+x 2x)- —. ] 
I) 30 2! yoe""(2x) 60 (1) 
To solve f’(x) = 0, multiply by 60 and square both sides: 
(a7 + x7) '2(2x)= 1 gives 2x=(a7 + x7)? and 4x? =a? + x?, (2) 
Thus 3x?=a*. This yields two candidates, x= ai./3 and x= — a/./3. But a 
negative x would mean useless driving on the expressway. In fact f’ is not zero at 
x= —a/\/3. That false root entered when we squared 2x. 
MIT @ driving trme f(y) driving time fv} 
} when b> a/V 3. when bh < a/v 3 
bas 
enter 


freeway 


a 


ain 3 h 


Fig. 3.5 Join the freeway at x minimize the driving time f(x). 


é 


I notice something surprising. The stationary point x = a',/3 does not depend on 
b. The total time includes the constant 5/60, which disappeared in dfidx. Somehow 
h must enter the answer. and this is a warning to go carefuily. The minimum might 
occur at a rough point or an endpoint. Those are the other critical points of /, and 
our drawing may not be realistic, Certainly we expect x <b, or we are entering the 
expressway beyond MIT. 

Continue with calculus. Compute the driving time f(x) for an entrance at 
x*=ajV/3: 


=a fa t (yt Ube] = ts: 

sia eta ( +) co 7 60! 

The square root of 4a7/3 is 2ai/3. We coinbined 2/30 — 1/60 = 3:60 and divided 

by \/3. ds this stationary value {* a minimum!’ You must look also at endpoints: 
enter at x = 0: travel time is 30 + 6.60 = f** 


enter at x = A: travel time is. a? + A730 =f FFF, 


3.2 Maximum and Minimum Problems 


The comparison f* <f{** should be automatic. Entering at x =0 was a candidate 
and calculus didn’t choose it. The derivative is not zero at x = 0. Jt is not smart to 
go perpendicular to the expressway. 

The second comparison has x = 6. We drive directly to MIT at speed 30. This 
option has to be taken seriously. In fact it is optimal when 6 is smail or a is large. 

This choice x = b can arise mathematically in two ways. If all entrances are between 
0 and 6, then 6 is an endpoint. If we can enter beyond MIT, then 5 is a rough point. 
The graph in Figure 3.5c has a corner at x = 6, where the derivative jumps. The 
reason is that distance on the expressway is the absolute value |b — x|-—never negative. 

Either way x = b is a critical point. The optimal x ts the smailer of aj f3 and b, 


if a}./3 <b: stationary point wins, enter at x = aj./3, total time {* 
if a}./3 2 b: no stationary point, drive directly to MIT, time f*** 


The heart of this subject is in “word problems.” All the calculus is in a few lines, 
computing {’ and solving f‘(x) = 0. The formulation took longer. Step | usuatly does: 


1. Express the quantity to be minimized or maximized as a function f(x). 
The variable x has to be selected. 
2. Compute f(x), solve f(x) = 0, check critical points for fai, and finax- 


A picture of the problem {and the graph of {(x)) makes all the difference. 


EXAMPLE 7 (continued) Choose x as an angle instead of a distance. Figure 3.6 
shows the triangle with angle x and side a. The driving distance to the expressway is 
a sec x. The distance on the expressway is b-- a tan x. Dividing by the speeds 30 and 
60, the driving time has a nice form: 


asecx b-—atanx 


{x) = total ti = 
f{x) = total time 30 60 (3) 
The derivatives of sec x and tan x go into df/dx: 
i 
a = mi sec X tan x — es sec?x. (4) 
Now set df/dx = 0, divide by a, and muitiply by 30 cos?x: 
sin x =4. (5) 


This answer is beautiful. The angle x is 30°! That optimal angle (2/6 radians) has 
sin x = }. The triangle with side a and hypotenuse aj/3 is a 30 60 90 right triangle. 

I] don’t know whether you prefer (/a* + x? or trigonometry. The minimum is 
exactly as before—either at 30° or going directly to MIT. 


e ; Lill 12000 
. 
| EOF - 
& la tana i = energy a 
x 1 | P 
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| 
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Fig. 3.6 (4) Driving at angle x. (b) Energies of spring and mass. {(c) Profit = income -- cost. 
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3 Applications of the Derivative 


EXAMPLE 8 In mechanics, nature chooses minimum energy. A spring is pulled down 
by a mass, the energy is f(x), and df/dx = 0 gives equilibrium. It is a philosophical 
question why so many laws of physics involve minimum energy or minimum time— 
which makes the mathematics easy. 

The energy has two terms—for the spring and the mass. The spring energy is 
+kx*—positive in stretching (x > 0 is downward) and also positive in compression 
(x < 0). The potential energy of the mass is taken as — mx—decreasing as the mass 
goes down. The balance is at the minimum of f(x) = $kx? — mx. 

I apologize for giving you such a small probiem, but it makes a crucial point. 
When f(x) is quadratic, the equilibrium equation df/dx = 0 is linear. 


dfidx =kx ~m=0. 


Graphically, x = m/k is at the bottom of the parabola. Physically, kx = m is a balance 
of forces—the spring force against the weight. Hooke’s law for the spring force is 
elastic constant k times displacement x. 


EXAMPLE 9 Derivative of cost = marginal cost (our first management example), 


The paper to print x copies of this book might cost C = 1000+ 3x doliars. The 
derivative is dC/dx = 3. This is the marginal cost of paper for each additional book. 
If x increases by onc book, the cost C increases by $3. The marginal cost is like the 
velocity and the total cost is like the distance. 

Marginal cost is in dollars per book, Total cost is in dollars. On the plus side, the 
income is (x) and the marginal income is d//dx. To apply calculus, we overlook the 
restriction to whole numbers. 

Suppose the number of books increases by dx.t The cost goes up by {dC/dx) dx. 
The income goes up by (di/dx) dx. If we skip ail other costs, then profit P(x) = 
income I(x) — cost C(x). In most cases P increases to a maximum and falls back. 

At the high point on the profit curve, the marginal profit is zero: 


dP:dx=Q or dfidx=dCidx, (6) 
Profit is maximized when marginal income |’ equals marginal cost C’. 
This basic rule of economics comes directly from calculus, and we give an example: 


C(x) = cost of x advertisements = 900 + 400x — x? 
setup cost 900, print cost 400x, volume savings x? 


!(x) = income due to x advertisements = 600x — 6x? 
sales 600 per advertisement, subtract 6x? for diminishing returns 


optimal decision dC idx = dijdx or 400-—2x=600- 12x or x=20 
profit = income — cost = 9600 — 8500 = 1100. 


The next section shows how to verify that this profit is a maximum not a minimum. 
The first exercises ask you to solve dfjdx = 0. Later exercises also look for f(x). 


tMaybe dx is a differential calculus book. I apologize for that. 
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3.2 EXERCISES 


Read-through questions 


If df/dx > O in an interval then f(x) is . If a maximum 
or minimum occurs at x then f{x}=__b __. Points where 
f'(x) =0 are called points. The function fix) = 3x* — x 
has a(minimum})(maximum) atx =__d__. Astationary point 
that is not a maximum or minimum occurs for f(x} = __e 


Extreme values can also occur where __f _ is not defined 
or at the___g__ of the domain. The minima of |x| and 5x for 
—2<x<2 are atx=_ hand x=__i_, even though 
df/dx is not zero. x* is an absolute __ | when f{x*) > fix) 
for all x. A & ~=minimum occurs when f{x*) </(x) for 
all x near x*. 


The minimum of 4ax? — bx is atx=_ om 


Find the stationary points and rough points and endpoints. 
Decide whether each point is a local or absolute minimum nr 
maximum. 


I fix) =x? 44x45, -wex<m 
2 fx) =x ~ 12x, —o<x< 0 
3 fix) =x? +3, -1<x<4 

4 fix) =x? +(Q/x), 1x <4 

5 fix) =(x~x?), 
6 fd = f(x ~ x7), O< x < 1 

7 fix) = 3x* + 8x3 — 18x?, ~ao<x<a 

8 fix) = {x? —4x for O< x <1, x? —4 for 1 <x<2} 
9 fixy)=/x-14+/9-x, 1 <x <9 

10 f(x} =x4+sin x, O<x<2n 

11 fix) = 
12 fi) = 
13 f{x) 
14 fix) = distance from x 2 0 to nearest prime number 
15 f(x) = 
16 fix)=xJ/1—x?,0<x<1 
17 fix}=x'? — x37 Ona <4 


18 f{x}=sin x +cos x, O<x <2z 


—l<x<¢l 


x{1—x)°, -o<x<a 
x/(i +x}, O< x < 100 


= distance from x > 0 to nearest whole number 


Ix+1[+|x—1], -3¢x<2 


19 fixj=x4sin x,0<x<22 
20 {0} =cos76 sin 6, —n SO <2 
‘21 f{0)=4 sin 0—Jcos 6, 0<0<22 


22 fx) = {x? +1 for x <1, x? — 4x45 for x 2 1}. 


In applied problems, choose metric units if you prefer. 
23 The airlines accept a box if length + width + height = 
i+w+h< 62" or 158 cm. If # is fixed show that the maxi- 


mum volume (62—-w-h)wh is V=A(3l — 4h). Choose fA to 
maximize V. The box with greatest volume is a 


24 If a patient’s pulse measures 70, then 80, then 120, what 
least squares value minimizes (x —70}* +(x — 80)? + 
(x — 120)?? If the patient got nervous, assign 120 a lower 
weight and minimize (x — 70)? + (x ~ 80)? + Hx — 120)%. 
25 At speed p, a truck uses av + (b/v) gallons of fuel per mile. 
How many miles per gallon at speed vo? Minimize the fuel 
consumption, Maximize the number of miles per gallon. 
26 A limousine gets (120—2v)/S miles per gallon. The 
chauffeur costs $10/hour, the gas costs $1/gallon. 

(a) Find the cost per mile at speed v. 

(b) Find the cheapest driving speed. 
27 You should shoot a basketball at the angle 6 requiring 
minimum speed. Avoid line drives and rainbows. Shooting 
from (0,0) with the basket at (a,b), minimize /(@)= 
I (a sin 6 cos 8 — b cos? 4). 

(a) If b=0 you are level with the basket. Show that 

@ = 45° is best (Jabbar sky hook}. 

(b) Reduce df/d# = 0 to tan 28 = — a/b. Solve when a= bd. 

(c) Estimate the best angle for a free throw. 
The same angie allows the largest margin of error (Sports 
Science by Peter Brancazio). Section 12,2 gives the fiight path. 


28 On the longest and shortest days, in June and December, 
why does the length of day change the least? 


29 Find the shortest Y connecting P, Q, and B in the figure. 
Originally B_was a birdfeeder. The length of ¥ is L(x)= 
(b—x) +2 fa? + x*. 

(a} Choose x to minimize L (not allowing x > 6). 

(b) Show that the center of the ¥ has 120° angles. 

(c) The best ¥ becomes a V when a/b = : 


income 
R=3x-7? 


30 If the distance function is f{t)=(1 + 32)/(1 + 3¢?), when 
does the forward motion end? How far have you traveled? 
Extra credit: Graph f(z) and df/dt. 
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In 31-34, we make and sell x pizzas. The income is R(x) = 
ax + 5x? and the cost is C(x) = ¢ + dx + ex?. 


3 The profit is I(x) = . The average profit per 
pizza is = . The marginal profit per additional pizza 
is ¢diMidx = . We should maximize the 
(profit} (average profit) (marginal profit). 


32 We receive R(x) = ax + bx? when the price per pizza is 
Axy= . In reverse: When the price is p we sell x = 
pizzas (a function of p), We expect 6 <0 because 


33 Find x to maximize the profit IM(x). At that x the marginal 
profit is dIt/dx = 


34 Figure B shows R(x)=3x—x? and C,(x}=1+4x? and 
C2(x)=24+x7, With cost C,, which sales x makes a profit? 
Which x makes the most profit? With higher fixed cost in C>, 
the best plan is 


The cookie box and popcorn box were created by Kay Dundas 
from a 12” = 12” square. A box with no top is a calculus classic. 


6-% 


35 Choose x to find the maximum volume of the cookie box. 
36 Choose x to maximize the volume of the popcorn box. 


37 A high-class chocolate box adds a strip of width x down 
across the front of the cookie box. Find the new volume V(x) 
and the x that maximizes it. Extra credit: Show that V,,,, is 
reduced by more than 20%. 


38 For a box with no top, cut four squares of side x from the 
corners of the 12” square. Fold up the sides so the height is 
x. Maximize the volume. 


Geometry provides many problems, more applied than they 
seem. 


39 A wire four feet long is cut in two pieces. One piece forms 
a circle of radius r, the other forms a square of side x. Choose 
r to minimize the sum of their areas. Then choose r to 
maximize, 


3 Appiications of the Derivative 


40 A fixed wall makes one side of a rectangle. We have 200 
feet of fence for the other three sides. Maximize the area A in 
4 steps: 

1 Draw a picture of the situation. 

2 Select one unknown quantity as x (but not 4?). 

3 Find all other quantities in terms of x. 

4 Solve dA/dx =0 and check endpoints. 


41 With no fixed wall, the sides of the rectangle satisfy 
2x + 2y = 200. Maximize the area, Compare with the area of 
a circle using the same fencing. 


42 Add 200 meters of fence to an existing straight 100-meter 
fence, to make a rectangle of maximum area (invented by 
Professor Klee). 


43 How large a rectangle fits into the triangle with sides 
x=0, y=0, and x/4 + »/6=1? Find the point on this third 
side that maximizes the area xy. 


44 The largest rectangle in Problem 43 may not sit straight 
up. Put one side along x/4 + y/6=1 and maximize the area. 


45 The distance around the rectangle in Problem 43 is 
P=2x+2y, Substitute for y to find P(x). Which rectangle 
has Paya, = 12? 


46 Find the right circular cylinder of largest volume that fits 
in a sphere of radius 1. 


47 How large a cylinder fits in a cone that has base radius R 
and height H? For the cylinder, choose r and # on the sloping 
surface r/R + h/H = { to maximize the volume V = ar7h. 


48 The cylinder in Problem 47 has side area A =2zrh. 
Maximize A instead of V. 


49 Including top and bottom, the cylinder has area 
A = 2nrh + 2nr? = 2arH(l —(r/R}) + 2nr?. 
Maximize A when H > R. Maximize A when R> H. 
*50 A wall 8 feet high is | foot from a house. Find the length 


L of the shortest ladder over the wall to the house. Draw a 
triangle with height y, base [+ x, and hypotenuse L, 


51 Find the closed cylinder of volume V = ar7h = 167 that 
has the least surface area. 


$2 Draw a kite that has a triangle with sides I, }, 2x next to 
a triangle with sides 2x, 2, 2. Find the area 4 and the x that 
maximizes it. Hint: In dA/dx simplify \/1 — x? — x?/,\/1—x? 
to (1 —2x7)/,/1 — x?. 


In 53-56, x and }' are nonnegative aumbers with x + = 10. 
Maximize and minimize: 


53 xy 55 y—(1/x) 


57 Find the total distance /(x) from A to X to C. Show that 
dfidx = 0 leads to sin a = sin ¢. Light reflects at an equal angle 
to minimize travel time. 


54 x? 4 y? 56 sinx siny 
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reflection 


58 Fermat's principle says that light travels from A to B on 
the quickest path. Its velocity above the x axis is v and below 
the x axis is w. 


(a) Find the time T(x) from A to X to B. On AX, time = 
distance/velocity = ./r? + x?/v. 

(b) Find the equation for the minimizing x. 

(c) Deduce Sneli’s law (sin a)/v = (sin b)/w. 


“Closest point problems” are models for many applications. 
59 Where is the parabola y = x? closest to x =0, y= 2? 
60 Where is the line »y = 5 — 2x closest to (0, 0)? 


61 What point on y= —x? is closest to what point on 
y=5—2vx? At the nearest points, the graphs have the same 
slope. Sketch the graphs. 


62 Where is y=x* closest to (0,4)? Minimizing 
x7 +(y—4)? +y¥+(y—4? gives y <0. What went wrong? 


63 Draw the line y = mx passing near (2, 3), (1, 1), and {—E, 1). 
For a least squares fit, minimize 


(3 — 2m)? + (1 — mn)? + (1 +m). 


3.3. Second Derivatives: Bending and Acceleration 


64 A triangle has corners (—1, 1), (x, x”), and (3, 9) on the 
parabola y =x’. Find its maximum area for x between —[ 
and 3. Hint: The distance from (X,Y) to the line y= mx+6 
is ]¥ — mX — bl/./t +m’. 

65 Submannes are located at (2,0) and (1, 1). Choose the 


slope m so the line y = mx goes between the submarines but 
stays as far as possible from the nearest one. 


Problems 66—72 go back to the theory. 


66 To find where the graph of (x) has greatest ee solve 
. For y= 1/(t + x*) this point is 


67 When the difference between f(x} and g(x) is smallest, their 
slopes are . Show this point on the graphs of 
f=2+x? and g=2x —x?. 


68 Suppose y is fixed. The minimum of x? + xy — y? {a func- 
tion of x) is my) = . Find the maximum of m(y). 

Now x is fixed. The maximum of x? + xy — y? (a function 
of y) is M(x) = . Find the minimum of A(x). 


69 For each m the minimum value of f(x) — 
m, What is f(x)? 


mx occurs at x = 


70 y= x + 2x? sin{1/x) has slope 1 at x =0. But show that y 
is not increasing on an interval around x = 0, by finding points 
where dy/dx = | — 2 cos{1/x) + 4x sin(t/x) is negative. 


71 True or false, with a reason: Between two local minima of 
a smooth function f{x} there is a local maximum. 


72 Create a function yx) that has its maximum at a rough 
point and its minimum at an endpoint. 


73 Draw a circular pool with a lifeguard on one side and 
a drowner on the opposite side. The lifeguard swims with 
velocity v and runs around the rest of the pool with velocity 
w= 10v. If the swim direction is at angle @ with the direct 
line, choose @ to minimize and maximize the arrival time. 


When f’(x) is positive, f(x) is increasing. When dy/dx is negative, y{x) is decreasing. 
That is clear, but what about the second derivative? From Jooking at the curve, 
can you decide the sign of f(x) or d?y/dx?? The answer is yes and the key is in the 


bending. 


A straight line doesn’t bend. The slope of y= mx + b is m (a constant). The second 
derivative is zero. We have to go to curves, to see a changing slope. Changes in the 


derivative show up in f(x): 


f=x? has f’ = 2x and f” =2 (this parabola bends up) 


y=sin x has dy/dx = cos x and d*y/dx? = — sin x (the sine bends down) 
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The slope 2x gets larger even when the parabola is falling. The sign of f or f’ is not 
revealed by f". The second derivative tells about change in slope. 

A function with {"(x) > 0 is concave up. It bends upward as the slope increases. It 
is also called convex. A function with decreasing slope—this means f”(x) < 0-—is 
concave down. Note how cos x and | + cos x and even 1 +4x+ cos x change from 
concave down to concave up at x= 72/2. At that point f” = — cos x changes from 
negative to positive. The extra 1 + $x tilts the graph but the bending is the same. 


tangent above 
"<0 


tangent 
crosses 


Fig. 3.7 Increasing slope = concave up ({” > 0). Concave down is f” <0. Inflection point f” = 0. 


Here is another way to see the sign of f”. Watch the tangent lines, When the curve 
is concave up, the tangent stays below it. A linear approximation is too low. This 
section computes a quadratic approximation—which includes the term with f” > 0. 
When the curve bends down (f” <0), the opposite happens—the tangent lines are 
above the curve. The linear approximation is too high, and f” lowers it. 

In physical motion, ”(t) is the acceleration—in units of distance/(time)*. Accelera- 
tion is rate of change of velocity. The oscillation sin 2t has v= 2 cos 2¢ (maximum 
speed 2) and a= — 4 sin 2 (maximum acceleration 4). 

An increasing population means f’ > 0. Am increasing growth rate means {" > 0. 
Those are different. The rate can slow down while the growth continues. 


MAXIMUM VS. MINIMUM 
Remember that f'(x)=0 locates a stationary point. That may be a minimum or a 
maximum. The second derivative decidest Instead of computing f(x) at many points, 


we compute f"(x) at one point—~the stationary point. It is a minimum if f"(x)> 0. 


3D When f‘(x) = 0 and f"{(x) > 0, there is a focal minimum at x. 


When f(x) = 0 and f"{x) <0, there is a focal maximum at x. 


To the left of a minimum, the curve is falling. After the minimum, the curve rises. The 
slope has changed from negative to positive. The graph bends upward and {"(x) > 0. 

At a maximum the slope drops from positive to negative. In the exceptional case, 
when {'(x}=0 and also f"(x)= 0, anything can happen. An example is x*. which 
pauses at x = 0 and continues up (its slope is 3x? > 0). However x* pauses and goes 
down (with a very flat graph). 

We emphasize that the information from f(x) and f"(x) is only “local.” To be 
certain of an absolute minimum or maximum, we need information over the whole 
domain. 


3.3 Second Derivatives: Bending and Acceleration 


EXAMPLE 4 f(x)=x?-—x? has f'(xj)=3x?—2x and f"(x)=6x—2. 


To find the maximum and/or minimum, solve 3x* ~ 2x = 0. The stationary points 
are x= 0 and x =4. At those points we need the second derivative. It is f’(0) = — 2 
(local maximum) and {”(4) = + 2 (local minimum). 

Between the maximum and minimum is the inflection point. That is where 
J’(x) = 0. The curve changes from concave down to concave up. This example has 
f(x} = 6x — 2, so the inflection point is at x = 4. 


INFLECTION POINTS 


In mathematics it is a special event when a function passes through zero. When the 
function is f, its graph crosses the axis. When the function is f’, the tangent line is 
horizontal. When f” goes through zero, we have an inflection point. 

The direction of bending changes at an inflection point. Your eye picks that out in 
a graph. For an instant the graph is straight (straight lines have {” = 0). It is easy to 
see crossing points and stationary points and inflection points. Very few people can 
recognize where f’” = 0 or f’” = 0. I am not sure if those points have names. 

There is a genuine maximum or minimum when f(x) changes sign. Similarly, there 
is a genuine inflection point when f(x) changes sign. The graph is concave down on 
one side of an inflection point and concave up on the other side.t The tangents are 
above the curve on one side and below it on the other side. At an inflection point, 
the tangent line crosses the curve (Figure 3.7b). 

Notice that a parabola y = ax? + 6x +c has no inflection points: y” is constant. A 
cubic curve has one inflection point, because f” is linear. A fourth-degree curve might 
or might not have inflection points—the quadratic f(x) might or might not cross 
the axis. 


EXAMPLE 2 x‘ — 2x? is W-shaped, 4x° — 4x has two bumps, 12x? — 4 is U-shaped. 
The table shows the signs at the important values of x: 


x —f/2 -1 -/3 0 Ufa 1 2 


fx) oO | - 0,0 = 
f' 0 + 0 - 0 
F%(x) 0 - 0 


Between zeros of f(x) come zeros of f’{x) (stationary points). Between zeros of f'(x) 
come zeros of f(x) {inflection points). In this example (x) has a double zero at the 
origin, so a single zero of f’ is caught there. It is a local maximum, since f"(0) < 0. 

Inflection points are important—not just for mathematics. We know the world 
population will keep rising. We don’t know if the rate of growth will slow down. 
Remember: The rate of growth stops growing at the inflection point. Here is the 1990 
report of the UN Population Fund. 


The next ten years will decide whether the world population trebles or merely 
doubles before it finally stops growing. This may decide the future of the earth as 
a habitation for humans. The population, now 5.3 billion, is increasing by a quarter 
ofa million every day. Between 90 and 100 million people will be added every year 


tThat rules out f(x}=x*, which has {” = 12x? > 0 on both sides of zero. Its tangent line is 
the x axis. The line stays below the graph—so no inflection point. 
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during the 1990s; a billion people —a whole China—over the decade. The fastest 
growth will come in the poorest countries. 

A few years ago it seemed as if the rate of population growth was slowingt 
cverywhere except in Africa and parts of South Asia. The world’s population 
seemed set to stabilize around 10.2 billion towards the end of the next century. 

Today, the situation looks less promising. The world has overshot the marker 
points of the 1984 ‘most likely” medium projection. It is now on course for an 
eventual total that will be closer to 1! billion than to 10 billion, 

If fertility reduetions continuc to be slower than projected, the mark could be 
missed again. In that case the world could be headed towards a total of up to 14 
billion people. 


Starting with a census, the UN follows each age group in each country. They 
estimate the death rate and fertility rate—the medium estimates are published. This 
report is saying that we are not on track with the estimate. 

Section 6.5 will come back to population, with an equation that predicts 10 billion. 
It assumes we are now at the inflection point. But China’s second census just started 
on July 1, 1990. When it’s finished we wiil know if the inflection point is still ahead. 

You now understand the meaning of f(x). Its sign gives the direction of bending — 
the change in the slope. The rest of this section computes how much the curve bends— 
using the size of f” and not just its sign. We find quadratic approximations based on 
f(x). In some courses they are optional—the main points are highlighted. 


CENTERED DIFFERENCES AND SECOND DIFFERENCES 


Caiculus begins with average velocities, computed on either side of x: 


JANE OES) Owe Ns) and fe) = fe= —Alx Beal are close to f(x). (t) 
Ax Ax 


We never mentioned it, but a better approximation to f(x) comes from averaging 
those two averages. This produces a centered difference, which is based on x + Ax 
and x — Ax. It divides by 2Ax: 

me [‘ TANI WEIS = adit AX) fi BS) 


Ax Ax 


PW 5 JAX =e) 


We claim this is better. The test is to try it on powers of x. 

For f(x) = x these ratios all give f° = 1 (exactly). For f(x) = x7, only the centered 
difference correctly gives f' = 2x. The one-sided ratio gave 2x + Ax (in Chapter [| it 
was 2¢ +h). It is only “first-order accurate.” But centering leaves no error. We are 
averaging 2x~ Ax with 2x —Ax. Thus the centered difference is “second-order 
accurate.” 


T ask now: What ratio conrerges to the second derivative? One answer is to take 
differences of the first derivative. Certainly Af’/Ax approaches f”. But we want a 
ratio involving f itself. A natural idea is to take differences of differences, which 
brings us to ‘second differences”: 

eA Sot) eg er a LS a 
SO ae eB ee a 
Ax (Ax)? dx? . 


+The United Nations watches the second derivative! 


3.3 Second Derivatives: Bending and Acceleration 


On the top, the difference of the difference is A(Af) = A?/. It corresponds to d?f. 
On the bottom, (Ax)? corresponds to dx?. This explains the way we place the 2’s in 
d? f/dx*. To say it differently: dx is squared, df is not squared—as in distance/(time)?. 

Note that (Ax)? becomes much smaller than Ax. If we divide Af by (Ax)?, the ratio 
blows up. It is the extra cancellation in the second difference A*f that allows the limit 
to exist. That limit is f”(x). 


Application The great majority of differential equations can’t be solved exactly. 
A typical case is f”(x) = — sin f(x) (the pendulum equation). To compute a solution, 
I would replace f"(x) by the second difference in equation (3). Approximations at 
points spaced by Ax are a very large part of scientific computing. 


To test the accuracy of these differences, here is an experiment on f(x)= 
sin x + cos x. The table shows the errors at x = 0 from formulas (1), (2), (3): 


step length Ax one-sided errors centered errors second difference errors 


1/4 1347 0104 — .0052 
1/8 0650 0026 — .0013 
1/16 0319 0007 — 0003 
1/32 0158 0002 — .0001 


The one-sided errors are cut in half when Ax is cut in half. The other columns 
decrease like (Ax)?. Each reduction divides those errors by 4. The errors from one- 
sided differences are O(Ax) and the errors from centered differences are O(Ax)’. 


The “big O” notation When the errors are of order Ax, we write E = O(Ax). This 
means that E < CAx for some constant C. We don’t compute C—in fact we don’t 
want to deal with it. The statement “one-sided errors are Oh of delta x” captures 
what is important. The main point of the other columns is E = O(Ax)?. 


LINEAR APPROXIMATION VS. QUADRATIC APPROXIMATION 


The second derivative gives a tremendous improvement over linear approximation 
fia) +f'(a)(x — a). A tangent line starts out close to the curve, but the line has no 
way to bend. After a while it overshoots or undershoots the true function (see 
Figure 3.8). That is especially clear for the model f(x) = x”, when the tangent is the 
x axis and the parabola curves upward. 

You can almost guess the term with bending. It should involve f”, and also (Ax)’. 
It might be exactly f”(x) times (Ax)? but it is not. The model function x? has f” = 2. 
There must be a factor } to cancel that 2: 


pproximation to a smooth function f(x) near x = a is 
fx) = fla) +f@(x— a) +4f"(a@(x — a). (4) 


At the basepoint this is f(a) = f(a). The derivatives also agree at x = a. Furthermore 
the second derivatives agree. On both sides of (4), the second derivative at x =a is 
f"(a). 

The quadratic approximation bends with the function. It is not the absolutely 
final word, because there is a cubic term 4f”(a)(x — a) and a fourth-degree term 
3; f”’(a)(x — a)* and so on. The whole infinite sum is a “Taylor series.’’ Equation (4) 
carries that series through the quadratic term—which for practical purposes gives a 
terrific approximation. You will see that in numerical experiments. 
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Two things to mention. First, equation (4) shows why f” > 0 brings the curve above 
the tangent line. The linear part gives the line, while the quadratic part is positive 
and bends upward. Second, equation (4) comes from (2) and (3). Where one-sided 
differences give f(x + Ax) =f(x)+/‘(x)Ax, centered differences give the quadratic: 


from (2): fix+ Ax) = fix— Ax) +2f'(x) Ax 

from (3); fix +t Ax) & 2f0)—flx— Ax) +"(Qx)(Ax?. 
Add and divide by 2. The result is f(x + Ax) ~f(x)+f(xJAx t+ 4/"(xH(Ax)?. This is 
correct through (Ax)? and misses by (Ax)°, as examples show: 


EXAMPLE 3 (x+Ax)> = (x*)+ (3x*)(Ax) + $(6x)(Ax)* + error (Ax). 


EXAMPLE 4 (1+x)' = Ltnxt+4n(n— 1)x?. 


The first derivative at x =0 is n. The second derivative is n(n — 1). The cubic term 
would be $n(n — 1)(n — 2)x?. We are just producing the binomial expansion! 


i : , 
EXAMPLE 5 qa 1+x+x?=start of a geometric series. 
—xX 


1/{1 — x) has derivative 1/(1 — x)’. Its second derivative is 2/{1 — x)°. At x =0 those 
equal 1,1,2. The factor } cancels the 2, which leaves 1,1,1. This explains 1 + x + x?. 
The next terms are x° and x*. The whole series is 11 -— x)= 1+x+x7+x7+>->, 


Numerical experiment 1/./1+xx1—4x+ $x? is tested for accuracy. Dividing x 
by 2 almost divides the error by 8. If we only keep the linear part 1 — x, the error 
is only divided by 4. Here are the errors at x = 4, 3, and 7x: 


linear approximation (error = 3x7} 0194 .0053 .0014 


=5 
quadratic approximation (erro = Ge} ~—.00401 -—.00055 —- .00007 


3.3 EXERCISES 


1 A graph that is concave upward is inaccurately said to 
“hold water,” Sketch a graph with f"(x) > 0 that would not 
hold water. 


Read-through questions 


The direction of bending is given by the sign of _9__. If the 
second derivative is__> _ in an interval, the function is con- 
cave up (or convex). The graph bends _ ¢ __. The tangent 
lines are othe graph. If f”(x) <0 then the graph is con- 


cave © ,andtheslopeis_ f 


2 Find a function that is concave down for x <0 and con- 
cave up for 0 <x <1 and concave down for x > 1. 


3 Can a function be always concave down and never cross 


Ata point where f’(x) = 0 and f(x) > 0, the function has a 

@__. Atapoint where __h __, the function has a maximum. 
A point where f’(x)=0 is an _J! _ point, provided f” 
changes sign. The tangent line __|___ the graph. 


The centered approximation to f‘(x) is [_k_]/2Ax. The 
3-point approximation to f"(x)is{__' _]/(Ax}*. The second- 
order approximation to f(x + Ax} is fod +f'(xJAx +_m_. 
Without that extra term this isjust the _% approximation. 


With that term the error is O(__° _). 


zero? Can it be always concave down and positive? Explain. 
4 Find a function with f’(2)=0 and no other inflection 
point. 
True or false, when f(x) is a 9th degree polynomial with 
f'() =0 and f'(3} = 0. Give (or draw) a reason. 
5 fix) =0 somewhere between x =I and x = 3. 


6 {"{x}=0 somewhere between x = I and x = 3. 
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7 There is no absolute maximum at x = 3. 
§ There are seven points of inflection. 
9 If f(x) bas nine zeros, it has seven inflection points. 


10 If f(x) has seven inflection points, it has nine zeros. 


In 11-16 decide which stationary points are maxima or 


lt fixy= x? -~ 6x 
13 f(x) =x* — 6x? 


15 fix) =sin x —cos x 


12 fix} = x3 — 6x? 
14 f(x) = x!! — 6x! 
16 f(x} =x +sin 2x 


Locate the inflection points and the regions where f(x} is com 
cave up or down. 


17 f(x}=x4+x7-x3 
19 fix) = (x — 2)°{x — 4)? 


21 ‘If f(x) is an even function, the centered difference 
[f{Ax) —f{-Ax)]/2Ax exactly equals f‘(0) =0. Why? 


22 If f(x) is an odd function, the second difference 
[ftAx) — 2,0) +f Ax)]/(Ax)’ exactly equals f"(0) = 0. Why? 


18 f{x)=sin x+tan x 
20 fix) = sin x + (sin x}* 


Write down the quadratic /(0} +/"(O)x + 4/"(0)x? im 23-26. 


23 f(x} =cos x +sin x 24 f(x) =tan x 

25 f(x) = (sin x)/x 26 fix)=L+x4x? 

In 26, find f(1) +f (I(x — 1) 4 4/f"C)(x — 1)? around a = 1. 
27 Find A and Bin /1—x 21+ Ax + Bx’. 

28 Find A and Bin 1/(t —x)* + 1+ Ax + Bx’. 


29 Substitute the quadratic approximation into 
[fix + Ax) —f{x)]/Ax, to estimate the error in this one-sided 
approximation to f(x). 

30 What is the quadratic approximation at x = 0 to f(— Ax)? 
31 Substitute for f(x+ Ax) and f{x—Ax) in the centered 
approximation [f(x +Ax)—f(x—Ax}}/2Ax, to get 
f'(x)+ error. Find the Ax and (Ax)* terms in this error. Test 
on f(x) =x? atx=0. 


32 Guess a third-order approximation f{Ax} = /(0}+ 
S(QAx +4") Ax} + . Test it on f{x) = x3. 


Construct a table as in the text, showing the actual errors at 
x =0 in one-sided differences, centered differences, second 
differences, and quadratic approximations. By hand take two 
values of Ax, by calculator take three, by computer take four. 


33 fix}= x? + x* 34 fix) = 1/1 —x) 
35 f(x) =x? +sin x 


36 Example 5 was 1/(1—x)+1+x+x?. What is the error 
at x = 0.1? What is the error at x = 2? 


37 Substitute x =.01 and x = —0.1 in the geometric series 
it —x)=14+x4+x?+-- to find 1/99 and t/1.1—first to 
four decimals and then to all decimals. 


38 Compute cos 1° by equation (4) with a =0. OK to check 
on a calculator. Also compute cos 1. Why so far off? 


39 Why is sin x x not only a linear approximation but also 
a quadratic approximation? x = 0 is an point. 


40 Iff(x) is an even function, find its quadratic approximation 
at x = 0, What is the equation of the tangent line? 


41 For f{x)=x+x?+x°, what is the centered difference 
LA3) —fU1)]/2, and what is the true slope f‘(2)? 


42 For f(xs)=x+x?+x°3, what is the second difference 
LA3) — 2f€2) +fl)]/12, and what is the exact f”(2)? 


43 The error in f{a}+/‘(a){x-—a) is approximately 
4f"(a)(x — a)’. This error is positive when the function is 
. Then the tangent line is the curve. 


44 Draw a piecewise linear yx) that is concave up. Define 
“concave up” without using the test d?y/dx? > 0. If derivatives 
don’t exist, a new definition is needed. 


45 What do these sentences say about for f’ or f” or f’”? 
{. The population is growing more slowly. 
2. The plane is landing smoothly. 
3. The economy is picking up speed. 
4. The tax rate is constant, 
5. A bike accelerates faster but a car goes faster. 
6. Stock prices have peaked. 
7. The rate of acecleration is slowing down. 
8. This course is going downhill. 


46 (Recommended) Draw a curve that goes up-down-up. 
Below it draw its derivative. Then draw its second derivative. 
Mark the same points on all curves—the maximum, minimum, 
and inflection points of the first curve. 


47 Repeat Problem 46 on a printout showing x)= 
x? —4x? +x +2 and dy/dx and d?y/dx? on the same graph. 
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Reading a graph is like appreciating a painting. Everything is there, but you have to 
know what to look for. One way to learn is by sketching graphs yourself, and in the 
past that was almost the only way. Now it is obsolete to spend weeks drawing 
curves—a computer or graphing calculator does it faster and better. That doesn’t 
remoye the need to appreciate a graph (or a painting), since a curve displays a 
tremendous amount of information. 

This section combines two approaches. One is to study actual machine-produced 
graphs (especially electrocardiograms). The other is to understand the mathematics 
of graphs—siope, concavity, asymptotes, shifts, and scaling. We introduce the 
centering transform and Zoom transform. These two approaches are like the rest of 
calculus, where special] derivatives and integrals are done by hand and day-to-day 
applications are by computer. Both are essential—the machine can do experiments 
that we could never do. But without the mathematics our instructions miss the point. 
To create good graphs you have to know a few of them personally. 


READING AN ELECTROCARDIOGRAM (ECG or EKG) 


REFERENCE’ —«- The graphs of an ECG show the electrical potential during a heartbeat. There are 
twelve graphs---six from leads attached to the chest, and six from leads to the arms 
= and left leg. (It doesn’t hurt, but everybody is nervous. You have to lie still, because 


contraction of other muscles will mask the reading from the heart.) The graphs record 


r clectrical impulses, as the cells depolarize and the heart contracts. 

200 — What can I explain in two pages? The graph shows the fundamental pattern of the 
~'73——- ECG. Note the P wave, the QRS complex, and the T wave. Those patterns, seen 

\si-— differently in the twelve graphs, tell whether the heart is normal or out of rhythm— 

130-—— _ or suffering an infarction (a heart attack). 


First of all the graphs show the Aeart rate. The dark vertical lines are by convention 
4 second apart. The light lines are 7; second apart. If the heart beats every $ second 
(one dark line) the rate is 5 beats per second or 300 per minute. That is extreme 
tachycardia—not compatible with life. The normal rate is between three dark lines 


s0-—— per beat (2 second, or 100 beats per minute) and five dark lines (one second between 
beats, 60 per minute). A baby has a faster rate, over 100 per minute, In this figure 
4g—-. __ the rate fs . A rate below 60 is bradycardia, not in itself dangerous. For a resting 


athlete that is normal. 

Doctors memorize the six rates 300, 150, 100, 75, 60, 50. Those correspond to 1, 2, 
4a— 3,4, 5, 6 dark lines between heartbeats. The distance is easiest to measure between 
spikes (the peaks of the R wave). Many doctors put a printed scale next to the chart. 
One textbook emphasizes that “Where the next wave falls determines the rate. No 
mathematical computation is necessary.” But you see where those numbers come 
33-—— from. 


HEART RATE (3 CYCLES FROM REFERENCE ARROW—CHART PAPER SPEED: 25 mm./sec. 
on 
ao 


3.4 Graphs 


The next thing to look for is heart rhythm. The regular rhythm is set by the 
pacemaker, which produces the P wave. A constant distance between waves is good— 
and then each beat is examined. When there is a block in the pathway, it shows as 
a delay in the graph. Sometimes the pacemaker fires imegularly. Figure 3.10 shows 
sinus arrythmia (fairly normal). The time between peaks is changing. In disease or 
emergency, there are potential pacemakers in all parts of the heart. 

I should have pointed out the main parts. We have four chambers, an atrium- 
ventricle pair on the left and right. The SA node should be the pacemaker. The 
stimulus spreads from the atria to the ventricles—from the smail chambers that 
“prime the pump” to the powerful chambers that drive blood through the body. The 
P wave comes with contraction of the atria. There is a pause of 7 second at the AV 
node. Then the big QRS wave starts contraction of the ventricles, and the T wave is 
when the ventricles relax. The cells switch back to negative charge and the heart cycle 
is complete. 


Fig. 3.9 Happy person with a heart and a normal electrocardiogram. 


The ECG shows when the pacemaker goes wrong. Other pacemakers take over— 
the AV node will pace at 60/minute. An early firing in the ventricle can give a wide 
spike in the QRS complex, followed by a long pause. The impulses travel by a slow 
path. Also the pacemaker can suddenly speed up (paroxysmal tachycardia is 
150—250/minute). But tbe most critical danger is fibrillation. 

Figure 3.10h shows a dying heart. The ECG indicates irregular contractions—no 
normal PQRST sequence at all. What kind of heart would generate such a rhythm? 
The muscles are quivering or “fibrillating” independently. The pumping action is 
nearly gone, which means emergency care. The patient needs immediate CPR— 
someone to do the pumping that the heart can’t do. Cardio-pulmonary resuscitation 
is a combination of chest pressure and air pressure (hand and mouth) to restart the 
rhythm. CPR can be done on the street. A hospital applies a defibrillator, which 
shocks the heart back to life. It depolarizes ali the heart cells, so the timing can be 
reset. Then the charge spreads normally from SA node to atria to AV node to 
ventricles. 

This discussion has not used all twelve graphs to locate the problem. That needs 
vectors. Look ahead at Section 11.1 for the heart vector, and especially at Section 11.2 
for its twelve projections. Those readings distinguish between atrium and ventricle, 
left and night, forward and back. This information is of vital importance in the event 
of a heart attack. A “heart attack” is a myocardial infarction (MI). 

An MI occurs when part of an artery to the heart is blocked (a coronary occlusion). 
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Fig. 3.40 Doubtful aivihen Serious fibrillation. Signals of a heart attack. 


An area is without blood supply—therefore without oxygen or glucose. Often the 
attack is in the thick left ventricle, which needs the most blood. The cells are first 
ischemic, then injured, and finally infarcted (dead). The classical ECG signals involve 
those three I’s: 


Ischemia: Reduced blood supply, upside-down T wave in the chest leads. 
Injury: An elevated segment between § and T means a recent attack. 
Infarction: The Q wave, normally a liny dip or absent, is as wide as a small 
square (7s second). It may occupy a third of the entire QRS complex. 


The Q wave gives the diagnosis. You can find all three I’s in Figure 3.10c. 
It is absolutely amazing how much a good graph can do. 


THE MECHANICS OF GRAPHS 


From the meaning of graphs we descend to the mechanics. A formula is now given 
for f(x). The problem is to create the graph. It would be too old-fashioned to evaluate 
f(x) by hand and draw a curve through a dozen points. A computer has a much 
better idea of a parabola than an artist (who tends to make it asymptotic to a straight 
line), There are some things a computer knows, and other things an artist knows, 
and still others that you and I know—because we understand derivatives. 

Our job is to apply calculus. We extract information from f’ and f” as well as f. 
Smail movements in the graph may go unnoticed, but the important properties come 
through. Here are the main tests: 


. The sign of f(x} (above or below axis: f= 0 at crossing point) 

. The sign of f(x) {increasing or decreasing: f’ = 0 at stationary point) 

. The sign of f"(x) (concave up or down: f” = 0 at inflection point) 

. The behavior of f(x) as x > 00 and x — co 

. The points at which f(x) — o& or f{x}-+ — 0 

. Even or odd? Periodic? Jumps in f or f’? Endpoints? Sto)? 


Ata DA wn = 


2x io ee Gee 
i—x) f= a-xp 


2 
EXAMPLES = f(x)= 5 = = Of) 


The sign of f(x) depends on 1 — x”. Thus f(x) > 0 in the inner interval where x? < 1. 
The graph bends upwards (f”(x) > 0) in that same interval. There are no inflection 
points, since f” is never zero, The stationary point where f’ vanishes is x = 0. We 
have a focal minimum at x = 0. 

The guidelines (or asymptotes) meet the graph at infinity. For large x the important 
terms are x* and — x”. Their ratio is +x?/—x? = ~ 1—which is the limit as x -+ <0 
and x > — oo. The horizontal asymptote is the line y= — 1. 

The other infinities, where { blows up, occur when 1 — x? is zero. That happens at 

= 1and x= — 1, The vertical asymptotes are the lines x = | and x = —1. The graph 
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in Figure 3.11a approaches those lines. 


if fix) + b as x + + 00 or — 00, the line y= 5 is a horizontal asymptote 
if f(x) + +00 or — a as xa, the line x = a is a vertical asymptote 
if f(x) — (mx + b) > 0 as x — + 00 or — 00, the line y= mx + b is a sloping asymptote. 


Finally comes the vital fact that this function is even: f(x) = f{— x) because squaring 
x obliterates the sign. The graph is symmetric across the p axis. 

To summarize the effect of dividing by 1 — x”: No effect near x = 0, Blowup at 1 
and —1 from zero in the denominator, The function approaches —1 as |x| — 00. 


; x? ; eee: Pee 
EXAMPLE 2 fixy= ay f(x) = 1 f= @- Dp 
This example divides by x — 1. Therefore x = 1 is a vertical asymptote, where f(x) 
becomes infinite. Vertical asymptotes come mostly from zero denominators. 

Look beyond x = 1. Both f(x} and f(x) are positive for x > 1. The slope is zero at 
x= 2. That must be a local minimum. 

What happens as x > 00? Dividing x? by x — 1, the leading term is x. The function 
becomes large. It grows linearly—we expect a sloping asymptote. To find it, do the 
division properly: 

v4 


i 
=x+1+—. 
OP ia eee @) 
The last term goes to zero. The function approaches y=x-+ 1 as the asymptote. 
This function is not odd or even. Its graph is in Figure 3.11b. With zoom out you 
see the asymptotes. Zoom in for f= 0 or f’=0 or f” = 0. 


Ag. 3.44 The graphs of x?/{t —x*)} and x?/(x — 1) and sinx +4 sin 3x. 


EXAMPLES f{x)=sinx+4sin3x has the slope /‘(x)=cos x + cos 3x. 


Above ail these functions are periodic. If x increases by 2z, nothing changes. The 
graphs from 2z to 4x are repetitions of the graphs from 0 to 2z. Thus f(x + 2x) = f{x) 
and the period is 2x. Any interval of length 2x will show a complete picture, and 
Figure 3.11¢ picks the interval from — 2 to z. 

The second outstanding property is that f is edd. The sine functions satisfy 
fi- x) = — fx). The graph is symmetric through the origin. By reflecting the right half 
through the origin, you get the left half. In contrast, the cosines in f’(x) are even. 

To find the zeros of f(x) and f'(x) and f(x), rewrite those functions as 


fC =2 sin x — $sin’x (f'(x}= —2 cos x+4 cos?x f"(x)= — 10 sin x + 12 sin*x. 
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3 Applications of the Dertvattve 


We changed sin 3x to 3 sin x — 4sin*x. For the derivatives use sin?x = 1 — cos?x. 
Now find the zeros—the crossing points, stationary points, and inflection points: 


f=0 2sin x=4$sin?x > sin x=0 or sin?x= 3 > x=0, 42 
f'=0 2cosx=4cos’x => cos x=0 or cos*x=}$ > x= +n/4, 2/2, +30/4 
: 


f’=0 Ssinx=6 sin¢x = sin x=0 or sin?’x=% > x=0, 166°, 114°, +2 


That is more than enough information to sketch the graph. The stationary points 
n/4, 1/2, 3n/4 are evenly spaced. At those points f(x) is ./8/3 (maximum), 2/3 (local 
minimum), ./8/3 (maximum). Figure 3.1 1¢ shows the graph. 


J would like to mention a beautiful continuation of this same pattern: 
f(x) = sin x + $ sin 39x +4 sin Sx+-- f(x} = cos x + cos 3x + cos 5x+- 


If we stop after ten terms, f(x) is extremely close to a step function. If we don’t stop, 
the exact step function contains infinitely many sines. It jumps from — 2/4 to +7/4 as 
x goes past zero. More precisely it is a ‘‘sgware wave,” because the graph jumps back 
down at x and repeats. The slope cos x + cos 3x + --- also has period 2x. dnfinitely 
many cosines add up to a delta function! (The slope at the jump is an infinite spike.) 
These sums of sines and cosines are Fourier series. 


GRAPHS BY COMPUTERS AND CALCULATORS 


We have come to a topic of prime importance. If you have graphing software for a 
computer, or if you have a graphing calculator, you can bring calculus to life. A graph 
presents (x) in a new way—different from the formula. Information that is buried 
in the formula is clear on the graph. But don’t throw away y(x) and dy/dx. The 
derivative is far from obsolete. 

These pages discuss how calculus and graphs go together. We work on a crucial 
problem of applied mathematics—to find where (x) reaches its minimum. There is 
no need to tell you a hundred applications. Begin with the formula. How do you find 
the point x* where y(x) is smallest? 

First, draw the graph. That shows the main features. We should see (roughly) where 
x* lies. There may be several minima, or possibly none. But what we see depends on 
a decision that is ours to make—the range of x and y in the viewing window. 

If nothing is known about (x), the range is hard to choose. We can accept a default 
range, and zoom in or out. We can use the autoscaling program in Section 1.7. 
Somehow x* can be observed on the screen. Then the problem is to compute it. 

I would like to work with a specific example. We solved it by calculus—to find 
the best point x* to enter an expressway. The speeds in Section 3.2 were 30 and 60. 
The length of the fast road will be b = 6. The range of reasonable values for the entering 
point is 0 < x <6. The distance to the road in Figure 3.12 is a= 3. We drive a distance 
./37 +x? at speed 30 and the remaining distance 6 — x at speed 60: 


1 l 
driving time (x)= 30° [P+ x7+ ri — x). (2) 


This is the function to be minimized. Its graph is extremely flat. 

It may seem unusual for the graph to be so level. On the contrary, it is common. 
A flat graph is the whole point of dy/dx = 0. 

The graph near the minimum looks like y= Cx?. It is a parabola sitting on a 
horizontal tangent. At a distance of Ax = .01, we only go up by C{Ax)? = .0001 C. 
Unless C is a large number, this Ay can hardly be seen. 


+.02 


slope dyf/dx 


x* 
zero slope 
at minimum 
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Fig. 3.42 Enter at x. The graph of driving time p(x). Zoom boxes locate x*. 


The solution is to change scale. Zoom in on x*. The tangent line stays flat, since 
dy/dx is still zero. But the bending from C is increased. Figure 3.12 shows the zoom 
box blown up into a new graph of (x). 


A calculator has one or more ways to find x*. With a TRACE mode, you direct a 
cursor along the graph. From the display of y values, read y,,,, and x* to the 
nearest pixel. A zoom gives better accuracy, because it stretches the axes—each 
pixel represents a smaller Ax and Ay. The TI-81 stretches by 4 as default. Even 
better, let the whole process be graphical—draw the actual ZOOM BOX on the 
screen. Pick two opposite corners, press ENTER, and the box becomes the new 
viewing window (Figure 3.12). 


The first zoom narrows the search for x". It lies between x = 1 and x = 3. We build 
anew ZOOM BOX and zoom in again. Now 1.5 < x* <2. Reasonable accuracy 
comes quickly. High accuracy does not come quickly. It takes time to create the box 
and execute the zoom. 


Question 4. What happens as we zoom in, if all boxes are square (equal scaling)? 
Answer The picture gets flatter and flatter. We are zooming in to the tangent line. 
Changing x to X/4 and y to ¥/4, the parabola y = x? flattens to Y= X7/4. To see 
any bending, we must use a long thin zoom box. 


I want to change to a totally different approach. Suppose we have a formula for 
dyjdx. That derivative was produced by an infinite zoom! The limit of Ay/Ax came by 
brainpower alone: 

dy _ x _ 
dx 30,/37+x? 60° 


Cail this f(x). 


This function is zero at x*. The computing problem is completely changed: Solve 
Six) = 0. It is easier to find a root of f(x) than a minimum of Wx). The graph of f(x) 
crosses the x axis. The graph of y{x) goes flat—this is harder to pinpoint. 

Take the model function y = x? for |x| <.01. The slope f= 2x changes from —.02 
to +.02. The value of x? moves only by .0001—its minimum point is hard to see. 

To repeat: Minimization is easier with dy/dx. The screen shows an order of magni- 
tude improvement, when we trace or zoom on Jf{x)=0. In calculus, we have been 
taking the derivative for granted. It is natural to get blasé about dy/dx = 0. We forget 
how intelligent it is, to work with the slope instead of the function. 


Question 2 How do you get another order of magnitude improvement? 

Answer Use the next derivative! With a formula for df/dx, which is d?y/dx?, tbe 
convergence is even faster. In two steps the error goes from .01 to .0001 to .00000001. 
Another infinite zoom went into the formula for df/dx, and Newton’s method takes 
account of it. Sections 3.6 and 3.7 study f(x) = 0. 
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3 Applications of the Derivative 


The expressway example allows perfect accuracy. We can solve dy/dx = 0 by alge- 
bra. The equation simplifies to 60x = 30./3* + x?. Dividing by 30 and squaring yields 
4x? = 3? + x?, Then 3x? = 3?. The exact solution is x* = ,/3 = 1.73205... 

A model like this is a benchmark, to test competing methods. It also displays what 
we never appreciated—the extreme flatness of the graph. The difference in driving 
time between entering at x* = ,/3 and x =2 is one second. 


THE CENTERING FRANSFORM AND ZOOM TRANSFORM 


For a photograph we do two things—point the right way and stand at the right 
distance. Then take the picture. Those steps are tbe same for a graph. First we pick 
the new center point. The graph is shifted, to move that point from {a, 5) to (0, 0). 
Then we decide how far the graph should reach. It fits in a rectangle, just like the 
photograph. Rescaling to x/c and y/d puts the desired section of the curve into the 
rectangle. 

A good photographer does more (like an artist). The subjects are placed and the 
camera is focused. For good graphs those are necessary too. But an everyday calcula- 
tor or computer or camera is built to operate without an artist—just aim and shoot. 
I want to explain how to aim at y = f(x). 

We are doing exactly what a calculator does, with one big difference. It doesn’t 
change coordinates. We do. When x = 1, y = — 2 moves to the center of the viewing 
window, the calculator still shows that point as (1, — 2). When the centering transform 
acts on y+ 2= {x — 1), those numbers disappear. This will be confusing unless x 
and y also change. The new coordinates are X =x—1 and Y=y+ 2. Then the new 
equation is Y= mX, 

The main point {for humans) is to make the algebra simpler. The computer has no 
preference for Y= mX over y— yo = m(x— x9). It accepts 2x” — 4x as easily as x?. 
But we do prefer Y= mX and y =x’, partly because their graphs go through (0, 0). 
Ever since zero was invented, mathematicians have liked that number best. 


3F A centering transform shifts left by a and down by b: 


X=x-—aand Y= y-—b change y=/(x} into ¥+ b=f(X + a). 


EXAMPLE 4 The parabola y = 2x? — 4x has its minimum when dy/dx = 4x — 4=0. 
Thus x = 1 and y= — 2. Move this bottom point to the center: y= 2x? — 4x is 


Y+2=AX—1-4X-1) or Y=2X?. 


The new parabola ¥ = 2X7” has its bottom at (0, 0). It is the same curve, shifted across 
and up. The only simpler parabola is y = x”. This final step is the job of the zoom. 


Next comes scaling. We may want more detail (zoom in to see the tangent line}. 
We may want a big picture (zoom out to check asymptotes), We might stretch one 
axis more than the other, if the picture looks like a pancake or a skyscraper. 


3G SOA zoom transform scales the X and Y axes by c and d: 
x=cX and y=dY change Y=F(X) to y=dF(x/c). 
The new x and y are boldface letters, and the graph is rescaled. Often c= d. 
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EXAMPLES Start with Y=2X?. Apply a square zoom with c=d. In the new xy 
coordinates, the equation is y/c = 2{x/c}?. The number 2 disappears if c= d= 2. With 
the right centering and the right zoom, every parabola that opens upward is y = x’. 


Question 3 What happens to the derivatives (slope and bending) after a zoom? 
Answer The slope (first derivative) is multiplied by d/c. Apply the chain rule to y= 
aF(x/c). A square zoom has d/c = 1—lines keep their slope. The second derivative is 
multiplied by d/c?, which changes the bending. A zoom out divides by small numbers 
c= d, so the big picture is more curved. 

Combining the centering and zoom transforms, as we do in practice, gives y in 
terms of x: 


y=f(x) becomes Y=f{(X+a)—b andthen y= al s(% + a) - 7} 


x=A we Loe x=4(A —3) 
y=B 


y=B-1 y =8(B-1) 


Fig. 3.14 Change of coordinates by centering and zoom. Calculators still show (x, y). 


Question 4 Find x and y ranges after two transforms. Start between —1 and 1. 
Answer The window after centering is ~-1<x-a<1 and -—1<y—b<1. The 
window after zoom is —1 <¢(x — a) <1 and —1<d{y—b)< 1. The point (1, 1) was 
originally in the corner. The point (c™! + a, d~' + 5) is now in the corner. 


The numbers a, b,c, d are chosen to produce a simpler function (like y= x7). Or 
else—this is important in applied mathematics—they are chosen to make x and y 
“dimensionless.” An example is y =4cos 8. The frequency 8 has dimension 1/time. 
The amplitude $ is a distance. With d= 20cm and c= 8 sec, the units are removed 
and y = cost. 

May I mention one transform that does change the slope? It is a rotation. The 
whole plane is turned. A photographer might use it—but normally people are sup- 
posed to be upright. You use rotation when you turn a map or straighten a picture. 
In the next section, an unrecognizable hyperbola is turned into ¥ = 1/X, 


3.4 EXERCISES 


Read-through questions around the graph looks longand__ | _. We__™__ in to that 


The position, slope, and bending of y=/{x) are decided by 
9, > and ¢ If | f{x)| > wo asx >a, the line x = 
aisa vertical ¢ . If f(x} 6 for large x, then y=bisa 
e_. If f(x)—mx— 6 for large x, then p=mx+b is a 
t__. The asymptotes of y=x7/(x?-—4} are _g__. This 
function is even because (¥—x)= __h_. The function sin kx 
has pertod _f_. 


Near a point where dy/dx =0, the graph is extremely 
}__. For the model y = Cx*, x =.1 givesy=__k . A box 


box for another digit of x*. But solving dy/dx =0 is more 
accurate, because its graph _" _ the x axis. The slope of 


dy/dx is © _. Each derivative is like an __P zoom. 


To move (a, 5} to (0, 0), shift the variables to X =__4 
and Y=_¢__. This __$ __ transform changes y =/(x) to 
Y= _1t __, The original slope at (a, b} equals the new slope 
at ___¥ __. To stretch the axes by c and d, set x=cX and 

v__. The__¥*__ transform changes Y= F(X}toy=_ x 
Slopes are multiplied by _y . Second derivatives are 
multiplied by _ 2 


420 3 Applications of the Dertvative 


1 Find the pulse rate when heartbeats are 4 second or two 
dark lines or x seconds apart. 


2 Another way to compute the heart rate uses marks for 
§-second intervals. Doctors count the cycles in an interval. 
(a} How many dark lines in 6 seconds? 
{b} With 8 beats per interval, find the rate. 
(c) Rule: Heart rate = cycles per interval times 


Which functions in 3-18 are even or odd or periodic? Find all 
asymptotes: y= 6b or x =a or y= mx +. Draw roughly by 
hand or smoothly by computer. 


3 f(x) =x —(9/x) 4 f(x)= x" (any integer n) 


x? 
5 f= 6 {@)=7— 
243 743 


9 f(x) =(sin x}{sin 2x) 10 f(x}=cos x + cos 3x + cos 5x 


x stn x 
x*—1 


11 f(x}= 


12 f(x) = 


I 1 
13 f()= i 14 f(x}= 5 — x 


xi+] 
x31 


i7 f(xj=x—sin x 


sin x +¢os x 
sin x — cos x 


18 f(x) =(tx)— /x 


15 f(xj= 16 f(xj= 


In 19-24 construct {(x) with exactly these asymptotes. 


19 x=1 and y=2 0M x=1,x=2,y=0 


21 y=xand x=4 22 y=2x+3and x=0 
23 y=x (x 0), p= — x(x > — &) 
4x=1,x=3,y=x 


25 For P(xVQ(x) to have y = 2 as asymptote, the polynomials 
P and Q must be , 


26 For P(x)/Q(x) to have a sloping asymptote, the degrees of 
P and @ must be i 


27 For P{x)/Q{x) to have the asymptote y = 0, the degrees of 
P and @ must . The graph of x*/{(1 +x?) has what 
asymptotes? 

28 Both 1f{x—1) and [/(x—i)? have x=! and y=O as 
asymptotes. The most obvious difference in the graphs is 


29 If f(x) has asymptotes x=1 and y=3 then f(x) has 
asymptotes : 


3M True (with reason) or false (with example). 
(a) Every ratio of polynomials has asymptotes 
(b) If f(x) is even so is f(x) 
(c) If f(x) is even so is f(x) 
{d) Between vertical asymptotes, f’(x} touches zero. 


31 Construct an f(x) that is “even around x = 3,” 


32 Construct g(x) to be “odd around x =z.” 


Create graphs of 33-38 on a computer or calculator. 

33 p(x) = (1 + bx, —3<x<3 

BM ox)ex"* O1<x<2 

35 p(x) = sin(x/3) + sin(x/5) 

36 p(x) =(2—x)(2+x), -3<x<3 

37 y{x} = 2x? + 3x? — 12x + 5 on [—3, 3] and [2.9, 3.1] 
38 100(sin(x + .1)— 2 sin x + sin(x —.f)}] 


In 39-40 show the asymptotes on large-scale computer graphs. 


x? +8x—15 x*—6x74 1 
sia aa) a A Te 
—2 x?-—x42 

We) y= Tees 2S eae a ar, 


41 Rescale y=sin x so X is in degrees, not radians, and Y 
changes from meters to centimeters. 


Problems 42-46 minimize the driving time y{x) in the text. 
Some questions may not fit your software. 


42 Trace along the graph of y{x) to estimate x*. Choose an 
xy range or use the default. 


43 Zoom in by c=d=4. How many zooms until you reach 
x* = 1.73205 or 1.7320508? 


44 Ask your program for the minimum of y{x) and the solu- 
tion of dy/dx =0, Same answer? 


45 What are the scaling factors c and d for the two zooms in 
Figure 3.12? They give the stretching of the x and y axes. 


46 Show that dy/dx = — 1/60 and d?y/dx? = 1/90 at x =0. 
Linear approximation gives dy/dx ~ ~— 1/60+x/90. So the 
slope is zero near x = . This is Newton’s method, 
using the next derivative. 


Change the function to p(x) = ./15 + x*/30 +. (10 — x)/60, 
47 Find x* using only the graph of (x). 
48 Find x* using also the graph of dy/dx. 


49 What are the xy and XY and xy equations for the line in 
Figure 3.14? 


3.5 Parabolas, Ellipses, and Hyperbolas 421 


50 Define f(x) =sinx+4sin3x+4sin 5x+- (mn terms). 
Graph f, and f,> from —z to x. Zoom in and describe the 
Gibbs phenomenon at x = 0. 


On the graphs of 51—56, zoom in to all maxima and minima 
(3 significant digits). Estimate inflection points. 


SL y= 2x? — 16x* + 5x3 — 37x? + 21x + 683 
§2 ya=x°—xt— 
§3 y=x(x — [I}(x-—2yx-4) 


3x+1-—2 


3.5 Parabolas, Ellipses, and Hyperbolas 


84 y=7 sin 2x +5 cos 3x 
§5 y=(x? — 2x + 1)/(x* — 3x? — 15), -3 <x <5 
5 y=x sin (1/x), O1<x<1 


57 A 10-digit computer shows y=0 and dy/dx =.01 at x* =1. 
This root should be correct to about (8 digits) (10 digits) 
(12 digits). Hint: Suppose y = .01 (x — 1 + error), What errors 
don’t show in 10 digits of y? 


88 Which is harder to compute accuralely: Maximum point 
or inflection point? First derivative or second derivative? 


Here is a list of the most important curves in mathematics, so you can tell what is 


coming. It is not easy to rank the top four: 


1. straight lines 
2. sines and cosines (oscillation) 
3. exponentials (growth and decay) 


4. parabolas, ellipses, and hyperbolas (using 1, x, y, x”, xy, y*). 


The curves that I wrote last, the Greeks would have written first. It is so natura! to 
go from linear equations to quadratic equations. Straight lines use 1, x, y. Second 
degree curves include x’, xy, y?. If we go on to x° and y*, the mathematics gets 
complicated. We now study equations of second degree, and the curves they produce. 

It is quite important to see both the equations and the curves. This section connects 
two great parts of mathematics—analysis of the equation and geometry of the curve. 
Together they produce “analytic geometry.” You already know about functions and 
graphs. Even more basic: Numbers correspond to points. We speak about “the point 


(5, 2).” Euclid might not have understood. 


Where Euclid drew a 45° line through the origin, Descartes wrote down y= x. 
Analytic geometry has become central to mathematics—we now look at one part of it. 


Fig. 3.45 The cutting plane gets steeper: circle to ellipse to parabola to hyperboia. 
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3 Applications of the Derivative 


CONIC SECTIONS 


The parabola and ellipse and hyperbola have absolutely remarkable properties. The 
Greeks discovered that all these curves come from slicing a cone by a plane. The 
curves are “conic sections.” A level cut gives a circle, and a moderate angle produces 
an ellipse. A steep cut gives the two pieces of a Ayperbola (Figure 3.15d}. At the 
borderline, when the slicing angle matches the cone angle, the plane carves out a 
parabola. \t has one branch like an ellipse, but it opens to infinity like a hyperbola. 

Throughout mathematics, parabolas are on the border between ellipses and 
hyperbolas. 

To repeat: We can slice through cones or we can look for equations. For a cone 
of light, we see an ellipse on the wall. (The wall cuts into the light cone.) For an 
equation Ax? + Bxy + Cy? + Dx + Ey + F=0, we will work to make it simpler. The 
graph will be centered and rescaled (and rotated if necessary), aiming for an equation 
like y= x?, Eccentricity and polar coordinates are left for Chapter 9. 


THE PARABOLA y= ax?+ bx+c¢ 


You knew this function long before calculus. The graph crosses the x axis when 
y =O. The quadratic formula solves y = 3x? — 4x + 1 =, and so does factoring into 
(x — 1}(3x — 1). The crossing points x = 1 and x = 4 come from algebra. 

The other important point is found by calculus. It is the minimum point, where 
dy/dx = 6x — 4=0. The x coordinate is = 4, halfway between the crossing points. 
The height is ymin = ~— 3. This is the vertex V in Figure 3.16a—at the bottom of the 
parabola. 

A parabola has no asymptotes. The slope 6x — 4 doesn’t approach a constant. 


To center the vertex Shift left by } and up by 4. So introduce the new variables 
X=x-—4#and Y=y+4. Then x=} and y= — § correspond to X = ¥Y=0—which 
is the new vertex: 

y=3x7-4x+1 becomes Y=3X?. (i) 


Check the algebra. Y= 3X? is the same as y + = 3(x — 3)’. That simplifies to the 
original equation y = 3x? — 4x + 1. The second graph shows the centered parabola 
Y= 3X?, with the vertex moved to the origin. 
To zoom in on the vertex Rescale X and Y by the zoom factor a: 

Y=3X" becomes y/a=3(x/a)’. 
The final equation has x and y in boldface. With a = 3 we find y = x?—the graph is 
magnified by 3. In two steps we have reached the model parabola opening upward. 


yo3x?-4r41 


directrix at y = - 


vertex at (2/3, - 1/3) 
fig. 3.146 Parabola with minimum at V, Rays reflect to focus. Centered in (b), rescaled in (c). 
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3.5 Parabolas, Ellipses, and Hyperbolas 


A parabola has another important point—the foeuws. Its distance from the vertex 
is called p. The special parabola y = x” has p= 1/4, and other parabolas Y= aX? 
have p= 1/4a. You magnify by a factor a to get y= x”. The beautiful property of a 
parabola is that every ray coming straight down is reflected to the focus. 

Problem 2.3.25 located the focus F—here we mention two applications. A solar 
collector and a TV dish are parabolic. They concentrate sun rays and TV signals 
onto a point—~a heat celt or a receiver collects them at the focus. The 1982 UMAP 
Journal explains how radar and sonar use the same idea. Car headlights turn the 
idea around, and send the light outward. 

Here is a classical fact about parabolas. From each point on the curve, the distance 
to the focus equals the distance to the “directrix.” The directrix is the line y= —p 
below the vertex (so the vertex is halfway between focus and directrix). With p=4, 
the distance down from any (x, y) is y+4. Match tbat with tbe distance to the focus 
at (0, })— this is the square root below. Out comes the special parabola y = x?: 


yt+4=./x?+(y-}" —\- (square both sides) ——~ y=x?. (2) 


The exercises give practice with all the steps we have taken—center the parabola to 
Y= aX’, rescale it to y =x”, locate the vertex and focus and directrix. 


Summary for other parabolas y= ax? +bx +c has its vertex where dy/dx is zero. 
Thus 2ax+b=0 and x= —5/2a. Shifting across to that point is “completing the 
square”: 


2 
ax? +hxtc equals a(x+ 4 +C. (3) 


Here C = c — (b?/4a) is the height of the vertex. The centering transform X = x + (b/2a), 
Y=y-—C produces Y= aX?. It moves the vertex to (0, 0), where it belongs. 
For the ellipse and hyperbola, our pian of attack is the same: 


1, Center the curve to remove any linear terms Dx and Ey. 
2. Locate each focus and discover the reflection property. 
3. Rotate to remove Bxy if the equation contains it. 


2 
a =4 (CIRCLES HAVE a= b) 


x? 
—+t 
ELLIPSES sits 


This equation makes the ellipse symmetric about (0, 0)—the center. Changing x to 
~xor y to —y leaves the same equation. No extra centering or rotation is needed. 
The equation also shows that x?/a* and y?/b? cannot exceed one. (They add to 
one and can’t be negative.) Therefore x? < a”, and x stays between — a and a. Similarly 
y stays between b and —b. The ellipse is inside a rectangle. 
By solving for y we get a function {or two functions!) of x: 


? AS, oe b 
pain sy gives —=+ 1-3 or yobo va’ x?. 


The graphs are the top half (+) and bottom half (—) of the ellipse. To draw the ellipse, 
plot them together. They meet when y=0, at x=a on the far right of Figure 3.17 
and at x = — a on the far left. The maximum y = 6 and minimum y = — } are at the 
top and bottom of the ellipse, where we bump into the enclosing rectangle. 

A circle is a special case of an ellipse, when a = b. The circle equation x? + y* = r? 
is the ellipse equation with a= 6 =r. This circle is centered at (0, 0); other circles are 


ole 
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centered at x =h, y=k. The circle is determined by its radius r and its center (h, k): 
Equation of circle: (x — h) + (y— ky? =r’. (4) 
In words, the distance from (x, y) on the circle to (h, k) at the center is r. The 
equation has linear terms — 2hx and ~ 2ky—they disappear when the center is (0, 0). 
EXAMPLE 4 Find the circle that has a diameter from (1, 7) to (5, 7). 
Solution The center is halfway at (3, 7). So r= 2 and (x — 3)? + (y— 7)? = 2?. 


EXAMPLE 2 Find the center and radius of the circle x? ~ 6x + y* — 14y = — 54. 


Solution Complete x? — 6x to the square (x — 3)? by adding 9. Complete y? — 14y 
to (y— 7)? by adding 49. Adding 9 and 49 to both sides of the equation leaves 
(x — 3)? +(y— 7) = 4—the same circle as in Example 1. 


Quicker Solution Match the given equation with (4). Then # = 3, k= 7, and r=2: 
x?~6x+y?—~ 14y=-—54 must agree with x?—2hx+h? + y?—2kytk?=r?, 


The change to X = x—f and Y= y—k moves the center of the circle from (A, k) 
to (0, 0). This is equally true for an ellipse: 
Kany =k Pa ee Se 

Tabane B =1 becomes art ol 

When we rescale by x = X/a and y = Y/b, we get the unit circle x? + py? = 1. 

The unit circle has area 2. The ellipse kas area nab (proved later in the book). The 
distance around the circle is 2x. The distance around an ellipse does not rescale—it 
has no simple formula. 


The ellipse 


(sy, y) 
: (0, 2) 


(-a, 0) (3, 0) 


(0, -») 


Aig. 3.47 Uncentered circle. Centered ellipse x?/3? + y?/2* = 1. The distance from center to 
far right is also a=3. All rays from F, reflect to F,. 


Now we leave circles and concentrate on ellipses. They have twe foci (pronounced 
fo-sigh). For a parabola, the second focus is at infinity. For a circle, both foci are at 
the center. The foci of an ellipse are on its longer axis (its major axis), one focus on 
each side of the center: 


F,isatx=c= /a*—b* and F,isatx=~—c. 


The right triangle in Figure 3.17 has sides a, 6, c. From the top of the ellipse, the 
distance to each focus is a. From the endpoint at x = a, the distances to the foci are 
a+cand a-—c. Adding (a+ c)+(a@—c) gives 2a. As you go around the ellipse, the 
distance to F, plus the distance to F, is constant (always 2a). 


3.5 Parabolas, Ellipses, and Hyperbolas 


3H _ At all points on the ellipse, the sum of distances from the foci is 2a. This 
is another equation for the ellipse: 


from F, and F, to (x,y): ./(x—c)? +y? + 


To draw an ellipse, tie a string of length 2a to the foci. Keep the string taut and your 
moving pencil will create the ellipse. This description uses a and c—the other form 
uses a and b (remember b? + c? = a”). Problem 24 asks you to simplify equation (5) 
until you reach x?/a? + y?/b? = 1. 

The “whispering gallery” of the United States Senate is an ellipse. If you stand at 
one focus and speak quietly, you can be heard at the other focus (and nowhere else). 
Your voice is reflected off the walls to the other focus—following the path of the 
string. For a parabola the rays come in to the focus from infinity—where the second 
focus is. 

A hospital uses this reflection property to split up kidney stones. The patient sits 
inside an ellipse with the kidney stone at one focus. At the other focus a lithotripter 
sends out hundreds of small shocks. You get a spinal anesthetic (I mean the patient) 
and the stones break into tiny pieces. 

The most important focus is the Sun. The ellipse is the orbit of the Earth. See 
Section 12.4 for a terrible printing mistake by the Royal Mint, on England’s last 
pound note. They put the Sun at the center. 


(x+c)?+y?=2a. (5) 


Question 4. Why do the whispers (and shock waves) arrive together at the second 
focus? 
Answer Whichever way they go, the distance is 2a. Exception: straight path is 2c. 


Question 2 Locate the ellipse with equation 4x? + 9y? = 36. 
Answer Divide by 36 to change the constant to 1. Now identify a and b: 


2 2 


5 t p= isoa=,/9 and b= /4. Foci at + /9-4=+4//5. 


Question 3 Shift the center of that ellipse across and down to x= 1, y= — 5. 
Answer Change x to x-—1. Change y to y+5. The equation becomes 
(x — 1)?/9 + (y + 5)?/4= 1. In practice we start with this uncentered ellipse and go the 
other way to center it. 
y’ x? 
HYPERBOLAS > 4 
Notice the minus sign for a hyperbola. That makes all the difference. Unlike an ellipse, 
x and y can both be large. The curve goes out to infinity. It is still symmetric, since 
x can change to —x and y to —y. 
The center is at (0, 0). Solving for y again yields two functions (+ and —): 


- 2 
a gives a a or pate Jerre. (6) 


The hyperbola has two branches that never meet. The upper branch, with a plus sign, 

has y >a. The vertex V, is at x =0, y= a—the lowest point on the branch. Much 

further out, when x is large, the hyperbola climbs up beside its sloping asymptotes: 
2 


2 
t%- aa Y a 
if pe 1000 then re. 1001. So a8 close to Bo 5: 
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radio signals 


reach curve 
waves 
reflect 
to Fy 
a 
eo = = ~ 
¢ Fy ~*~ 


Fig. 3.46 The hyperbola ty? —yx* =1 has a=2, b=3,c=./4+9, The distances to F, and 
F, differ by 2a = 4. 


The asymptotes are the lines p/a = x/b and y/a = — x/b. Their slopes are a/b and — a/b. 
You can’t miss them in Figure 3.18. 

For a hyperbola, the foci are inside the two branches. Their distance from the 
center is still called ¢, But now c = ./a? + b?, which is larger than a and b. The vertex 
is a distance c— a from one focus and c+ a from the other. The difference (not the 
sum) is (c + a} — {¢ — a) = 2a. 

All points on the hyperbola have this property: The difference between distances to 
the foci is constantly 2a. A ray coming in to one focus is reflected toward the other. 
The reflection is on the outside of the hyperbola, and the inside of the ellipse. 

Here is an application to navigation. Radio signals leave two fixed transmitters at 
the same time. A ship receives the signals a millisecond apart. Where is the ship? 
Answer: It is on a hyperbola with foci at the transmitters. Radio signals travel 
186 miles in a millisecond, so 186 = 2a, This determines the curve. In Long Range 
Navigation (LORAN) a third transmitter gives another hyperbola. Then the ship 
is located exactly. 


Question 4 How do hyperbolas differ from parabolas, far from the center? 
Answer Hyperbolas have asymptotes. Parabolas don’t. 


The hyperbola has a natural rescaling. The appearance of x/b is a signal to change 
to X. Similarly y/a becomes Y. Then Y= 1 at the vertex, and we have a standard 
hyperbola: 


y?/a*—x7/b2=1 becomes Y¥?—- X71. 


A 90° turn gives ¥? — Y? = 1— the hyperbola opens to the sides. A 45° turn produces 
2XY=1. We show below how to recognize x*+xy+y?=1 as an ellipse and 
x? + 3xy + y? = 1 as a hyperbola. (They are not circles because of the xy term.) When 
the xy coefficient increases past 2, x? + py? no longer indicates an ellipse. 


Question § Locate the hyperbola with equation 9y? — 4x? = 36. 
Answer Divide by 36. Then y?/4 — x?/9 = 1. Recognize a= ,/4 and b= ./9. 


Question 6 Locate the uncentered hyperbola 9y? ~ 18y— 4x? — 4x = 28. 

Answer Complete 9y7-— 18y to 9(y— 1)? by adding 9. Complete 4x7+ 4x to 
4(x + $)? by adding 4(4)7 = 1. The equation is rewritten as 9(p— 1)? — 4(x +4)? = 
28 + 9-1. This is the hyperbola in Question 5 — except its center is (— 4, 1). 


3.5 Parabolas, Ellipses, and Hyperbolas 
To summarize: Find the center by completing squares. Then read off a and b. 
THE GENERAL EQUATION Ax? + Bxy + Cy? + Dx + Ey+ F=0 


This equation is of second degree, containing any and ail of 1, x, y,x*, xy, y?. 
A plane is cutting through a cone. fs the curve a parabola or ellipse or hyperbola? 
Start with the most important case Ax? + Bxy + Cy? = 1. 


To recognize the curve, we remove Bxy by rotating the plane. This also changes A 
and C—but the combination B* — 4AC is not changed (proof omitted). An example 
is 2xy=1, with B?=4, It rotates to y?-— x?=1, with ~4A4C =4. That positive 
number 4 signals a hyperbola—since A = — | and C= 1 have opposite signs. 

Another example is x* + y? = 1. It is a circle (a special ellipse). However we rotate, 
the equation stays the samc. The combination B* — 4AC = 0 — 4+1-1 is negative. as 
predicted for ellipses. 

To rotate by an angle a, change x and y to new variables x' and p’: 


x=x'cosx—y' sin a x= xcosatysina 
ne ; and . . (7) 
ye=x'sin a+ y' cosa yun ysin a+xcosa. 


Substituting for x and y changes Ax? + Bxy+ Cy? =1to A’x? + Bix y+Cy*=1,. 
The formulas for A’, B’, C’ are painful so I go to the key point: 


B’ is zero if the rotation angie « has tan 2a = B/(A — C). 
With B’ =0, the curve is easily recognized from A’x’? + C'y’* = 1. It is a hyperbola 
if A’ and C’ have opposite signs. Then B’* — 4A‘C’ is positive. The original B? — 4AC 
was also positive, because this special combination stays constant during rotation. 
After the xy term is gone, we deal with x and »—by centering. To find the center, 
complete squares as in Questions 3 and 6. For total perfection, rescale to one of the 
model equations y= x? or x? +y?= 1 or y?—x7=1, 


The remaining question is about F = 0, What is the graph of Ax? + Bxy+ Cy? =0? 
The ellipse-hyperbola-parabola have disappeared. But if the Greeks were right, the 
cone is stili cut by a plane, The degenerate case F = 0 occurs when the plane cuts 
right through the sharp point of the cone. 

A level cut hits only that one point (0, 0). The equation shrinks to x? + j? = 0,a 
circle with radius zero. A steep cut gives two lines. The hyperbola becomes }” — x? = 0, 
leaving only its asymptotes y= + x. A cut at the exact angle of the cone gives only 
one line, as in x? = 0. A single point. two lines, and one line are very extreme cases of 
an ellipse, hyperbola, and parabola. 


All these “conic sections” come from planes and cones. The beauty of the geometry. 
which Archimedes saw, is matched by the importance of the equations. Galileo dis- 
covered that projectiles go along parabolas (Chapter 12). Kepler discovered that the 
Earth travels on an ellipse (also Chapter 12). Finally Einstein discovered that light 
travels on hyperbolas. That is in four dimensions, and not in Chapter 12. 
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equation vertices foci 
b 1 

P =ax? + bxt+ -—,¢e-— — infini 
y=ax*tbxt+c ( aa?’ 4a 78 above vertex, also infinity 
x 

E git pp bars (a, 0) and (—a, 0) (c, 0) and (~c, 0): c= ./a? — b* 
yx 

H og (0,a) and (0, -a) (0, c) and (0, —c): c= ./a* +b’ 


3.5 EXERCISES 


Read-through questions 
The graph of y=x74+2x+5 is a_9__. Its lowest point 


(the vertex) is (x, y)=(__b __), Centering by X = x+1 and 
Y=_¢ moves the vertex ta (0, 0). The equation becomes 
Y=_4__. The focus of this centered parabola is__© _. All 


rays coming straight down are __f to the focus. 


The graph of x? + 4y?=16isan_9_. Dividing by fh 
leaves x7/a? + y*/b?=1 witha=_ | andb=_) _. The 
graph lies in the rectangle whose sides are__k _. The area is 
nab=__! __. The foci are atx =+c=_™_. The sum of 
distances from the foci to a point on this ellipse is always 

n__. If we rescale to X = x/4 and Y=y/2 the equation 


becomes © and the graph becomesa _ P 


The graph of y>—x?=9isa_4_. Dividing by 9 leaves 
yja?—x?/b>=1 with a=_+t and b=_ 8. On the 
upper branch y>__t _. The asymptotes are the lines __¥ 
The foci areat y= +c=__¥__. The __® __ of distances from 
the foci to a point on this hyperbola is __* 


All these curves are conic sections—-the intersection of a 
y__anda_z_. Asteep cutting angle yieldsa A. At 
the borderline angle we get a ___8 _. The general equation is 
Ax?+_C¢ +f =0. If D=£ =O the center of the graph is 
at_O . The equation Ax? + Bxy + Cy? = 1 gives an ellipse 


when __— __. The graph of 4x? + Sxy+6y?=lisa_F 


1 The vertex of y=ax?+bx+¢ is at x =~—b/2a. What ts 
special about this x? Show that it gives y = c — (b?/4a). 


2 The parabola y = 3x? — 12x has %min = . At this 
minimum, 3x? is as large as 12x, Introducing 
X=x—2and Y=y-+12 centers the equation to : 


Draw the curves 3-14 by hand or calculator or computer. 
Locate the vertices and foci. 


3 y=x?-2x-3 4 y=(x—1)? 
5 4y=-x? 6 4x=y? 
7 (x-—1P +(y-1P?=1 8 x? 497 =9 


9 9x74 y?=9 10 x?7/4—(p—1P =1 


Wy? —4x? =1 
13 y?-—x?=0 


12 (y—1P - 4x? =1 
l4 xy=0 


Problems 15—20 are about parabotas, 21-34 are about ellipses, 
35-41 are about hyperbolas. 


15 Find the parabola y=ax*+5x+c that goes through 
(0, 0} and (1, 1) and (2, 12). 


16 y =x? — x has vertex at . To move the vertex to 

(0,0)setX¥=_ and Y=____sd Then Y= X?. 

17 (a) In equation (2) change } 10 p. Square and simpiify. 
(b) Locate the focus and directrix of ¥=3X*. Which 
points are a distance { from the directrix and focus? 


18 The parabola y =9 x? opens with vertex at 
. Centering by Y= y—9 yields ¥=—x?. 

19 Find equations for all parabolas which 

(a} open to the right with vertex at (0, 0} 

(b) open upwards with focus at (0, 0) 

(c} open downwards and go through (0, 0) and (1, 0). 
20 A projectile is at x =¢, y=¢—2? at time ¢. Find dx/dt and 
dy/dt at the start, the maximum height, and an xy equation 
for the path. 


21 Find the equation of the ellipse with extreme points at 
(+2, 0) and (0, +1}. Then shift the center to (t, 1) and find the 
new equation. 
22 On the ellipse x?/a?+y?/b?=1, solve for y when 
x=c=_/a? —b7, This height above the focus will be valuable 
in proving Kepier’s third law. 
23 Find equations for the ellipses with these properties: 
(a) through (5, 0} with foci at (+4, 0) 
(b) with sum of distances to (1, 1) and (5, 1) equal to 12 
(c) with both foci at (0,0) and sum of distances = 
2a = 10. 
24 Move a square root to the right side of equation (5) and 


square both sides. Then isolate the remaining square root and 
square again. Simplify to reach the equation of an ellipse. 


3.5 Parabolas, Ellipses, and Hyperbolas 429 


25 Decide between circle-ellipse-parabola-hyperbola, based 
on the XY equation with X =x—1] and Y=y+3, 

{a) x7 -2x+ 37+ 6¥=6 

(b) x? —2x — 3? -—6y =6 

(c) x? — 2x +2y? + 12p=6 

(d) x? -—2x-—y =O. 
26 A tilted cylinder has equation (x —2y—2z)?+ 


(y — 2x — 22)? = 1. Show that the water surface at z =0 is an 
ellipse. What is its equation and what ts B? — 4AC? 


27 (4, 9/5) is above the focus on the ellipse x7/25 4 y7/9 = 1. 
Find dyjdx at that point and the equation of the tangent line. 


28 (a) Check that the line xxo + yyy =r? is tangent to the 
circle x? + 3? =r? at (x9, yo). 
(b) For the ellipse x*/a? + y7/b? = 1 show that the tangent 
equation is XX_/a? + }¥9/b? = 1. (Check the slope.) 


29 The slope of the normal line in Figure A is s = — 1;(slope 
of tangent} = . The slope of the line from F, is 
S= . By the reflection property, 


5 


I ] 1 
S=cot 20= 5 (cot d- tan 0}= 5(s- :). 


Test your numbers s and § against this equation. 


30 Figure B proves the reflecting property of an ellipse. 
R is the mirror image of F, in the tangent line; Q is any other 
point on the line. Deduce steps 2, 3, 4 from 1, 2,3: 


. PF, +PF,<QF,+OF, tleft side = 2a, Q is outside) 
| PR+ PF, <QR+ OF, 

. Pis on the straight line from F, to R 

x = ft: the reflecting property is proved. 


Bw he 


31 The ellipse (x — 3)?/4 +(y— 1)?/4 = | is really a 
with center at and radius ___ . Choose X and 
Y¥ to produce X74 ¥7= 1, 


32 Compute the area of a square that just fits inside the 
ellipse x? sa? + 7/6? = 1. 


33 Rotate the axes of x? + xy + y? = 1 by using equation (7) 
with sin x=cos x= 1/\/2, The xy’ equation should show an 
ellipse. 


34 What are a,b,c for the Earth's orbit around the sun? 


35 Find an equation for the hyperbola with 
(a) vertices (0, +1}, foci (0, +2) 
(b) vertices (0, +3), asymptotes y = + 2x 
{c) (2, 3) on the curve, asymptotes y= +x 


36 Find the slope of }7—x?=1 at (xo, ¥9). Show that 
¥¥o — XXq = | goes through this point with the right slope (it 
has to be the tangent line). 


37 If the distances from (x, y) to (8, 0) and (—8, 0) differ by 
10, what hyperbola contains (x, ¥)? 


38 If a cannon was heard by Napoleon and one second later 
by the Duke of Wellington, the cannon was somewhere on a 
with foci at 


39 y:?—4y is part of (y—2)?= and 2x?+4 12x 
is part of 2(x+3y= Therefore? — 4p — 
2x?—12x=0 gives the hyperbola (y — 2)?—2(x+3)= 

. [ts center is and it opens to the . 


40 Following Problem 39 turn y?+2y=x?4 10x into 
¥?7=X?4C with X, Y, and C equal to 


41 Draw the hyperbola x? —4y?=1 and find its foci and 
asyniptotes, 


Problems 42-46 are about second-degree curves (conics). 


42 For which A, C, F does Ax? + Cy? + F =0 have no solu- 
tion (empty graph)? 


43 Show that x? + 2xy 4 y? + 2x + 2y + | =Ois the equation 
(squared) of a single line. 


44 Given any points in the plane, a second-degree 
curve 4x? + + + F =0 goes through those pomts. 


45 (a) When the plane s=ax+hy+¢ meets the cone 
2? =x*+ y*, eliminate z by squaring the plane equation. 
Rewrite in the form Ax? + Bxy + Cy? — Dx + Ey + F =0. 
(b) Compute B?. 4AC in terms of a and b. 

{c} Show that the plane meets the cone in an ellipse if 
a? +b? <1 and a hyperbola if a? + 6? > 1 (steeper). 


46 The roots of ax? + bx + ¢ = Oalso involve the special com- 
bination b? — 4ac, This quadratic equation has two real roots 
if and no real roots if . The reots come 
together when b? =4ac, which is the borderline case like a 
parabola. 
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Re §8§=693.6 Iterations x,,,=F(x,) Ti 


Iteration means repeating the same function. Suppose the function is F(x) = cos x. 
Choose any starting value, say x) = 1. Take its cosine: x, = cos X9 = .54. Then take 
the cosine of x,. That produces x, = cos .54 = .86. The iteration is x,., =cos x,. I] 
am in radian mode on a calculator, pressing “cos” each time. The early numbers are 
not important, what is important is the output after 12 or 30 or 100 steps: 


EXAMPLE 4 X12 =.75, X43 =.73, X14 =.74, Bers X29 =.7391, X39 = 7391, 


The goal is to explain why the x’s approach x* = .739085 ..... Every starting value 
Xo leads to this same number x*. What is spectal about .7391? 


Note on iterations Do x, =cos X9 and x, =cos x, mean that x, = cos? x9? Abso- 
lutely not! Iteration creates a new and different function cos(cos x). It uses the cos 
button, not the squaring button. The third step creates F(F(F(x))). AS soon as you 
can, iterate with x,,, =4 cos x,. What limit do the x's approach? Is it 4(.7931)? 


Let me slow down to understand these questions. The central idea is expressed by 
the equation x,,,, = F(x,). Substituting xg into F gives x,. This output x, is the input 
that leads to x,. In its turn, x, is the input and out comes x, = F(x). This is iteration, 
and it produces the sequence x9, X,, X2,..-. 

The x’s may approach a limit x*, depending on the function F. Sometimes x* also 
depends on the starting value x9. Sometimes there is no limit. Look at a second 
example, which does not need a calculator. 


EXAMPLE 2 x,., = F(x,) =3x,+ 4. Starting from x9 = 0 the sequence ts 
Xy=$°044=4, x, =9°444=6, 13 =9°64457, xg =4-74+4= 75, .... 
Those numbers 0, 4, 6, 7, 75, ... seem to be approaching x* = 8. A computer would 

convince us. So will mathematics, when we see what is special about 8: 
When the x’s approach x*, the limit of x,,, =3x,+4 
is x* =4x*+4. This limiting equation yields x* = 8. 
8 is the “‘steady state’? where input equals output: 8 = F(8). It is the fixed point. 


If we start at x9 = 8, the sequence is 8, 8, 8,.... When we start at xy = 12, the 
sequence goes back toward 8: 


x, =$°12+4=10, x, =3°104+4=9, x3 =3-94+4=85, 
Equation for limit: If the iterations x, , = F(x,) converge to x*, then x* = F(x*). 


To repeat: 8 is special because it equals 3-8 + 4. The number .7391... is special because 
it equals cos .7391.... The graphs of y = x and y = F(x) interseet at x*. To explain why 
the x’s converge (or why they don't) is the job of calculus. 


EXAMPLE 3 x,-, = x2 has two fixed points: 0 = 0? and | = 17. Here F(x) = x’. 


Starting from xg = 4 the sequence 4, 7g, x4g, ... goes quickly to x* =0. The only 


approaches to x* = | are from x9 = | (of course) and from xg = — 1. Starting from 
Xo = 2 we get 4, 16, 256. ... and the sequence diverges ta + x. 


Each limit x* has a “basin of attraction.” The basin contains all] starting points xp 
that lead to x*. For Examples | and 2, every xo Ied to .7391 and 8. The basins were 


3.6 Iterations x, , = F(x,) 


the whole line (that is still to be proved). Example 3 had three basins—the interval 
-1<x,< 1, the two points x9 = + 1, and all the rest. The outer basin |x| > 1 led 
to +, I challenge you to find the limits and the basins of attraction (by calculator) 
for F(x) = x — tan x. 

In Example 3, x* = O is attracting. Points near x* move toward x*. The fixed point 
x* = 1 is repelling. Points near 1 move away. We now find the rule that decides 
whether x* is attracting or repelling. The key is the slope dF jdx at x*. 


3s Start from any xo near a fixed point x* = F(x*): 


x* is attracting if |dF/dx| is below 1 at x* 


x* is repelling if |dF/dx| is above 1 at x*. 


First I will give a calculus proof. Then comes a picture of convergence, by “cobwebs.” 
Both methods throw light on this crucial test for attraction: |dF/dx|< 1. 

First proof: Subtract x* = F(x*) from x,,,, = F(x,). The difference x,,,—x* is 
the same as F(x,}— F(x*). This is AF. The basic idea of calculus is that AF is close 
to F'Ax: 


Xa+1 — X* = P(x,) — F(x*) & F'(x* x, — x"). () 


The “error” x, —x* is multiplied by the slope dF/dx. The next error x,;; — x* is 
smaller or larger, based on |F’}< 1 or |F'| > 1 at x*. Every step multiplies approxi- 
mately by F’(x*). its size controls the speed of convergence. 


In Example 1, F(x) is cos x and F'{x) is —sin x. There is attraction to .7391 
because |sin x*|< 1. Ia Example 2, F is 3x + 4 and F’ is }. There is attraction to 8. 
In Example 3, F is x? and F’ is 2x. There is superattraction to x* = 0 (where F’ = Q). 
There is repulsion from x* = | (where F’ = 2), 


I admit one major difficuity. The approximation in equation (1) only holds near 
x*, If xp is far away, does the sequence still approach x*? When there are several 
attracting points, which x* do we reach? This section starts with good iterations, 
which solve the equation x* = F(x*) or f(x)=0. At the end we discover Newton’s 
method, The next section produces crazy but wonderful iterations, not converging 
and not blowing up. They lead to “fractals” and “Cantor sets’ and “chaos.” 

The mathematics of iterations is not finished. It may never be finished, but we are 
converging on the answers. Please choose a function and join in. 


THE GRAPH OF AN ITERATION: COBWEBS 


The iteration x,,, = F(x,) involves two graphs at the same time. One is the graph 
of y = F(x). The other is the graph of y = x (the 45” line). The iteration jumps back 
and forth between these graphs. It is a very convenient way to see the whole process. 

Example | was x,., = cos x,. Figure 3.19 shows the graph of cos x and the “‘cob- 
web.” Starting at (xg, X,) on the 45° line, the rule is based on x, = F(xq): 


From (x9, Xo) gO up or down to (xq, x;) om the curve. 
From (xo, x,) go across to (x,, x,) om the 45° line. 


These steps are repeated forever. From x, go up to the curve at F(x,}. That height 
is X2. Now cross to the 45° line at (x;,x,). The iterations are aiming for (x*, x*}= 
(.7391, .7391). This is the crossing point of the two graphs y = F(x) and y= x. 


131 


132 


3 Applications of the Derivative 


F(x) = cos x 


you 


54 .739 86 1 


x * 
%; x 4X Ay *o cad | 42 73 4 


Aig. 3.49 Cobwebs go from (x9, x9) to (x9, X;) to (x,, x, line to curve to line. 


Example 2 was x,4, = 4x, + 4. Both graphs are straight lines. The cobweb is one- 
sided, from (0, 0} to (0, 4) to (4, 4) to (4, 6} to (6, 6). Notice how y changes (vertical 
line} and then x changes (horizontal line). The slope of F(x) is 4, so the distance to 8 
is multiplied by 4 at every step. 

Example 3 was x,,, = x7. The graph of y= x? crosses the 45° line at two fixed 
points: 0? = 0 and 1? = 1. Figure 3.20a starts the iteration close to 1, but it quickly 
goes away. This fixed point is repelling because F’(1}= 2. Distance from x* = f is 
doubled (at the start). One path moves down to x* =0—which is superattractive 
because F’= 0. The path from xg > 1 diverges to infinity. 


EXAMPLE 4 F(x) has two attracting points x* (a repelling x* is always between). 


Figure 3.20b shows two crossings with slope zero, The iterations and cobwebs con- 
verge quickly, [n between, the graph of F(x) must cross the 45° line from below. That 
requires a slope greater than one. Cobwebs diverge from this unstable point, which 
separates the basins of attraction. The fixed point x= 2 is in a basin by itself! 


Note 1 To draw cobwebs on a calculator, graph y= F(x) on top of y=x. Ona 
Casio, onc way is to plot (x9, xo) and give the command LINE: PLOT X,Y 
followed by EXE. Now move the cursor vertically to y = F(x) and press EXE. Then 
move horizontally to y= x and press EXE. Continue. Each step draws a line. 


Ag. 3.20 Converging and diverging cobwebs: F(x)=x? and F(x)=x—sin x. 


3.6 Herations x,. = F(x,) 


For the TI-81 (and also the Casio) a short program produces a cobweb. Store F(x) 
in the Y= function slot Y 1. Set the range (square window or autoscaling). Run the 
program and answer the prompt with xo: 


PrgmC:COBWEB <:Disp “INITIAL XO" :zInput xX :ALL-Off 
7¥q-On :"X"5Y4 :Lbl 1 :X58 !:Y¥43T :sLine(S,S,5,T) 
sLine(S,1T,T,T) :T>7X :Pause :Goto 1 


Note 2 The x’s approach x* from one side when 0 < dF/dx < 1. 


Note 3 A basin of attraction can include faraway x's (basins can come in infinitely 
many pieces). This makes the problem interesting. If no fixed points are attracting, 
see Section 3.7 for “cycles” and “chaos.” 


THE ITERATION x,,., =X, — cf(x,) 


At this point we offer the reader a choice. One possibility is to jump ahead to the 
next section on “Newton’s Method.” That method is an iteration to solve f(x) =0. 
The function F(x) combines x, and f(x,) and {’(x,) into an optimal formula for x,.,. 
We will see how quickly Newton’s method works (when it works). It is the outstanding 
algorithm to solve equations, and it is totally built on tangent approximations. 

The other possibility is to understand (through calculus) a whole family of itera- 
tions. This family depends on a number c, which is at our disposal. The best choice 
of c produces Newton’s method. I emphasize that iteration is by no means a new 
and peculiar idea. It is a fundamental technique in scientific computing. 

We start by recognizing that there are many ways to reach f(x") =0. (I write x* 
for the solution.) A good algorithm may switch to Newton as it gets close. The 
iterations use f(x,) to decide on the next point x, : 


Xn+1 F(x,)= Xn ef (x,). (2) 


Notice how F(x) is constructed from {(x)—they are different! We move f to the right 
side and multiply by a “preconditioner” c. The choice of c (or c,, if it changes from 
step to step) is absolutely critical. The starting guess xg is also important—but its 
accuracy is not always under our control. 

Suppose the x, converge to x*. Then the limit of equation (2) is 


x¥ = x* — of (x*). (3) 


That gives f(x*) = 0. If the x,’s have a limit, it solves the right equation. It is a fixed 
point of F (we can assume c, > c #Oand f(x,) + f(x*)). There are two key questions, 
and both of them are answered by the slope F’(x*): 


1. How quickly does x, approach x* (or do the x, diverge)? 
2. What is a good choice of c (or c,)? 


EXAMPLES (f(x)=ax—b is zero at x* = b/a. The iteration x,., =x, — c{ax, — b) 
intends to find b/a without actually dividing. (Early computers could not divide; they 
used iteration.) Subtracting x* from both sides leaves an equation for the error: 


Xn¢1 ~X* =x, — x* — clax, — 5). 
Replace b by ax*. The right side is (1 — ca)(x, — x*). This “error equation”’ is 


(error), 4, = (1 — ca}(error),. (4) 
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At every step the error is multiplied by (1 — ca), which is F'. The error goes to zero if 
|F'| is fess than 1. The absolute value |1 — cal decides everything: 


X, converges to x* if and only if —1<1—ca< 1. (5) 


The perfect choice (if we knew it) is c= 1/a, which turns the multiplier 1— ca into 
zero. Then one iteration gives the exact answer: x, = X9 — (1/a)(ax, — b) = b/a. That 
is the horizontal line in Figure 3.21a, converging in one step. But look at the other 
lines. 

This example did not need calculus. Linear equations never do. The key idea is 
that close to x* the nonlinear equation f(x) = 0 is nearly linear. We apply the tangent 
approximation. You are seeing how calculus is used, in a problem that doesn’t start 
by asking for a derivative. 


THE BEST CHOICE OF c 


The immediate goal is to study the errors x,—x*. They go quickly to zero, if 
the multiplier is small. To understand x,,,=x,—cf(x,), subtract the equation 
x* = x* — of (x*): 

Xt — X¥ = Xq— X*— Cf (Xp) —f(%*)). (6) 


Now calculus enters. When you see a difference of f’s think of df/dx. Replace 
{(x,) — f(x") by A(x, — x"), where A stands for the slope df/dx at x*: 
Xq_a1 — x* @(L—cA)(x, — x*). (7) 


This is the error equation. The new error at step n+ | is approximately the old error 
multiplied by m= t — cA. This corresponds to m= 1— ca in the linear example. We 
keep returning to the basic test |m| = |F’(x*)| <1: 


There is only one difficulty: We don’t know x*. Therefore we don’t know the perfect 
c. It depends on the slope A =f’ (x*) at the unknown solution. However we can come 
close, by using the slope at x,: 


Choose c, = 1/f'(Xq). Then Xq 41 = Xn — F(X W/L Xn) = Fl%q)- 


This is Newton’s method. The multiplier m= 1 — cA is as near to zero as we can make 
it. By building df/dx into F(x), Newton speeded up the convergence of the iteration. 


F(x} F(x) Fx*) 
x—clax—b): good X-fOYfF'(%y) 9.018 


Xx ~liax —b) ; best x-fQ@yYf'(x) 0.000 
- x-fix) 1.435 


ee ope tae 
x ~Zlax —)} : fail 


Xp fix) =2x—-cos x 


Fig. 3.24 The error multiplier ts m= 1 —cf’(x*). Newton has ¢ = 1/f"(x,) and m > 0. 


3.6 erations X,..4= F(X.) 


EXAMPLE 6 Solve f(x) = 2x —cos x = 0 with different iterations (different c’s). 


The line y= 2x crosses the cosine curve somewhere near x = 4. The intersection 
point where 2x* = cos x* has no simple formula. We start from xo = 4 and iterate 
X41 = X, — C(2x, — Cos x,) with three different choices of c. 


Take c= 1 of c=1/f'{x) or update c by Newton’s rule c, = 1/f"(x,): 
Xo = .50 c=1 c= Lf" (x9) c,= 1/f'(x,) 


x= 38 45063 45062669 
x2 = 55 45019 45018365 
x,= 30 45018 45018361... 


The column with c= 1 is diverging (repelled from x"). The second column shows 
convergence (attracted to x*}. The third column (Newton’s method) approaches x* 
so quickly that .4501836 and seven more digits are exact for x3. 

How does this convergence match the prediction? Note that f’(x}=2+ sin x so 
A= 2.435. Look to see whether the actual errors x, — x*, going down each column, 
are multiplied by the predicted m below that column: 


c=1 c= 1/(2 +sin 4) ¢C, = 1/(2 + sin x,) 
Xg —x*= 0.05 4.98 - 107? 498-1072 
X,7x*= — 0.07 443-1074 443-1074 
X2—x*= 0.10 7.88 + 10° ° 3.63- 10-8 
Xy—x* = —0.15 1.41°10°7 2.78 + 107 *° 
multiplier m=-1.4 m= 018 m — 0 (Newton) 


The first column shows a multiplier below —1. The errors grow at every step. Because 
mis negative the errors change sign—the cobweb goes outward, 

The second column shows convergence with m = .018. It takes one genuine Newton 
step, then ¢ is fixed. After n steps the error is closely proportional to m" = (.018)'— 
that is “‘dimear convergence” with a good multiplier. 

The third column shows the “quadratic convergence” of Newton’s method. 
Multiplying the error by m is more attractive than ever, because m— 0. In fact m 
itself is proportional to the error, so at each step the error is squared. Problem 3.8.31 
will show that (error), , , < M(error)?. This squaring carries us from 10~? to 107* to 
10~* to “machine e” in three steps. The number of correct digits is doubled at every 
step as Newton converges. 


Note 1 The choice c=! produces x,., =x, —f(x,). This is “successive substitu- 
tion.” The equation f(x) =0 is rewritten as x = x — f(x), and each x, is substituted 
back to produce x,+,. Iteration with c= 1 does not always fail! 


Note 2 Newton’s method is successive substitution for f/f’, not f. Then m= 0, 


Note 3 Edwards and Penney happened to choose the same example 2x = cos x. But 
they cleverly wrote it as x,., =4cos x,, which has |F’| =[4 sin x| < 1. This iteration 
fits into our family with c = 4, and it succeeds. We asked earlier if its limit is $(.7391). 
No, it is x* = .450.... 
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Note 4 The choice c= 1/f'’(xg) is “modified Newton.” Alter one step of Newton’s 
method, c is fixed. The steps are quicker, because they don’t require a new f’(x,). 
But we need more steps. Millions of dollars are spent on Newton’s method, so speed 
is important. In all its forms, f(x) =0 is the central problem of computing. 


3.6 EXERCISES 


Read-tbrough questions 


X,+, =X? describes,an _a_. After one step xy=__b_, 
After two steps x. = F(x,;}=__¢ _. [fit happens that input = 
output, orx*=  d_  thenx*isa_e@ point. F = x* has 


ft fixed points, atx* =__ 9 __. Starting near a fixed point, 
the x, will converge to it if _h <1. That is because 
Xyty —X* = F(x,)— F(x*) x __] The point is cailed 
i. The x, are repelled if _« . For F = x° the fixed 
points have F’=__ | __. The cobweb goes from (xo, xg) to 
( , }to¢€ , ) and converges to (x*,x*}= om __ This 


is an intersection of y=x? and y=__n_, and it is super- 
attracting because __ 9 


f(x) =0 can be solved iteratively by x,,, =x,—cf(x,), in 
which case F'{x*}=_ p__. Subtracting x* = x* — of(x*}, the 
crror equation is x,,,—x* =m(_@__). The multiplier is 


m=__'_. The errors approach zero if ___s __. The choice 
C,=__! _ produces Newton’s method. The choice c=! is 
“successive U  " ande=__¥__ is modified Newton. Con- 


vergence to x* is | certain. 


We have three ways to study iterations x,,,—F(x,): 
(1} compute x,, x2,... from different x, (2) find the fixed 
points x* and test |dF/dx|<1 (3) draw cobwebs. 


In Problems 1-8 start from x)= .6 and x)= 2. Compute 
X;,X2,... f0 test convergence: 


2 X41 = 2x,(] — Xy) 


4 Xeni = Ul Xm 


6 Xaei =XG+X,—2 


I x.61 =X, 

3 reve 

5 xy01 = 3x,(1—x,) 
8 x,45 =x, 


9 Check dF /dx at all fixed points in Problems !-6. Are they 
attracting or repelling? 


7 Xp+y = $x, — 1 


10 From x, = — t compute the sequence x,4, = — x2. Draw 
the cobweb with its “cycle.” Two steps produce x,,, = x2, 
which has the fixed potnts 


11 Draw the cobwebs for x,,, =4x, — 1 and x,.,=1—4,x, 
starting from x)= 2. Rule: Cobwebs are two-sided when 
dF /dx is : 


12 Draw the cobweb for x,,,=x2?—1 starting from the 
periodic point x») =0. Another periodic point is : 
Start nearby at xp=.1 to see if the iterations are 
attractedto0, —1,0, —1,.... 


Solve equations 13~—16 within 1% by iteration. 
13 x =cos 4x 14 x =cos*x 
15 x =cos \/x 16 x=2x—-—1(”} 


17 For which numbers a does x, 4; = a(x, — x2) converge to 
x*=0? 


18 For which numbers a does x,,, = a{x,— x?) converge to 
x =(a— 1)/a? 


19 Iterate x, 4, =4{x, — x?) to see chaos. Why don't the x, 
approach x* = 3? 


20 One fixed point of F(x} = x? —4 is attracting, the other is 
repelling. By experiment or cobwebs, find the basin of xo’s 
that go to the attractor. 


21 (important} Find the fixed point for F(x)=ax+s. When 
is it attracting? 


22 What happens in the linear case x,,,;—=ax,+4 when 
a=1[ and when g=— 1? 


23 Starting with $1000, you spend half your money each year 
and a rich but foolish aunt gives you a new $1000. What is 
your steady state balance x*? What is x* if you start with a 
million dollars? 


24 The US national debt was once $1 trillion. Inflation 
reduces its real value by 5% each year (so multiply by 
a= .95), but overspending adds another $100 billion. What 
is the steady state debt x*? 


25 x,11~/x, has the fixed point x*=./b, Show that 
|¢dF/dx|=1 at that point--what is the sequence starting 
from X9? 


26 Show that both fixed points of x,,,=x?+x,—3 are 
repelling. What do the iterations do? 


27 A $5 calculator takes square roots but_not cube roots. 
Explain why x,.,=.,/2/x, converges to 2. 


28 Start the cobwebs for x, , =sin x, and x,,, =tanx,. In 
both cases dF /dx = i at x* = 0. (a} Do the iterations converge? 
(b) Propose a theory based on F” for cases when F’ = 1. 


Solve f(x} = 0 in 29-32 by the iteration x,., = x, —e¢f(x,), to 
find a c that succeeds and a ¢ that fails. 

29 f(x}=x?-4 3 f(xy)=x? -—4x43 

31 f@=(x-2)-1 32 f(x}=(1—x)7 1-3 


3.7 Newton's Method (and Chaos) 437 


33 Newton's method computes a new c= 1/f‘{x,) at each 
step. Write out the iteration formulas for f(x)=x>?-—2=0 
and f(x)=sin x—4$=0. 

34 Apply Problem 33 to find the first six decimals of 3/2 
and 7/6. 


35 By experiment find each x* and its basin of attraction, 
when Newton’s method is applied to f(x) =x? — 5x +4. 


36 Test Newton’s method on x? — 1=0, starting far out at 
Xo = 10°. At first the error is reduced by about m= 4. Near 
x* =f the multiplier approaches m = 0. 


37 Find the multiplier m at each fixed point of x,4,= 
X,— (x2 —x,). Predict the convergence for diferent ¢ (to 
which x*?), 


38 Make a tabie of iterations for ¢ = 1 and ¢ = 1/f'(xq) and 
c=t/f'(x,), when f(x} = x? —4 and x) =1. 


39 In the iteration for x? - 2 =0, find dF/dx at x*: 


(b} Newton’s iteration has F(x)=x—f(x)/f'(x). Show 
that F’=0 when f{x}=0. The multiplier for Newton is 


m=0. 


40 What are the solutions of f(x)=x?+2=0 and why is 
Newton’s method sure to fail? But carry out the iteration to 
see whether x, — 00. 


41 Computer project F(x) = x — tan x has fixed points where 
tan x* = 0, So x* is any multiple of x. From x9 = 2.0 and 1.8 
and 1.9, which multiple do you reach? Test points in 
1.7< x9 < 1.9 to find basins of attraction to x, 22, 3x, 4n. 

Between any two basins there are basins for every multiple 
of x. And more basins between these (a fractaf\, Mark them 
on the line from 0 to 2. Magnify the picture around x, = 1.9 
(in color?). 


42 Graph cos x and cos(cos x) and cos(cos(cos x)). Also 
(cos)*x. What are these graphs approaching? 


t Z 
ee (. te 2), graphs approaching? Why so slow? 


3.7 Newton’s Method (and Chaos) 


The equation to be solved is f(x) =0. Its solution x* is the point where the graph 
crosses the x axis. Figure 3.22 shows x* and a starting guess xy. Our goal is to come 
as close as possible to x*, based on the information {(x9) and f‘(xq). 

Section 3.6 reached Newton's formula for x, (the next guess), We now do that directly. 

What do we see at x9? The graph has height f(x») and slope f‘(x,). We know 
where we are, and which direction the curve is going. We don’t know if the curve 
bends (we don’t have f”). The best plan is to follow the tangent line, which uses all 
the information we have. 

Newton replaces f(x} by its linear approximation (= tangent approximation): 


L(x) & {%o) +S’ Xo (x — Xo). (1) 

We want the left side to be zero. The best we can do is to make the right side zero! 

The tangent line crosses the axis at x,, while the curve crosses at x*. The new guess 

x, comes from f(x9) +f'(xo)(x1 — Xo) = 0. Dividing by {’(x9) and solving for x, , this 
is step 1 of Newton's method: 

I {Xo} 

X, =X : 

7° F(X) 

At this new point, compute f(x,) and f’{x,)—the height and slope at x,. They 

give a new tangent line, which crosses at x2. Af every step we want {(x,.,)=0 and 

we settle for {(x,)+J'(x,)(%,41 — X,)= 0. After dividing by f'(x,), the formula for 

X,41 is Newton’s method. 


(2) 


43 Graph sin x and sin(sin x) and (sin)®x. What are these 
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3L_ The tangent line from x, crosses the axis at x,41: 


I (Xn) 
f' (Xn) 


Usually this iteration x, ,, = F(x,) converges quickly to x*. 


Newton's method Xu41 =X, — 


to 


a | 
{= —2 


converge to x* = .5 


' XH = of Xo = 1.21 


follow 
tangent 
line 


Fig. 3.22 Newton’s method along tangent lines from x9 to x; to x2. 


Linear approximation involves three numbers. They are Ax (across) and Af (up) 
and the slope f'(x). If we know two of those numbers, we can estimate the third. It 
is remarkable to realize that calculus has now used all three calculations—they are 
the key to this subject: 


1. Estimate the slope f'(x) from Af/Ax (Section 2.1) 
2. Estimate the change Af from f'(x) Ax (Section 3.1) 
3. Estimate the change Ax from Af/f'(x) (Newton’s method) 


The desired Af is —f(x,,). Formula (3) is exactly Ax = —f(x,)/f"(x,). 


EXAMPLE 4 (Square roots) f(x)=x?—b is zero at x*= ./b and also at — /b. 
Newton’s method is a quick way to find square roots—probably built into your 
calculator. The slope is f’(x,,) = 2x,, and formula (3) for the new guess becomes 
x gag ay elk as 
“n+1 ak | “n x 


2Xn 


(4) 


This simplifies to x,,, ; =3(x, + b/x,). Guess the square root, divide into b, and average 
the two numbers. The ancient Babylonians had this same idea, without knowing 
functions or slopes. They iterated x, ,, = F(x,): 


l b aS mee 
F(x) = 5(x+ 4 and F'(x) = 5(1 a): (5) 


The Babylonians did exactly the right thing. The slope F’ is zero at the solution, when 
x? = b. That makes Newton’s method converge at high speed. The convergence test 
is |F’(x*)| < 1. Newton achieves F’(x*) = 0—which is superconvergence. 


3.7 Newton's Method (and Chaos) 


To find J/4, start the iteration x,,, = 4(x, + 4/x,) at x» = 1. Then x, =9(1 + 4): 
M,=2.5 x2=2.05 x3=2.0006 x, =2,000000009. 


The wrong decimal is twice as far out at each step. The error is squared. Subtracting 
x* =2 from both sides of x,,, = F(x,) gives an error equation which displays that 
square: 

4 


! 1 
tor 2a 5 (mt 2) —2= 5 tea (6) 


This is (error),,, 4(error)?. It explains the speed of Newton’s method. 


Remark 1 You can’t start this iteration at x, = 0. The first step computes 4/0 and 
blows up. Figure 3.22a shows why—the tangent line at zero is horizontal. It will 
never cross the axis. 


Remark2 Starting at x, = — 1, Newton converges to — /2 instead of + <2: That 
is the other x*, Often it is difficult to predict which x* Newton’s method will choose. 
Around every solution is a “‘basin of attraction,” but other parts of the basin may be 
far away. Numerical experiments are needed, with many starts x9. Finding basins of 
attraction was one of the problems that led to fractals. 


1 
EXAMPLE 2 Solve es 0 to find x* = ! without dividing by a. 


Here f(x) = (1/x)— a. Newton uses f’(x) = — 1/x?. Surprisingly, we don’t divide: 


_ Ox) — 4 


=e =x, +x, —ax2. (7) 


Xn+1 = Xy 
Do these iterations converge? I will take a = 2 and aim for x* = 4. Subtracting 4 from 
both sides of (7) changes the iteration into the error equation: 


Xp+1 = 2x, — 2x? becomes x,4,—4= — 2(x,— 4). (8) 
At each step the error is squared. This is terrific if (and only if) you are close to 
x* = 4. Otherwise squaring a large error and multiplying by — 2 is not good: 


Xo= .7O x= 42 x= 487 x5=.4997 x, = 4999998 
Xo=h2b x =-.5 xg =-15 wy =-75  Xg= - 1275 


The algebra in Problem 18 confirms those experiments. There is fast convergence if 
0<x,< 1. There is divergence if x, is negative or x9 > 1. The tangent line goes to a 
negative x,. After that Figure 3.22 shows a long trip backwards. 


In the previous section we drew F(x). The iteration x,., = F({x,) converged to the 
45° line, where x* = F(x"). In this section we are drawing f(x). Now x” is the point 
on the axis where f(x*}= 0. 

To repeat: It is {(x*}=0 that we aim for. But it is the slope F’(x*) that decides 
whether we get there. Example 2 has F(x)= 2x — 2x?. The fixed points are x* =} 
(our solution) and x* = 0 (not attractive). The slopes F’(x*) are zero (typical Newton) 
and 2 (typical repeller). The key to Newton’s method is F’=0 at the solution: 


The slope of F(x)=x—- I(x) is LOS") Then F'(x) = 0 when f(x) = 0. 


$'(x) (f'())? 
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3 Applications of the Derivative 


The examples x? = b and 1/x = a show fast convergence or failure. In Chapter 13, 
and in reality, Newton’s method soives much harder equations. Here I am going to 
choose a third example that came from pure curiosity about what might happen. The 
results are absolutely amazing. The equation is x? = — 1. 


EXAMPLE 3 What happens to Newton’s method if you ask it to solve f(x) = x? + 1=0? 


The only solutions are the imaginary numbers x* =i and x* = — i. There is no real 
square root of —1. Newton’s method might as well give up. But it has no way to 
know that! The tangent line still crosses the axis at a new point x,+,, even if the 


curve y= x?+ 1 never crosses. Equation (5) still gives the iteration for b= — 1: 
| I 
Xn “> x2 = F(x,). (9) 
x, 


The x’s cannot approach i or —i {nothing is imaginary). So what do they do? 

The starting guess xg = | is interesting. It is followed by x, =0. Then x, divides 
by zero and blows up. I expected other sequences to go to infinity. But the experiments 
showed something different (and mystifying). When x, is large, x,,, is less than half 
as large. After x, = 10 comes x,4, =4(10— 2)= 4.95. After much indecision and a 
long wait, a number near zero eventually appears. Then the next guess divides by 
that small number and goes far out again. This reminded me of “chaos,” 

It is tempting to retreat to ordinary examples, where Newton’s method is a big 
success. By trying exercises from the book or equations of your own, you will see 
that the fast convergence to ./4 is very typical. The function can be much more 
complicated than x? — 4 (in practice it certainly is). The iteration for 2x = cos x was 
in the previous section, and the error was squared at every step. If Newton’s method 
starts close to x*, its convergence is overwhelming. That has to be the main point of 
this section: Fellow the tangent line. 

Instead of those good functions, may I stay with this strange example x? + 1 = 0? 
It is not so predictable, and maybe not so important, but somehow it is more interest- 
ing. There is no real solution x*, and Newton’s method x,,, = 4(x, — 1/x,) bounces 
around. We will now discover x,. 


A FORMULA FOR x, 


The key is an exercise from trigonometry books. Most of those problems just give 
practice with sines and cosines, but this one exactly fits $(x, — 1/x,): 


lfcos@ sin@\_ cos 26 i Sie Lt \_ 26 
a\an@ conOs ange. | aN Se Goepy 


In the left equation, the common denominator is 2 sin @ cos 8 (which is sin 24}. The 
numerator is cos?@— sin? @ (which is cos 26). Replace cosine/sine by cotangent, 
and the identity says this: 


If xg=cot@ then x, =cot 26. Then x,=cot 49. Then x,=cot 2"@. 
This is the formula. Our points are on the cotangent curve. Figure 3.23 starts from 
Xo = 2= cot 9, and every iteration doubles the angle. 


Example A The sequence xy = 1, x, = 0, x; = «© matches the cotangents of 2/4, 2/2, 
and z. This sequence blows up because x2 has a division by x, = 0. 


3.7 Newton's Method (and Chaos) 


Fig. 3.23. Newton's method for x? + 1 =0. Iteration gives x, = cot 2"0. 


Example B_ The sequence 1 1/3, —1 /J3, 1 1/3 matches the cotangents of 2/3, 27/3, 
and 4n/3. This sequence cycles forever because xp = x2 =X4=.... 


Example C Start with a large x, (a small @). Then x, is about half as large (at 26). 
Eventually one of the angles 46, 86, ... hits on a large cotangent, and the x’s go far 
out again. This is typical. Examples A and B were special, when 0/n was 3 or 3. 


What we have here is chaos. The x’s can’t converge. They are strongly repelled by 
all points. They are also extremely sensitive to the value of 6. After ten steps 6 is 
multiplied by 21° = 1024. The starting angles 60° and 61° look close, but now they 
are different by 1024°. If that were a multiple of 180°, the cotangents would still be 
close. In fact the x,,’s are 0.6 and 14. 

This chaos in mathematics is also seen in nature. The most familiar example is the 
weather, which is much more delicate than you might think. The headline ‘‘Fore- 
casting Pushed Too Far” appeared in Science (1989). The article said that the snow- 
balling of small errors destroys the forecast after six days. We can’t follow the weather 
equations for a month—the flight of a plane can change everything. This is a revolu- 
tionary idea, that a simple rule can lead to answers that are too sensitive to compute. 

We are accustomed to complicated formulas (or no formulas). We are not 
accustomed to innocent-looking formulas like cot 2” 6, which are absolutely hopeless 
after 100 steps. 


CHAOS FROM A PARABOLA 


Now I get to teil you about new mathematics. First I will change the iteration x, = 
4(x, — 1/x,} into one that is even simpler, By switching from x to z= 1/(1 + x”), each 
new z tums Out to involve only the old z and z?: 


Zn+1 = 42_ — 427. (10) 


This ts the most famous quadratic iteration in the world. There are books about it, 
and Problem 28 shows where it comes from. Our formula for x, leads to z,: 


2,= ——s =§ ——— = (sin 28)". ay 


141 


142 


3 Applications of the Dertvative 


The sine is just as unpredictable as the cotangent, when 2"@ gets large. The new thing 
is to locate this quadratic as the last member (when a= 4} of the family 


Zy41 = 4Z,—az27, O<aKc4, (12) 


Example 2 happened to be the middle member a = 2, converging to $. I would like 
to give a brief and very optional report on this iteration, for different a’s. 

‘The general principle is to start with a number z, between 0 and 1, and compute 
21,22, 23,-... It is fascinating to watch the behavior change as a increases. You can 
See it on your own computer. Here we describe some things to look for. All numbers 
stay between 0 and 1 and they may approach a limit. That happens when a is small: 


for O<a<1 the z, approach z*=0 
for 1 <a<3 the z, approach z* =(a— i)/a 


Those limit points are the solutions of z= F(z). They are the fixed points where 
z* = az* — a{z*)?. But remember the test for approaching a limit: The slope at z* 
cannot be larger than one. Here F = az — az? has F‘=a-— 2az. It is easy to check 
|F’| <1 at the limits predicted above. The hard problem—sometimes impossible— 
is to predict what happens above a= 3. Our case is a= 4. 

The 2’s cannot approach a limit when |F'(z*)| > 1. Something has to happen, and 
there are at least three possibilities: 


The z,’s can cycle or fill the whole interval (0, 1) or approach a Cantor set. 
I start with a random number Zo, take 100 steps, and write down steps 101 to 105: 


a=34 a= 35 a=38 a=40 


2101 = 842 875 336 169 
Z102 = 452 383 848 562 
Z103= 842 827 491 985 
Z104 = 452 501 950 060 
Zi0s = 842 875 182 225 


The first column is converging to a “2-cycle.” It alternates between x = .842 and 
y= 452. Those satisfy y= F(x) and x = F(y)= F(F(x)). If we look at a double step 
when a= 3.4, x and y are fixed points of the double iteration z,,, = F(F(z,)). When 
a increases past 3.45, this cycle becomes unstable. 

At that point the period doubles from 2 ta 4. With a= 3.5 you see a “4-cycle” in 
the table-—it repeats after four steps. The sequence bounces from .875 to .383 to .827 
to .501 and back to .875. This cycle must be attractive or we would not see it. But it 
also becomes unstable as a increases. Next comes an 8-cycle, which is stable in a little 
window (you could compute it} around a = 3.55. The cycles are stable for shorter and 
shorter intervals of a’s. Those stability windows are reduced by the Feigenbaum shrink- 
ing factor 4.6692.... Cycles of length 16 and 32 and 64 can be seen in physical 
experiments, but they are all unstable before a = 3.57. What happens then? 

The new and unexpected behavior is between 3.57 and 4. Down each line of 
Figure 3.24, the computer has plotted the values of 2,199; tO 22999 omitting the first 
thousand points to let a stable period (or chaos} become established. No points 
appeared in the big white wedge. I don’t know why. In the window for period 3, you 
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The period 2, 4,... is the number of z’s in a cycle. 5 


a=3.83 


Fig. 3.24 Period doubling and chaos from iterating F(z) (stolen by special permission from 
Introduction to Applied Mathematics by Gilbert Strang, Wellesley-Cambridge Press). 


see only three z's. Period 3 is followed by 6, 12, 24, .... There is period doubling at the 
end of every window {including all the windows that are too smail to see). You can 
reproduce this figure by iterating z,,, = az, — az? from any z, and plotting the results. 


CANTOR SETS AND FRACTALS 


I can’t tell what happens at a = 3.8. There may be a stable cycle of some long period. 
The z’s may come close to every point between 0 and 1. A third possibility is to 
approach a very thin limit set, which looks like the famous Cantor set: 


To construct the Cantor set, divide [0, 1] into three pieces and remove the open 
interval (4, $). Then remove (§, 3) and (%, §) from what remains. At each step 
take out the middle thirds. The points that are left form the Cantor set. 


All the endpoints 4, 3, §, 3, ... are in the set. So is 3 (Problem 42). Nevertheless the 
lengths of the removed intervals add to 1 and the Cantor set has “measure zero.” 
What is especially striking is its self-similarity: Between 0 and 4 you see the same 
Cantor set three times smaller. From 0 to 4 the Cantor set is there again, scaled down 
by 9. Every section, when blown up, copies the larger picture. 


Fractals That self-similarity is typical of a fractal. There is an infinite sequence of 
scales. A mathematical snowflake starts with a triangle and adds a bump in the 
middle of each side. At every step the bumps lengthen the sides by 4/3. The final 
boundary is self-similar, like an infinitely long coastline. 

The word “fractal” comes from fractional dimension. The snowflake boundary has 
dimension larger than 1 and smaller than 2. The Cantor set has dimension larger 
than 0 and smaller than 1. Covering an ordinary line segment with circles of radius 
r would take ¢/r circles. For fractals it takes c/r? circles—and D is the dimension. 
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3. Applications of the Derivative 


i) 


0 1/3 2/3 | 


Fig. 3.25 Cantor set (middle thirds removed). Fractal snowflake (infinite boundary). 


Our iteration z,,, = 4z,—4z? has a= 4, at the end of Figure 3.24. The sequence 
Zo, 7Z,,-.. goes everywhere and nowhere. Its behavior is chaotic, and statisticaj tests 
find no pattern. For al] practical purposes the numbers are random. 

Think what this means in an experiment (or the stock market). If simple rules 
produce chaos, there is absolutely no way to predict the results. No measurement can 
ever be sufficiently accurate. The newspapers report that Pluto’s orbit is chaotic— 
even though it obcys the law of gravity. The motion is totaily unpredictable over 
long times. I don’t know what that does for astronomy (or astrology). 

The most readable book on this subject is Gleick’s best-seller Chaos: Making a 
New Science. The most dazzling books are The Beauty of Fractals and The Science 
of Fractal Images, in which Peitgen and Richter and Saupe show photographs that 
have been in art museums around the world. The most original books are Mandel- 
brot’s Fractals and Fractal Geometry. Our cover has a fractal from Figure 13.11. 

We return to friendlier problems in which calculus is not helpless. 


NEWTON’S METHOD VS. SECANT METHOD: CALCULATOR PROGRAMS 


The hard part of Newton’s method is to find df/dx. We need it for the slope of the 
tangent line. But calculus can approximate by Af/Ax—using the values of f(x) 
already computed at x, and x,-_,. 

The secant method follows the secant line instead of the tangent line: 


S%q) AS\ _ f0)—SfOn—1) 
wi (4) - Xp— Xena 


The secant line connects the two latest points on the graph of f(x). Its equation is 
y —S (x,) = (Af /Ax}(x — x,). Set y=O to find equation (13) for the new x=x,,4,, 
where the line crosses the axis. 

Prediction: Three secant steps are about as good as two Newton steps. Both should 
give four times as many correct decimals: {error} -»(error)*. Probably the secant 
method is also chaotic for x* + 1=0, 

These Newton and secant programs are for the TI-81. Place the formula for f(x) 
in slot ¥4 and the formula for f‘{x) in slot Y2 on the Y =function edit screen. 
Answer the prompt with the initial x» = X@. The programs pause to display each 
approximation x,, the value f(x,), and the difference x, —x,-,. Press ENTER to 
continue or press ON and select item 2: Quit to break. If f(x,)=0, the programs 
display ROOT AT and the root x,. 


Secant. Xnti = Xq_7 (13) 
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PrgmN: NEWTON :Disp “ENTER FOR MORE” 
Disp "X@="" :Disp "ON 2 TO BREAK" 


zInput X Disp” " 

=XS§ :Disp "XN FXN XN-XNMT" 

:Y¥q>Y >Disp X 

:Lbl 1 :DispY 

:X-Y/Y2>X zDisp D 

:X-S$—D :Pause 

rX+S§ :If Y#@ 

:Y¥qi7Y >Goto 71 
:Disp "ROOT AT" 
:Disp X 


PrgmS:SECANT YT 


:Disp “X@=" rY¥4>Y 

sInput X Disp "ENTER FOR MORE" 
:X3$ >Disp "XN FXN XN-XNM1"” 
rY¥qoT :Disp X 

Disp "X1=" :Disp Y 

:Input X :DispD 

7Y¥417Y :Pause 

zLbL 1 :1f Y#@ 

>X-S3D :Goto 1 

2X 3§ :Disp "ROOT AT” 


2X-YD/CY-T) 3X :Disp xX 


3.7 EXERCISES 


Read-through questions 


When f(x}=0 is linearized to f(x,)+/’(x,\x —x,) =0, the 
solutionx=_ © is Newton’s x,,,.The__> to the curve 
crosses the axis at x,.,, while the _¢ __ crosses at x*. The 
errors at x, and x,4,,; are normally related by 
(error),,,;%M_@_. This is e convergence. The 
number of correct decimals __' at every step. 


For f(x) =x? —b, Newton’s iteration is x,,,; =__@ _. The 
x, converge to_o 9 ifxp>Qandto_! if x, <0. For 
f(x}=x? +1, the iteration becomes x,,, = 1. This can- 


not converge to ___* _. Instead it leads to chaos. Changing 
to z= 1/{x? + 1) yields the parabolic iteration z,.,=__! 


For a<3, z,., = az, —az? converges to a single _™ 
After a= 3 the limit is a 2-cycle, which means _® _. Later 
the limit is a Cantor set, which is a one-dimensional example 
ofa__° _. The Cantor set is self-__P 


1 To solve f(x) =x3—6=0, what iteration comes from 
Newton’s method? 


2 For f(x})=(x —1)/(x + 1} Newton’s formula is x,., = 
F(x,) = . Solve x* = F(x*) and find F’(x*). What 
limit do the x,’s approach? 


3 I believe that Newton only applied his method in public 
to one equation x? — 2x —5=0, Raphson carried the idea 
forward but got partial credit at best. After two steps from 
Xq = 2, how many decimals in x* = 2.09455148 are correct? 


4 Show that Newton’s method for f(x)=x'/3 gives the 


strange formula x,,,=-—2x,. Draw a graph to show the 
iterations. 


§ Find x, if (a) f({x9) =0; (b) f’(x9)=0. 


6 Graph f(x}= x — 3x —1 and estimate its roots x*. Run 
Newton’s method starting from 0, 1, — 4, and 1.1. Experiment 
to decide which x» converge to which root. 


7 Solve x? —6x+5=0 by Newton’s method with x9 = 2.5 
and 3. Draw a graph to show which xo lead to which root. 


8 If f(x} is increasing and concave up (f’>0 and f” > 0) 
show by a graph that Newton’s method converges. From 
which side? 


Solve 9-17 to four decimal places by Newton’s method with a 
computer or calculator. Choase any x» except x*. 
9 x? 10=0 
10 x*— 100 =0 (faster or slower than Problem 9?) 
11 x? —x=0 (which xp to which root?) 
12 x3 —x =0 (which xp to which root?) 
13 x+5 cos x =O {this has three roots) 
14 x+tan x =O {find two roots) (are there more?} 
1§ 1/(1-x}=2 
16 L4xtx2?4+x°4+x4*=2 
17 x3 +({x+1P =10° 


18 (a) Show that x,,, = 2x, — 2x? in Example 2 is the same 
as (If — 2x45 ,}=(1 —2x,)?. 
(b) Prove divergence if |1 — 2x»| > 1. Prove convergence 
if }f — 2xp[ <1 or O< x9 <1. 

19 With a@=3 in Example 2, experiment with the Newton 

iteration x, 4, = 2x, — 3x? to decide which x lead to x* = 4. 


20 Rewrite x,,, = 2x, —ax? as (1 — ax,4;)=(1—ax,)*. For 
which x9 does the sequence !—ax, approach zero (so 
X, — 1/a)? 

21 What is Newton’s method to find the kth root of 7? 
Calculate 7/7 to 7 places. 


22 Find all solutions of x} = 4x — f (5 decimals). 
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Problems 23-29 are about x? + 1 =0 and chaos. 

23 For 6=n/16 when does x,=cot 2"@ blow up? For 
6=n/7 when does cot 26 =cot 8? (The angles 2"6 and 6 
differ by a multiple of 1.) 

24 For @= 2/9 follow the sequence until x, = xp. 


25 For @=1, x, never returns to x9 =cot I. The angles 2” 
and 1 never differ by a multiple of 1 because 


26 Mf zy equals sin? 9, show that z, = 4zq — 422 equals sin? 26, 
27 If y =x? + I, each new y is 
; 1 1\ 
Yao. =Xae tia X_— +1. 
4 Xn 
Show that this equals y7/4(y, — 1). 


28 Turn Problem 27 upside down, I/y,+; = 4(y, — 1)/y2, to 
find the quadratic iteration (10) for z, = 1/y, = 1/(t + x?). 


29 If F(z} = 4z — 4z* what is F(F(z)}? How many solutions to 
z= F{F{z})? How many are not solutions to z = F(z)? 


30 Apply Newton's method to x? — .64x — .36 = Oto find the 
basin of attraction for x* = 1. Also find a pair of points for 
which y= F(z) and z= F{y). In this example Newton does 
not always find a root. 


31 Newton’s method solves x/(t-—x)=0 by x41 = 
. From which x» does it converge? The distance to 
x* =0 is exactly squared, 


Problems 33—41 are about competitors of Newton. 


32 At a double root, Newton only converges linearly. What 
is the iteration to solve x? = 0? 


33 To speed up Newton’s method, find the step Ax from 
S (Xn) + Axf(x,) + HAxPf"(x,) =O. Test on f(x)=x?-1 
from x9 =0 and explain, 

34 Halley’s method uses f, + Axfi,+4Ax(—f,/ fi) f) =0. For 
fodj=x?—1 and xy =!+e, show that x,=1+4 Ofe*)— 
which is cubic convergence. 

35 Apply the secant method to f(x)=x?—4=0, starting 
from x9 = | and x, =2.5. Find Af/Ax and the next point x2 
by hand. Newton uses f‘(x,}=5 to reach x. = 2.05. Which 
is closer to x* = 2? 


36 Draw a graph of f(x) = x? — 4 to show the secant line in 
Problem 35 and the point x, where it crosses the axis. 


3.8 The Mean Vaiue Theorem and I’Hépital’s Rule 


Bisection method If {({x) changes sign between x, and x;, find 
its sign at the midpoint x. = 4(xo + x,). Decide whether f(x) 
changes sign hetween x, and x, or x, and x,. Repeat on that 
half-length (bisected) interval. Continue. Switch to a faster 
method when the interval is small enough. 


37 f(x)=x? — 4 is negative at x = 1, positive at x = 2.5, and 
negative at the midpoint x = 1,75. So x* lies in what interval? 
Take a second step to cut the interval in half again. 


38 Write a code for the bisection method. At each step print 
out an interval that contains x*, The inputs are xp and x,; 
the code calls f(x). Stop if f(x) and f(x,} have the same 
sign. 

39 Three bisection steps reduce the interval by what 
factor? Starting from x, =0 and x, =8, take three steps for 
S(x) = x? — 10. 

40 A direct method is to zoom in where the graph crosses the 


axis. Solve 10x? —8.3x?+42.295x —.21141=0 by several 
zooms. 


41 If the zoom factor is 10, then the number of correct 
decimals for every zoom. Compare with Newton. 


42. The number $ equals $(1 + $+ a + +}. Show that it is in 
the Cantor set. It survives when middle thirds are removed. 


43 The solution to f(x) =(x — 1.9)/(x —2.0}=0 ts x* = 1.9. 
Try Newton’s method from xg = 1.5, 2.1, and 1.95. Extra 
credit: Which x,’s give convergence? 

44 Apply the secant method to solve cosx=0 from 
Xo = 308. 

45 Try Newton’s method on cosx=0 from x, =.308. If 
cot Xq is exactly x, show that x, =xo9+ (and x2 = x; +7). 
From x, =.308169071 does Newton’s method ever stop? 


46 Use the Newton and secant programs to solve 
x? ~ 10x? + 22x +6 =0 from xp = 2 and 1.39. 


47 Newton’s method for sinx=0 is x,4,; =x, —tan x,. 
Graph sin x and three iterations from xp =2 and x9 = 1.8. 
Predict the result for xg = 1.9 and test. This leads to the com- 
puter project in Problem 3.6.41, which finds fractals. 

48 Graph Y,(x}=3.4(x—x?) and Y,(x)= ¥,(¥i(x)) in the 
square window {0,0}<(x, y}<(i, 1). Then graph ¥3(x)= 
Y,(Y,(x)) and Y,,..., Yo. The cycle is from 842 to .452. 


49 Repeat Problem 48 with 3.4 changed to 2 or 3.5 or 4. 


Now comes one of the cornerstones of calculus: the Mean Value Theorem. It connects 
the local picture {slope at a point) to the global picture (average slope across an 
interval}. In other words it relates df/dx to Af/Ax. Calculus depends on this connec- 


3.8 The Mean Value Theorem and I’H6pital’s Rule 
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tion, which we saw first for velocities. If the average velocity is 75, is there a moment 
when the instantaneous velocity is 75? 

Without more information, the answer to that question is no. The velocity could 
be 100 and then 50—averaging 75 but never equal to 75. If we allow a jump in 
velocity, it can jump right over its average. At that moment the velocity does not 
exist. (The distance function in Figure 3.26a has no derivative at x = 1.) We will take 
away this cheap escape by requiring a derivative at all points inside the interval. 

In Figure 3.26b the distance increases by 150 when t¢ increases by 2. There is a 
derivative df/dt at all interior points (but an infinite slope at t= 0). The average 
velocity is 


At 2-0 2 


The conclusion of the theorem is that df/dt=75 at some point inside the interval. 
There is at least one point where f’(c) = 75. 

This is not a constructive theorem. The value of c is not known. We don’t find c, 
we just claim (with proof) that such a point exists. 


3M Mean Value Theorem Suppose f(x) is continuous in the closed interval 
a<x <b and has a derivative everywhere in the open interval a < x < b. Then 


S(b)—F(@) 
b-a 


=f'(c) at some point a<c <b. (1) 


The left side is the average slope Af/Ax. It equals df/dx ac c. The notation for a 
closed interval [with endpoints] is [a, b]. For an open interval (without endpoints) 
we write (a,b). Thus f’ is defined in (a, b), and f remains continuous at a and b. A 
derivative is allowed at those endpoints too—but the theorem doesn’t require it. 

The proof is based on a special case—when f(a)=0 and f(b)=0. Suppose the 
function starts at zero and returns to zero. The average slope or velocity is zero. We 
have to prove that f’(c)=0 at a point in between. This special case (keeping the 
assumptions on f{(x)) is called Rolle’s theorem. 

Geometrically, if f goes away from zero and comes back, then f’ = 0 at the turn. 


Proof Ata point inside the interval where f(x) reaches its maximum or minimum, 
df/dx must be zero. That is an acceptable point c. Figure 3.27a shows the difference 
between f= 0 (assumed at a and b) and f’ = 0 (proved at c). 
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Smail problem: The maximum could be reached at the ends a and 8, if f(x)<Oin 
between. At those endpoints df/dx might not be zero. But in that case the minimum 
is reached at an interior point c, which is equally acceptable. The key to our proof 
is that @ continuous function on [a, b] reaches its maximum and minimum. This is the 
Extreme Value Theorem.t 

It is ironic that Roile himself did not believe the logic behind calculus. He may not 
have believed his own theorem! Probably he didn’t know what it meant—the lan- 
guage of “‘evanescent quantities” (Newton) and “‘infinitesimals” (Leibniz) was exciting 
but frustrating. Limits were close but never reached. Curves had infinitely many flat 
sides. Rolle didn’t accept that reasoning, and what was really serious, he didn’t accept 
the conclusions. The Académie des Sciences had to stop his battles (he fought against 
ordinary mathematicians, not Newton and Leibniz). So he went back to number 
theory, but his special case when f(a) = f{b} = 0 leads directly to the big one. 


slope df/dx  -~ 
equals 
slope Af/Ax _ _ 


7 


f(a) 


fig. 3.27 Rolle’s theorem is when f(a) = f(b) =0 in the Mean Value Theorem. 


Proof of the Mean Value Theorem We are looking for a point where df /dx equals 
AfjAx. The idea is to tilt the graph back to Rolle’s special case (when Af was zero). 
In Figure 3.27b, the distance F(x) between the curve and the dotted secant line comes 
from subtraction: 


Fey=se)-| fa+ xa]. Q) 


At a and 6, this distance is F(a)= F(b) = 0. Rolle’s theorem applies to F(x). There is 
an interior point where F’(c)=0. At that point take the derivative of equation (2): 
0=f'(c)— (Af/Ax). The desired point ¢ is found, proving the theorem. 


EXAMPLE 4 The function f(x) = fx goes from zero at x = 0 to ten at x = 100. Its 
average slope is Af/Ax = 10/100. The derivative f’(x)= 1/2./x exists in the open 
interval (0, 100), even though it blows up at the end x=0. By the Mean Value 
Theorem there must be a point where 10/100 = f'(c) = 1/2/c. That point is c = 25, 

The truth is that nobody cares about the exact value of c. Its existence is what 
matters. Notice how it affects the linear approximation /(x) = f(a) + f‘(a}(x — a), 
which was basic to this chapter. Close becomes exact ( ~ becomes = ) when f’ is 
computed at c instead of a: 


tIf f{x} doesn’t reach its maximum M, then 1/(M —f(x)) would be continuous but also 
approach infinity. Essential fact: A continuous function on [a,b] cannot approach infinity. 
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30. The derivative at c gives an exact prediction of f(x): 
F(x) = f(a) + f(x — a). (3) 


The Mean Value Theorem is rewritten here as Af=f'(c)Ax. Now a<c<x. 


EXAMPLE 2 The function f(x)=sin x starts from f(0)=0. The linear prediction 
(tangent line) uses the slope cos 0 = 1. The exact prediction uses the slope cos ¢ at an 
unknown point between 0 and x: 


(approximate) sin x x x (exact) sin x = (cos c)x. (4) 


The approximation is useful, because everything is computed at x = a= 0. The exact 
formula is interesting, because cos c <1 proves again that sin x <x. The slope is 
below 1, so the sine graph stays below the 45° line. 


EXAMPLE 3 /f f'(c)=0 at all points in an interval then f(x) is constant. 


Proof When f’ is everywhere zero, the theorem gives Af=0. Every pair of points 
has f(b) = f(a). The graph is a horizontal line. That deceptively simple case is a key 
to the Fundamental Theorem of Calculus. 

Most applications of Af=f'(c)Ax do not end up with a number. They end up with 
another theorem (like this one). The goal is to connect derivatives (local) to differences 
(global). But the next application—/’H6pital’s Rule—manages to produce a number 
out of 0/0. 


L’HOPITAL’S RULE 


When f(x) and g(x) both approach zero, what happens to their ratio {(x)/g(x)? 


flx) _ x? sin x x — sin x 0 

——-=-—- or —— or ——— all become — at x=0. 

g(x) x x 1—cos x 0 
Since 0/0 is meaningless, we cannot work separately with f(x) and g(x). This is a 
“race toward zero,” in which two functions become small while their ratio might do 
anything. The problem is to find the limit of f(x)/g(x). 

One such limit is already studied. It is the derivative! Af/Ax automatically builds 

in a race toward zero, whose limit is df/dx: 


f(x) — f(a) > 0 Bae Hi 


I(x) —f(@) _ ,, 

x- a—-0 x-a X—-@ f(a). 0) 
The idea of Hopital is to use f'/g’ to handle f/g. The derivative is the special case 
g(x) = x — a, with g’ = 1. The Rule is followed by examples and proofs. 


3P l’Hépital’s Rule Suppose f(x) and g(x) both approach zero as x > a. Then 
J(x)/g(x) approaches the same limit as f’(x)/g'(x), if that second limit exists: 


lim f(x) = lim Ls) Normally this limit is f(a) (6) 


xa g(x) xa g'(x)" g(a)” 


This is not the quotient rule! The derivatives of f(x) and g(x) are taken separately. 
Geometrically, l H6pital is saying that when functions go to zero their slopes control 
their size. An easy case is f= 6(x — a) and g = 2(x — a). The ratio f/g is exactly 6/2, 
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f(x) = 6{x — a) 


B(x) = 2x - a) 


f(x}, f(a} f(x) f(a) 
. 3.28 —— ry me ~ 
Fig. 3. (a) a) is exactly ea) 3. {b) eK) approaches e(a) 3. 


the ratio of their slopes. Figure 3.28 shows these straight lines dropping to zero, 
controlled by 6 and 2. 

The next figure shows the same limit 6/2, when the curves are tangent to the lines. 
That picture is the key to |’Hdpital’s rule. 

Generally the limit of {/g can be a finite number £ or + 00 or — oo. (Also the limit 
point x= a can represent a finite number or + a or — oo. We keep it finite.) The 
one absolute requirement is that f(x) and g(x) must separately approach zero—we 
insist on 0/0. Otherwise there is no reason why equation (6) should be true. With 
f(x) =x and g(x) =x — 1, don’t use |’H6pital: 


f(x) a f'{x) 
“4 but 
g(x} ani & (x) 
Ordinary ratios approach lim f(x) divided by lim g(x). PHdépital enters only for 0/0. 


1—cos x 


EXAMPLE 4 (an old friend) lim equals lim =. This equals zero. 


' 2 
exams 228% tects to La, Atx—o the mit ie. 
g sinx g OS X | 
EXAMMIES me endo: 2m At ero thee a 
1- x g sin x 0 


Solution Apply the Rule to f'/g’. It has the same limit as f”/g”: 


ee rere ae: free) _ sin x0 
if . 0 and a 0 then compute vO) Gax i 0. 


The reason behind ’Hépital’s Rule is that the following fractions are the same: 
fx) _ $0) —Sa) | g(x)~ g(a) 


R(x) xa x-a (7) 
That is just algebra; the limit hasn’t happened yet. The factors x — a cancel, and the 
numbers f{a) and g(a) are zero by assumption, Now take the limit on the right side 
of (7) as x approaches a. 

What normally happens is that one part approaches f’ at x = a. The other part 
approaches g’(a). We hope g’(a) is not zero. In this case we can divide one limit by 
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the other limit. That gives the “normal” answer 


lim a = limit of (7)= f(a) 


8 
xa Q(X g(a) ®) 


This is also ’H6pital’s answer. When f'(x) >/f'(a) and separately g’(x) > g’(a), his 
overall limit is f’(a)/g’(a). He published this rule in the first textbook ever written on 
differential calculus. (That was in 1696—the limit was actually discovered by his 
teacher Bernoulli.) Three hundred years later we apply his name to other cases 
permitted in (6), when f’/g’ might approach a limit even if the separate parts do not. 

To prove this more general form of l’H6pital’s Rule, we need a more general Mean 
Value Theorem. I regard the discussion below as optional in a calculus course 
(but required in a calculus book). The important idea already came in equation (8). 


Remark The basic “indeterminate” is 1 — «. If f(x) and g(x) approach infinity, 
anything is possible for f(x) — g(x). We could have x? — x or x — x? or (x + 2)—x. 
Their limits are 00 and — oo and 2. 

At the next level are 0/0 and 00/00 and 0+ o. To find the limit in these cases, try 
’Hopital’s Rule. See Problem 24 when f(x)/g(x) approaches co/c0. When f(x) +0 
and g(x) > 00, apply the 0/0 rule to f(x)/(1/g(x)). 

The next level has 0° and 1” and «0°. Those come from limits of f(x)®*. If f(x) 
approaches 0, 1, or «0 while g(x) approaches 0, 00, or 0, we need more information. 
A really curious example is x'""*, which shows all three possibilities 0° and 1” and 
00°. This function is actually a constant! It equals e. 

To go back down a level, take logarithms. Then g(x) In f(x) returns to 0/0 and 
0- oo and I’H6pital’s Rule. But logarithms and e have to wait for Chapter 6. 


THE GENERALIZED MEAN VALUE THEOREM 


The MVT can be extended to two functions. The extension is due to Cauchy, who 
cleared up the whole idea of limits. You will recognize the special case g = x as the 
ordinary Mean Value Theorem. 


3Q Generalized MVT If f(x) and g(x) are continuous on [a,b] and 
differentiable on (a, b), there is a point a < c < b where 


C f(b) — f(a) }g’(c) = [g(b) — g(a) 1 f'(c). (9) 


The proof comes by constructing a new function that has F(a) = F(b): 
F(x) = [ f(b) — f(a) g(x) — [g(b) — g(a) f(x). 
The ordinary Mean Value Theorem leads to F’(c) = 0—which is equation (9). 


Application 1 (Proofofl’H6pital’s Rule) The rule deals with f(a)/g(a) = 0/0. Insert- 
ing those zeros into equation (9) leaves f(b)g'(c) = g(b)f’(c). Therefore 
fib) _ f'(0 
gb) g'(c) 


As b approaches a, so does c. The point c is squeezed between a and b. The limit of 
equation (10) as b> a and c ~ a is l'H6pital’s Rule. 


(10) 


152 3 Applications of the Derivative 


Application 2. (Error in linear approximation) Section 3.2 stated that the distance 
between a curve and its tangent line grows like (x — a)*. Now we can prove this, and 
find out more. Linear approximation is 


f(x) =f(a) +f’ (a\(x — a) + error e(x). (11) 
The pattern suggests an error involving {”(x) and (x — a)’. The key example f= x? 
shows the need for a factor 3 (to cancel {” = 2). The error in linear approximation is 
e(x)=4f"(c)(x—a)? with a<ec<x., (12) 

Key idea Compare the error e(x) to (x — a)’. Both are zero at x = a: 

e=f(x)-f(Q-f@ax-a C=fW-f@ e=f"(x) 

B= (x= a)" 8 = 2(x— a) g'=2 
The Generalized Mean Value Theorem finds a point C between a and x where 
e(x)/2(x) = e'(C)/g’(C). This is equation (10) with different letters. After checking 


e'(a)= 2’(a)=0, apply the same theorem to e’{x) and g’{x). It produces a point c 
between a and C—certainly between a and x—where 
e(x) _ e"(c) 


©) _ 2) e) 
0) #0) and therefore mee 


With g=(x—a)* and g"=2 and e”=f", the equation on the right is e{x})= 
$f"(c)(x — a)?. The error formula is proved. A very good approximation is 


4f"(a)(x - a)’. 
EXAMPLE? (x)= ./x near a= 100: \/102= 10+ (35) 24 5 (a) 92. 


That last term predicts e = — .0005. The actual error is ,/ 102 — 10.1 = ~ .000496. 


3.8 EXERCISES 
Read-through questions Find all points 0 < c < 2 where f(2)—(0} =/f'(c)(2 — 0). 
The Mean Value Theorem equates the average slope Af/Ax 1 f(x)=x3 2 f(d=sin zx 


overan 9 [a,5] tothe slope df/dx atan unknown _b_. = _ - 
The statement is__¢__. It requires f(x}1o be __d__ on the + JG) tan tax ee a tai 
@ interval [a,b], witha __f on the open interval (a, b). § f(x) =(x— 1)'° 6 f(x) =(x—-1)° 
Rolle’s theorem is the special case when f{a)=f(b)=0, and 
the point c satisfies __9 . The proof chooses ¢ as the point In 7-10 show that no point ¢ yields f(1)—f(—1) =f (cX2). 


where f reaches its __h__. Explain why the Mean Value Theorem fails to apply. 


Consequences of the Mean Value Theorem include: 7 f(x)=|x—4 8 f(x) = unit step function 
If f‘(x}=0 everywhere in an interval then f{x)=__! 


; oats wee 
The prediction f(x)=f(a)+_]_{(x—a) is exact for 9 f(x)= lal" 10 f(x) = I/x 

some c between a and x. The quadratic prediction 11 Show that sec? x and tan? x have the same derivative, and 
f(x) =f(a) +f (ax ~ a) +__k_{x — a)? is exact for another draw a conclusion about {(x) = sec?x — tan? x, 


c. The error in f(a)+/‘(a)(x —a) is less than $M(x — a)? 


2 2 ere 
where Mic the maximum | 12. Show that csc? x and cot” x have the same derivative and 


find f(x) =csc* x — cot? x. 
A chief consequence is l"H6pital’s Rule, which applies when 

f(x) and g(x)>__m__ as xa, In that case the limit of Evaluate the limits in 13-22 by PHépital’s Rule. 

S(x}/g(x) equals the limit of _ 1, provided this limit exists. < ei 

Normally this itmit is {’(a}/g’(a). If this is also 0/0, go on to x'—9 14 lim x 


13 lim ~—— 
the limit of __o 3 x= oy rein) 
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_ (L4x)72-1 Roma 
15 lim Ss) 16g 22 

zx x x70 x 
17 lim =— esti — 

xx SIN xX x1 SIN X 

_ (L4exy¥-1 iat 

x70 x x70 x 

. sin x—tan x = as 
1 lim ==] 2 ig Ves = 


23 For f= x? —4 and g =x +2, the ratio {'/g’ approaches 4 
as x — 2. What is the limit of f{x)/e(x)? What goes wrong in 
l H6pital’s Rule? 
24 P’Hépital’s Rute still holds for {(x)/2(x) > 00/00: L is 
~ f(x) etx) 4. a’ (x)/2?) 
lim —— =lim =lim ——"— = 1? lim =. 
ge) U/S@) FOP) f'@) 
Then £ equals lim [f’{x)/2’(x)] if this limit exists. Where did 
we use the rule for 0/0? What other limit rule was used? 
1+ (1/x) . xt+x 
t—(1x)' 26 Compute iim a 


, *+COS x 
27 Compute lim ———— 
xoa X+SIn X 


l'H6pital gives no answer. 


25 Compute lim 


by common sense. Show that 


. CCX . 
28 Compute jim cone by common sense or trickery. 
29 The Mean Value Theorem applied to /{x) = x? guarantees 


that some number c between f and 4 has a certain property. 
Say what the property is and find c. 


W If |df/dx| <1 at all points, prove this fact: 
if (x) —f(y}| < |x — y| at all x and y. 


31 The error in Newton’s method is squared at each step: 
[X,+2-—x*| < M|x, — x*|?. The proof starts from 0 = f(x*) = 
FOE)AL (ep) — xy) +4" (xt — x,)%. Divide by f’ (x), 
recognize x,4,, and estimate M. 


32 (Roile’s theorem backward) Suppose f’(c)=0. Are there 
necessarily two points around c where f(a) =f (b)? 


33 Suppose f{0) = 0. If f(x}/x has a limit as x — 0, that limit 
is better known to us as __———sC«swsW: L’ HGpital’s Rule looks 
instead at the limit of ; 

Conclusion from l’H6pital: The limit of f’(x), if it exists, 
agrees with f’{0), Thus f'(x) cannot have a “removable 


34 Itis possible that f’(x)/g'(x) has no limit but f(x)/e(x) > L. 
This is why |’H6pital included an “if.” 
(a) Find E as x +0 when f(x) = x? cos(1/x) and g(x) = x. 
Remember that cosines are below 1. 
(b) From the formula f’(x) = sin (1/x) + 2x cos{t/x) show 
that f’/e’ has no limit as x - 0, 


35 Stein’s calculus book asks for the limiting ratio of 
f(x)=triangular area ABC to g{x)=curved area ABC. 
(a) Guess the limit of f/g as the angle x goes to zero, 
(b) Explain why f(x) is 4(sinx—sinxcos x} and g({x) is 
4x — sin x cos x). (c) Compute the true limit of f(x)/g(x). 


oo 


36 If you drive 3000 miles from New York to L.A. in 100 
hours (sleeping and eating and going backwards are allowed) 
then at some moment your speed is 


37 As x-+ oo l’Ho6pital’s Rule still applies, The limit of 
S(x)ig(x) equals the limit of f'(x)/g’{x), if that limit exists. 
What is the limit as the graphs become parallel in Figure B? 


38 Prove that f(x} is increasing when its slope is positive: if 


fc) > 0 at all points c, then f(b) > f(a) at all pairs of points 
b> a. 
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CHAPTER 4 


Derivatives by the Chain Rule 


eee §=864.4 the Chainkuic i 


You remember that the derivative of f(x)g{x) is not (df/dx)(dg/dx). The derivative 
of sin x times x* is not cos x times 2x. The product rule gave two terms, not one 
term. But there is another way of combining the sine function f and the squaring 
function g into a single function. The derivative of that new function does involve 
the cosine times 2x (but with a certain twist). We will first explain the new function, 
and then find the ‘‘cAhafn rufe”’ for its derivative. 

May I say here that the chain rule is important. It is easy to learn, and you will 
use it often. I see it as the third basic way to find derivatives of new functions from 
derivatives of old functions. (So far the old functions are x", sin x. and cos x. Still 
ahead are e* and log x.) When f and g are added and multiplied. derivatives come 
from the sum rude and product rule. This section combines f and g in a third way. 

The new function is sin(x?)—the sine of x?. It is created out of the two original 
functions: if x = 3 then x? =9 and sin(x?)= sin 9. There is a “chain” of functions, 
combining sin x and x? into the composite function sin(x*}. You start with x, then 
find 9x), then find f(@(x)): 

The squaring function gives » = x. This is g(x). 

The sine function produces z = sin y = sin(x*). This is f(g(x)). 
The “inside function” g(x) gives y. This is the input to the “outside function” f(y). That 
is called composition. It starts with x and ends with z. The composite function ts 


sometimes written f- yz (the circle shows the difference from an ordinary product fg). 
More often you will see f(g(x)): 


2x) = fe g (x) = flex). (1) 
Other examples are cos 2x and (2x)*, with g = 2x. On a calculator you input x, then 
push the “g™ burton, then push the “f" button: 
From x compute y = g{x) From y compute z= f(y). 


There is not a button for every function! But the squaring function and sine function 
are on most calculators, and they are used in that order. Figure 4.1a shows how 
154 squaring will stretch and squeeze the sine function. 
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That graph of sin x? is a crazy FM signal (the Frequency is Modulated). The wave 
goes up and down like sin x, but not at the same places. Changing to sin g(x) moves 
the peaks left and right. Compare with a product g{x) sin x, which is an AM signal 
{the Amplitude is Modulated). 


Remark f(g(x)) is usually different from g{f{x)). The order of f and g is usually 
important. For f(x) = sin x and g(x) = x?, the chain in the opposite order gf f(x) gives 
something different: 


First apply the sine function: y = sin x 
Then apply the squaring function: z = (sin x)”. 


That result is often written sin?x, to save on parentheses. It is never written sin x?, 
which is totally different. Compare them in Figure 4.1. 


z=siny 


f(g) 
(sinv¥x )? 


1 


y=sin (x7) 


sin(V¥n 2) 
Fig. 4.4 f(g(x)) is different from g{ f(x). Apply g then f, or f then g. 


EXAMPLE 4 The composite function fog can be deceptive. If g(x) = x? and f(y) = y*, 
how does /(g(x)) differ from the ordinary product f{x)g(x)? The ordinary product is 
x’, The chain starts with y = x°, and then z = y* = x!?, The composition of x? and 
y* gives flg(x)) = x'?. 


EXAMPLE 2 In Newton’s method, F(x) is composed with itself. This is iteration. 
Every output x, is fed back as input, to find x,,; = F(x,). The example F(x) =4x+4 
has F{F(x))= 43x + 4)+ 4. That produces z= $x + 6. 

The derivative of F(x) is $. The derivative of z= F(F{x)) is t, which is 5 times 4. 
We multiply derivatives. This is a special case of the chain rule. 

An extremely special case is f(x) = x and g(x) = x. The ordinary product is x”. The 
chain fle(x)) produces only x! The output from the “identity function” is g{x) = x.t 


When the second identity function operates on x it produces x again. The derivative 
is 1 times 1. I can give more composite functions in a table: 


y=ax) z=fly) z=flex) 


cos x y (cos x}? 
2 2 22" 
x+5 y7s x 


The last one adds 5 to get y. Then it subtracts § to reach z. So z= x. Here output 


TA calculator has no button for the identity function. It wouldn't do anything, 
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equals input: f{g(x))=x. These ‘inverse functions” are in Section 4.3. The other 
examples create new functions z(x) and we want their derivatives. 


THE DERIVATIVE OF f(g(x)) 


What is the derivative of z = sin x? It is the limit of Az/Ax. Therefore we look at a 
nearby point x + Ax. That change in x produces a change in y = x?—which moves 
to y+ Ay=(x+ Ax)’, From this change in y, there is a change in z=f(y). It is a 
“domino effect,” in which each changed input yields a changed output: Ax produces 
Ay produces Az. We have to connect the final Az to the original Ax. 

The key is to write Az/Ax as Az/Ay times Ay/Ax. Then let Ax approach zero. 
In the limit, dz/dx is given by the “chain rule”: 

x mn he becomes the chain rule & = ae (2) 

As Ax goes to zero, the ratio Ay/Ax approaches dy/dx. Therefore Ay must be going 
to zero, and Az/Ay approaches dz/dy. The limit of a product is the product of the 
separate limits (end of quick proof). We multiply derivatives: 


4A Chain Rale Suppose e(x) has a derivative at x and f(y) has a derivative 
at y = g(x). Then the derivative of z = f(g(x)) is 


=$'(@x) (3): 3) 


The slope at x is df/dy (at y) times dg/dx {at x). 


Caution The chain rule does not say that the derivative of sin x? is (cos x)(2x). 
True, cos y is the derivative of sin y. The point is that cos y must be evaluated at y 
(not at x). We do not want df/dx at x, we want df/dy at y = x?: 


The derivative of sin x? is (cos x7} times (2x). (4) 


EXAMPLE 3 If z = (sin x)? then dz/dx = (2 sin x)(cos x). Here y = sin x is inside. 


In this order, z= y? leads to dz/dy = 2y. It does not lead to 2x. The inside function 
sin x produces dy/dx =cos x. The answer is 2ycos x. We have not yet found the 
function whose derivative is 2x cos x. 


EXAMPLE 4 The derivative of z = sin 3x is we = —— = 3¢os 3x. 
dx dydx 


: A A 
Ag. 4.2 The chain rule: re = i: Ax 


Ay approaches ma aa, 


dx dydx 


4.1 The Chain Rule 


The outside function is z= sin y. The inside function is y = 3x. Then dz/dy = cos y— 
this is cos 3x, not cos x. Remember the other factor dy/dx = 3. 

I can explain that factor 3, especially if x is switched to t. The distance is z = sin 31. 
That oscillates like sin ¢ except three times as fast. The speeded-up function sin 3t 
completes a wave at time 2n/3 (instead of 2x). Naturally the velocity contains the 
extra factor 3 from the chain rule. 


EXAMPLES Let z=/(y)= y”. Find the derivative of /(g(x)) = [g(x)]". 
In this case dz/dy is ny"~'. The chain rule multiplies by dy/dx: 
dz_n-1 AY 


d d 
Bats or Lele = alec (5) 


This is the power rule! It was already discovered in Section 2.5, Square roots (when 
n= 1/2) are frequent and important. Suppose .y = x? — 1: 
d 


dx 


x= 1s 500 — 1)? @x)= (6) 


x | 


Question A Buick uses 1/20 of a gallon of gas per mile. You drive at 60 miles per 
hour. How many gallons per hour? 

Answer (Gallons/hour) = (gatlons/mile)(miles/hour). The chain rule is (dp/dt)= 
(dy/dx)(dx/dt). The answer is (1/20)(60) = 3 gallons/hour. 


Proof of the chain rule The discussion above was correctly based on 


Az Az Ay me dz _ dz dy 
dx dydx 
It was here, over the chain rule, that the “‘battle of notation” was won by Leibniz. 
His notation practically tells you what to do: Take the limit of each term. (1 have to 
mention that when Ax is approaching zero, it is theoretically possible that Ay might 


hit zero. If that happens, Az/Ay becomes 0/0. We have to assign it the correct meaning, 
which is dz/dy.) As Ax > 0, 


(7) 


Ay : Oe hy pi 

re > g(x) and Ay SO) =f (a(x). 

Then Az/Ax approaches f'(y} times g’{x), which is the chain rule (dz/dy}(dy/dx). In the 
table below, the derivative of (sin x)? is 3(sin x)? cos x. That cxtra factor cos x is easy 
to forget. It is even easier to forget the —1 in the last example. 


z=(x?+tP  dzidx=5(x?+1)* times 3x? 
z=(sin x) dzidx = 3 sin?x times cos x 
z=(1-—x) dzidx=2(1—x) times —1 


important All kinds of letters are used for the chain rule. We named the output z. 
Very often it is called y, and the inside function is called u: 
. ; dy d 
The derivative of y = sin u(x) is ‘Y= COS =iae 
dx dx 

Examples with du/dx are extremely common. [ have to ask you to accept whatever 
letters may come. What never changes is the key idea—derivative of outside function 
times derivative of inside function. 


157 


158 
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EXAMPLE6 The chain rule is barely needed for sin(x — 1). Strictly speaking the 
inside function is u=x—1. Then du/dx is just 1 (not —1). Jf y=sin(x— 1) then 
dy/dx = cos(x — 1). The graph is shifted and the slope shifts too. 

Notice especially: The cosine is computed at x — 1 and not at the unshilted x. 


RECOGNIZING f(y) AND g(x) 


A big part of the chain rule is recognizing the chain. The table started with (x* + 1)°. 
You look at it for a second. Then you see it as u>. The inside function is u= x* + 1. 
With practice this decomposition (the opposite of composition) gets easy: 


cos (2x + 1) is cos u /1+ sin t is Ju x sin x is ... (product rule!) 


In calculations, the careful way is to write down all the functions: 

z=cosu u=2x+1 = dz/dx =(—sin u)(2)= — 2 sin (2x + 1). 
The quick way is to keep in your mind “the derivative of what’s inside.” The slope 
of cos(2x + 1) is —sin(2x + 1), times 2 from the chain rule. The derivative of 2x + 1 
is remembered— without z or u or for zg. 
EXAMPLE 7 sin ./1— x is a chain of z=sin y, y= Ju, u=1— x (three functions). 
With that triple chain you will have the hang of the chain rule: 


l 
The derivative of sin ./1— x is (cos \/1 1). 
ive /1--x is (cos / (es) ) 
This is (dz/dy)(dy/du)(du/dx). Evaluate them at the right places y, u, x. 


Finally there is the question of second derivatives. The chain rule gives dz/dx as a 
product, so d?z/dx? needs the product rule: 


dz dzdy d*z dzd*y d {dz\ dy 
—$= = — = +t | _, 

_dx dydx rate dx? dydx? = dx\dy/ dx (8) 
uo» upt vy! v 


That last term needs the chain rule again. It becomes d?z/dy” times (dy/dx)?. 


EXAMPLES The derivative of sin x? is 2x cos x*. Then the product rule gives 
d?z/dx* = 2 cos x? — 4x? sin x”. In this case y’ = 2 and (y’)* = 4x?. 


4.4 EXERCISES 


Read-through questions cos u(x) has dp/dx = __m__. The power rule for y = [u{x)]}" is 


z= f(z(x)} comes from z=/f(y) and y=__a__. At x = 2, the 
chain (x* — 1)? equals__b__. Its inside functionisy=__¢_, 
its outside function isz=__4__. Then dz/dx equals _ e 


the chain rule dy/dx =__ __. The slope of 5g{x}is_ © and 


the slope of e(5x) is__p . When f= cosine and g = sine and 


x =0, the numbers f(g(x)) and ef f(x)) and f(x)g(x} are__a@_. 


The first factor is evaluated at y=__¢ | (not at y=x). . dz 
For z= sin(x* — 1) the derivative is__g . The triple chain In 1~10 identify f(y) and g(x). From their derivatives find —. 


z=cos(x + 1)* has a shift and a __» __ and a cosine. Then 
dzjdx =__\ 


The proof of the chain rule begins with Az/Ax= 
ee em Sa 


1 z=(x? —3) 2 z=(x°-3P 
3 z= cosx*) 4 z=tan 2x 
)and ends with __|__. Changing letters, y = § z=./sin x 6 z=sin x 
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7 z=tan(t/x)+ l/tan x 8 z=sin(cos x) 


9 z=cosx?+x+1)} 10 z=,/x 


In 11-16 write down dz/dx. Don’t write down f and g. 
I z=sin(I7x) 12 2=tan(x + 1) 
l4z- (x?)p!? 


16 2=(9x + 4)>? 


13 z=cos(cos x) 
15 z=x’ sinx 
Problems 17-22 involve three functions z(y), y(u), and u(x). 
Find dz/dx from {dz/dy)(dy/du)(dujdx). 


17 z=sin /x+1 


19 z=./l+sinx 


21 z=sin(1/sin x) 


18 z=\/sin(x + 1) 

20 2 =sin{,/x + 1) 

22 z=(sin x*)? 

In 23-26 find dz/dx by the chain rule and also by rewriting =. 
23 z=((x7/y* 24 z=(3xy 


BB z=(xt+))?+sinfxt+a) 26 z= /1—cos?x 

27 HE f(x)=x? +1 what is (f(x)? If U(x) is the unit step 
function {from 0 to | at x =0) draw the graphs of sin U(x} 
and U(sin x}. If R(x) is the ramp function {x + |x{), draw the 
gtaphs of R{x) and R(sin x}. 


28 (Recommended) If g(x)= x? lind f(y) so that f(g(x)) = 
x? 41. Then find A(y) so that A(g(x)) =x. Then find k(y) so 
that k(g{x)) = 1. 


29 If f(y} = y — 2 find g(x) so that f(g(x)) =x. Then find A(x} 
so that f{A(x}) = x*. Then find k(x) so that /(k(x)) = 1. 


30 Find two different pairs f(y), g(x) so that f(e(x)}= 
Vl-x?. 


HM The derivative of /(/(x)) ts 
your formula on f(x) = 1/x. 


32 If f(3) =3 and 93) =5 and f‘{3) = 2 and g'(3) = 4, find the 
derivative at x = 3 if possible for 


(a) fixie) (DY fle) a(fix) (fe) 


33 For F(x)=4x+8, show how iteration gives F(F(x)) = 
4x +12. Find peau ciner called F'3x), The derivative 
of F(x) is 


34 In Problem 33 the limit of F(x) is a constant C= 
. From any start (try x =0) the iterations x, = 
F(x,) converge to C. 


35 Suppose g(x)= 3x + 1 and fi3) = Hy — LD). Then f(g(x)) = 
and gf f(y) = . These are inverse functions. 


36 Suppose p(x) is continuous at x = 4, say g(4} = 7. Suppose 
Jty) is continuous at y= 7, say f(7)=9. Then f(g(x)) is con- 
tinuous at x = 4 and f{g(4)} = 9. 

Proof ¢ is given. Because is continuous, there is a 
6 such that |/(g(x))—9|<s whenever |g(x)—7|< 0. Then 


_ Is it (dffdx}?? Test 


because is continuous, there is a @ such that 
|g(x} — 7| <6 whenever |x —4| < 8, Conclusion: [f |x ~ 4] <6 
then . This shows that f(g{x)) approaches f(9(4)). 
37 Only six functions can be constructed by compositions (in 
any sequence) of g(x) = 1—x and f(x) = 1/x. Starting with g 
and f, find the other four. 
38 If g(x}= 1-—~x then g(e(x)=1—(1 —x) =x. If gfx) = 1/x 
then g(g(x})=1/(L/x)=x. Draw graphs of those g’s and 
explain from the graphs why g(g(x}}= x. Find two more g’s 
with this special property. 
39 Construct functions so that f(g(x)) is always zero, but f(y) 
is not always zero. 
40 True or false 

{a) If fix) =f(— x) then f'(x) = f'(— x). 

{b) The derivative of the identity function is zero. 

(c) The derivative of f(L/x) is —1/Cf(x)¥. 

(d) The derivative of f(1 + x) 1s f(1 + x). 

(e) The second derivative of f{g(x)) is f"{a(x)e"(x) 


41 On the same graph draw the parabola y=x? and the 
curve 2 = sin (keep } upwards, wtth x and z across). Starting 
at x =3 find your way to z=sin 9. 


42 On the same graph draw y = sin x and z = y? (y upwards 
for both). Starting at x = 2/4 find z =(sin x)’ on the graph. 


43 Find the second derivative of 
(a) sin(x? +1) (b) fx? = 1 


d fdz\ _ (a?2\ (dy 
44 Explain why ie) = (Gs)(4e) in equation (8). 


Check this when 2 = y?, p=. 


(c} cos Ss 


Final practice with the chain rule and other rules (and other 
letters!), Find the x or ¢ derivative of z or y. 


45 z=flidn) 46 z= u=x? 
47 y=sin u(x)cos u(x) 48 y= vf ult) 


49 y= xix) 50 y=fix7) + (fo) 
§2 z= 1;u"e) 
§3 -=fw),u=e?, b= vit §4 y=u,u=x,x=1)t 


85 If f=x* and g=x° then f’ = 4x7 and g’ = 3x”. The chain 
rule multiplies derivatives to get 12x°. But f(e(x))=x'? and 
its derivative is not 12x°. Where is the Naw? 


Sl z= Vl-uu= Yl—-x 


56 The derivative of y = sin(sin x) is dyidx = 
cos{cos x} sin{cos x)jcos x cas(sin x)cos x cos(cos x)cos x. 


57 (a) A book has 400 words per page. There are 9 pages per 
section. So there are words per section. 
({b) You read 200 words per minute. So you read 
pages per minute. How many minutes per section? 
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58 (a) You walk in a train at 3 miles per hour. The train 59 Coke costs 1/3 dollar per bottle. The buyer gets 
moves at 50 miles per hour, Your ground speed is bottles per dollar, If dy/dx = 1/3 then dx/dy = 
miles per hour. 


(b} You walk in a tratn at 3 miles per hour, The train is 60 (Computer) Graph F(x)=sin x and G(x) = sin{sin x)-~ 
shown on TY (1 mile train = 20 inches on TY screen). not much difference. Do the same for F'(x) and G’{x). Then 
Your speed across the screen is inches per hour. plot F(x) and G*(x} to see where the difference shows up. 


EE 894.2 implicit Differentiation and Related Rates xx 


We start with the equations xy=2 and y°+xy=3. As x changes, these y’s will 
change—to keep (x, y) on the curve. We want to know dy/dx at a typical point. For 
xy = 2 that is no trouble, but the slope of y5 + xy = 3 requires a new idea. 

In the first case, solve for y = 2/x and take its derivative: dy/dx = — 2/x*. The curve 
is a hyperbola. At x = 2 the slope is — 2/4 = —1/2. 

The problem with y*° + xy =3 is that it can’t be solved for y. Galois proved that 
there is no solution formula for fifth-degree equations.t The function y{x) cannot 
be given explicitly. All we have is the implicit definition of y, as a solution to 
y°+xp=3. The point x = 2, y= 1 satisfies the equation and lies on the curve, but 
how to find dy/dx? 

This section answers that question. It is a situation that often occurs. Equations 
like sin y+ sin x = 1 or y sin y= x (maybe even sin y = x) are difficult or impossible 
to solve directly for y. Nevertheless we can find dy/dx at any point. 

The way out is implicit differentiation. Work with the equation as it stands. Find 
the x derivative of every term in y° + xy = 3. That includes the constant term 3, whose 
derivative is zero. 


EXAMPLE 4 The power rule for y° and the product rule for xy yield 


dy | dy 
Sy# —+x—+y=0, 
sree i meni ag (1) 
Now substitute the typical point x = 2 and y= 1, and solve for dy/dx: 
dy dy l 
—+ = — -. 
ax 1=0 produces ay 5 


This is implicit differentiation (ID), and you see the idea: Include dyp/dx from the 
chain rule, even if y is not known explicitly as a function of x. 


dy 
= + 
55 t2 (2) 


d 
EXAMPLE 2 sin y+sin x = 1 leads to cos ys + cos x= 0 


_ 4 
tsin yo = 


: d 

EXAMPLE 3 y sin y= x leads to p cos y re 
Knowing the slope makes it easier to draw the curve. We still need points (x, y) 

that satisfy the equation. Sometimes we can solve for x. Dividing y* + xy =3 by y 


+That was before he went to the famous duel, and met his end, Fourth-degree equations do 
have a solution formula, but it is practically neyer used. 


4.2 Implicit Differentiation and Related Rates 


gives x = 3/y — y*. Now the derivative (the x derivative!) is 


3 dy dy 
= -—_—— 4 3 —_— = — =]. 
( , 2 VT ty 1 (3) 
Again dy/dx = — 1/7. All these examples confirm the main point of the section: 


The example xy = 2, done implicitly, gives x dy/dx + y= 0. The slope dy/dx is — y/x. 
That agrees with the explicit slope — 2/x?. 

ID is explained better by examples than theory (maybe everything is). The essential 
theory can be boiled down to one idea: “Go ahead and differentiate.” 


EXAMPLE 4 Find the tangent direction to the circle x? + y? = 25. 
We can solve for y= +./25— x?, or operate directly on x? + y? = 25: 


dy dy x 
2x + 2y— =0 —=--, 4 
ee = dx y @) 
Compare with the radius, which has slope y/x. The radius goes across x and up y. 
The tangent goes across — y and up x. The slopes multiply to give (— x/y)(y/x) = — 1. 
To emphasize implicit differentiation, go on to the second derivative. The top of the 
circle is concave down, so d?y/dx? is negative. Use the quotient rule on — x/y: 


dy x d*y__ ydx/dx—xdy/dx__ y+(x’/y)_ yy’ +x? (5) 
ss - 


dx y dee y? = y? y 


RELATED RATES 


There is a group of problems that has never found a perfect place in calculus. They 
seem to fit here—as applications of the chain rule. The problem is to compute 
df/dt, but the odd thing is that we are given another derivative dg/dt. To find df/dt, 
we need a relation between f and g. 

The chain rule is df/dt = (df/dg)(dg/dt). Here the variable is t because that is typical 
in applications. From the rate of change of g we find the rate of change of f. This is 
the problem of related rates, and examples will make the point. 


EXAMPLE 5 The radius of a circle is growing by dr/dt = 7. How fast is the circum- 
ference growing? Remember that C = 2zr (this relates C to r). 

dC dCdr 
Solution eae (27)(7) = 14x. 
That is pretty basic, but its implications are amazing. Suppose you want to put a 
rope around the earth that any 7-footer can walk under. If the distance is 24,000 
miles, what is the additional length of the rope? Answer: Only 147 feet. 

More realistically, if two lanes on a circular track are separated by 5 feet, how 
much head start should the outside runner get? Only 107 feet. If your speed around 
a turn is 55 and the car in the next lane goes 56, who wins? See Problem 14. 

Examples 6-8 are from the 1988 Advanced Placement Exams (copyright 1989 by 
the College Entrance Examination Board). Their questions are carefully prepared. 
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4 Dorivatives by the Chain Rule 


x A 100 c 


Ag. 4.3 Rectangle for Example 6, shadow for Example 7, balloon for Example 8. 


EXAMPLE 6 The sides of the rectangle increase in such a way that dz/dt=1 and 
dx/dt = 3dy/dt. At the instant when x = 4 and y= 3, what is the value of dx/dt? 


Solution The key relation is x? + y? = z?. Take its derivative (implicitly): 


dx dy = ax dy 


dz 
= ote adel +6—=10, 

2x rT +2y i 2z at produces 8 7 6 7 10 
We used all information, including z = 5, except for dx/dt= 3dy/dt. The term 6dy/dt 


equals 2dx/dt, so we have 10dx/dt = 10. Answer: dx/dt = i. 


EXAMPLE 7 A person 2 meters tall walks directly away from a streetlight that is 8 
meters above the ground. If the person’s shadow is lengthening at the rate of 4/9 
meters per second, at what rate in meters per second is the person walking? 


Solution Draw a figure! You must relate the shadow length s to the distance x from 
the streetlight. The problem gives ds/dt = 4/9 and asks for dx/dt: 


na : xs dx 6ds__ 4\ 4 

By similar triangles E73: Pie on a ) =., 

Note This problem was hard. I drew three figures before catching on to x and s. 
It is interesting that we never knew x or s or the angle. 


EXAMPLE 8 An observer at point A is watching balloon B as it rises from point C. 
(The figure is given.) The balloon is rising at a constant rate of 3 meters per second 
(this means dy/dt = 3) and the observer is 100 meters from point C. 


{a) Find the rate of change in z at the instant when y = 50. (They want dz/dt.) 


ay 


dz 
2= ,2 1 2 a2 = 
z= y? +1007 => 27 aya 


1 d 
A= 3(100\y)= S0y 4 = S07 = 50-3 = 150, 


(c) Find the rate of change in @ when y = 50. (They want d@/dt.) 


100 2 
= 50 §=— = — 
y => cos 50/5 Ts 


2 
y vote Le 2 HA (2) 3 3 


100 7 “Yar (100de t= 75) 100 125 


Read-through questions 


4.2 implicit Differentiation and Related Rates 


In all problems I first wrote down a relation from the figure. Then I took its derivative. 
Then I substituted known information. (The substitution is after taking the derivative 
of tan 6 = y/100. If we substitute y = 50 too soon, the derivative of 50/100 is useless.) 
“Candidates are advised to show their work in order to minimize the risk of not 
receiving credit for it.” 50% solved Example 6 and 21% solved Example 7. From 
12,000 candidates, the average on Example 8 (free response) was 6.1 out of 9. 


EXAMPLE9 4 is a lighthouse and BC is the shoreline (same figure as the balloon). 
The light at A turns once a second (d@/dt = 22 radians/second). How quickly does 
the receiving point B move up the shoreline? 


Solution The figure shows y= 100 tan 8. The speed dy/dt is 100 sec?6 d6/dt. This is 
200z sec?@, so B speeds up as sec @ increases. 


Paradox When @ approaches a right angle, sec @ approaches infinity. So does 
dy/dt. B moves faster than light! This contradicts Einstein’s theory of relativity. The 
paradox is resolved (1 hope) in Problem 18. 

If you walk around a light at A, your shadow at B seems to go faster than light. 
Same problem. This speed is impossible—something has been forgotten. 


Smaller paradox (not destroying the theory of relativity). The figure shows y = z sin 8. 
Apparently dy/dt = (dz/dt) sin 6. This is totally wrong. Not only is it wrong, the exact 
opposite is true: dz/dt = (dy/dt) sin @. If you can explain that (Problem 15), then ID 
and related rates hold no terrors. 


4.2 EXERCISES 
9 x=tan y 


10 =x atx=1 
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For x*+y°=2 the derivative dy/dx comes from __a 
differentialion. We don’t have to solve for __b _. Term by 
term the derivativeis 3x7+__¢ =0. Solving for dy/dx gives 

g@ .Atx=y=1] this slopeis__e . The equation of the 
tangent line is y-1=_ ¢ 


A second example is y?=x. The x derivalive of this 
equation is _g_. Therefore dy/dx=_h _. Replacing y 
by ./x, this is dy/dx=_1_. 


In related rates, we are given dg/dt and we want df/dt. We 
need a relation between fand J _. If f=, then (df/dt) = 
k (dg/dt). If f?+g7=1, then dffdt=__1. If the 
sides of a cube grow by ds/dt = 2, then its volume grows by 
dV/dt=__m_. To find a number (8 is wrong), you also need 


to know _n 


By implicit differentiation find dy/dx in 1-10. 


Lytx7=1 2 x*y+y7x=1] 

3 (x—y? =4 4 /x+Jfy=3atx=4 
5 x=F(y) 6 flx} + Fly) =xy 

7 x*y=y’x 8 x=sin y 


11 Show that the hyperbolas xy = C are perpendicular to the 
hyperbolas x? — »? = D. (Perpendicular means that the pro- 
duct of slopes is —1.) 


12 Show that the circles (x — 2)? + y? = 2and x? +(y—27 = 
2 are tangent at the point (1, 1). 


13 At 25 meters/second, does your car turn faster or slower 
than a car traveling 5 meters further out at 26 meters/second? 
Your radius is (a) 50 meters (b) 100 meters. 


14 Equation (4) is 2x + 2y dy/dx =0 (on a circle). Directly by 
ID reach d*y/dx? in equation (5). 


Problems 15-18 resolve the speed of light paradox in 
Example 9. 


18 (Small paradox first) The right triangle has z? = y? + 1007. 
Take the ¢ derivalive to show that 2’ = y sin @. 


16 (Even smaller paradox) As B moves up the line, why is 
dy/dt larger than dz/dt? Certainly z is larger than y. But as 0 
increases they become 


17 (Faster than light) The derivative of y= 100tan 6 in 
Example9 is y= 100sec@ 6’ = 200n sec?@. Therefore y 
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passes c (the speed of light) when sec*# passes 
Such a speed is impossible—we forget that light takes time 
to reach B., 


# increases by 2n 


B in 1 second 
z(2) 
yt) t is arrival time 
4 Pies of light 


100 Gis different from 2nt 


18 (Explanation by ID) Light travels from A to 8 in time 
z/c, distance over speed. Its arrival time is t = 0/2x + z/c so 
9'/2n=1—2'/e. Then 2'=y'sin@ and y' = 100sec@ @ (all 
these are ID) lead to 


y’ = 200ne¢/(c cos? + 200n sin 9) 


As @ approaches 7/2, this speed approaches : 

Note: y’ still exceeds c for some negative angle. That is for 
Einstein to explain. See the 1985 College Math Journal, page 
186, and the 1960 Scientific American, “Things that go faster 
than ight.” 


19 If a plane follows the curve y =/(x), and its ground speed 
is dx/dt = 500 mph, how fast is the plane going up? How fast 
is the plane going? 


20 Why can’t we differentiate x = 7 and reach | = 0? 


Problems 21-29 are applications of related rates. 


21 (Calculus classic) The bottom of a 10-foot ladder is going 
away from the wall at dx/dt = 2 feet per second. How fast is 
the top going down the wall? Draw the right triangle to find 
dy/dt when the height y is (a) 6 feet (b) 5 feet (c) zero. 


22 The top of the 10-foot ladder can go faster than light. At 
what height y does dy/dt = —c? 


23 How fast does the level of a Coke go down if you drink 
a cubic inch a second? The cup is a cylinder of radius 
2 inches—first write down the volume. 


24 A jet flies at 8 miles up and 560 miles per hour. How fast 
is it approaching you when (a) it ts 16 mites from you; (b) its 


a 3 4.3 


Inverse Functions and Their Derivatives 


shadow is 8 miles from you (the sun ts overhead); (c) the plane 
is 8 miles from you (exactly above)? 


25 Starting from a 3-4-5 mght triangle, the short sides 
increase by 2 meters/second but the angle between them 
decreases by | radian/second. How fast does the area increase 
or decrease? 


26 A pass receiver is at x =4, y=8t. The ball thrown at 
t=3is at x =e(t — 3), y = 10e{¢ — 3). 
(a) Choose c so the ball meets the receiver. 


*(b) At that instant the distance D between them is chang- 
ing at what rate? 


27 A thief is 10 meters away (8 meters ahead of you, across 
a street 6 meters wide). The thief runs on that side at 7 meters/ 
second, you run at 9 meters/second. How fast are you 
approaching if (a) you follow on your side; (b) you run toward 
the thief; (c) you run away on your side? 


28 A spherical raindrop evaporates at a rate equal to twice 
its surface area. Find dr/dt. 


29 Starting fom P=V=S5 and maintaining PV = T, find 
dV/dt if dP/dt = 2 and dT/dt = 3. 


3 (a) The crankshaft AB turns twice a second so d@/dt = 


(6) Differentiate the cosine law 6? = 3? + x? ~ 2 (3x cos 0) 

to find the piston speed dx/dt when @ = 2/2 and 0 =n. 
31 A camera turns at C to follow a rocket at &. 

{a) Relate dz/dt to dy/dt when y = 10, 

(b) Relate d@/dt to dy/dt based on y = 10 tan @. 

(c) Relate d70/dt? to d?y/dt? and dy/dt. 


There is a remarkable special case of the chain rule. It occurs when f(y) and g(x) are 
“inverse functions.” That idea is expressed by a very short and powerful equation: 


S(g(x)} = x. Here is what that means. 


Inverse functions: 


Start with any input, say x = 5. Compute y = g(x), say y = 3. Then 


compute f(y), and the answer must be 5. What one function does, the inverse function 


4.3 Inverse Functione and Their Derlvatives 


undoes. If g(5)= 3 then f(3) = 5. The inverse function f takes the output y back to the 


input x. 


EXAMPLE 4 ¢(x) = x — 2 and f(y) = y+ 2 are inverse functions. Starting with x= 5, 
the function g subtracts 2. That produces y= 3. Then the function f adds 2. That 
brings back x = 5. To say it directly: The inverse of y= x—2 is x =y+ 2. 


EXAMPLE 2 y= 9(x) = (x — 32) and x=/f(y)=%y +32 are inverse functions (for 
temperature). Here x is degrees Fahrenheit and y is degrees Celsius. From x = 32 
(freezing in Fahrenheit) you find y = 0 (freezing in Celsius). The inverse function takes 
y = 0 back to x = 32. Figure 4.4 shows how x = 50°F matches y = 10°C. 


Notice that §(x — 32) subtracts 32 first. The inverse $y + 32 adds 32 fast. In the 
same way g multiplies last by 3 while f multiplies first by $. 


domain of f =range of g 
y20 


x= y? y=Vx 
f & 


x20 
range of f= domain of g¢ 


Fig. 4.4 °F to °C to °F. Always g7 (g(x) =x and g(g7"{y)) = y. If f=g 1 then g=f—!. 


The inverse function is written {= g~' and pronounced “‘g inverse.” It is not 1/g(x). 


If the demand y is a function of the price x, then the price is a function of the demand. 
Those are inverse functions. Their derivatives obey a fundamental rule: dy/dx times 
dx/dy equals 1. In Example 2, dy/dx is 5/9 and dx/dy is 9/5. 

There is another important point. When f and g are applied in the opposite order, 
they still come back to the start. First fadds 2, then g subtracts 2. The chain g(f{y)) = 
(y + 2)— 2 brings back y. If fis the inverse of g then g is the inverse of f. The relation 
is completely symmetric, and so is the definition: 


Inverse function. If y = g(x) then x= 2° ‘(y). If x= '(y) then y = g(x). 


The loop in the figure goes from x to y to x. The composition g~ '(g(x)) is the “identity 
function.” Instead of a new point z it returns to the original x. This will make the 
chain rule particularly easy—leading to (dy/dx)(dx/dy) = 1. 


EXAMPLE3 y= ag(x)= ./x and x = f(y) = y? are inverse functions. 


Starting from x =9 we find y= 3. The inverse gives 37 =9. The square of ./x is 
f(g(x)) = x. In the opposite direction, the square root of y* is g(f(y)) = y. 
Caution That example does not allow x to be negative. The domain of g—the set 
of numbers with square roots—is restricted to x 2 0. This matches the range of g~', 
The outputs y? are nonnegative. With domain of g = range of g~', the equation x = 
(,/x)? is possible and true. The nonnegative x goes into g and comes out of g™'. 
In this example y is also nonnegative. You might think we could square anything, 
but y must come back as the square root of y?. So y>0. 
To summarize: The domain of a function matches the range of its inverse. The inputs 


to g~! are the outputs from g. The inputs to g are the outputs from g'. 
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4 Derivatives by the Chain Rule 


Ef g(x} = y then solving that equation for x gives x = g~ '(y): 
if y=3x-—6 thenx=4(y+6) (this is g~*(y)) 


ify=x?+1 thenx=3/y—1 (this is g~*(y) 


In practice that is how g~* is computed: Solve g(x)= y. This is the reason inverses 
are important. Every time we solve an equation we are computing a value of g'. 
Not all equations have one solution. Not ail functions have inverses. For each y, 
the equation g(x}= y is only allowed to produce one x. That solution is x = g~ 1(y). 
If there is a second solution, then g~‘ will not be a function—because a function 


cannot produce two x’s from the same y. 


EXAMPLE 4 There is more than one solution to sin x=4. Many angles have the 
same sine. On the interval O< x <x, the inverse of y=sin x is not a function. 
Figure 4.5 shows how two x’s give the same y. 

Prevent x from passing 7/2 and the sine has an inverse. Write x = sin“ *y. 


The function g has no inverse if two points x, and x2 give g(x,)= g(x2). Its inverse 
would have to bring the same y back to x, and x,. No function can do that; g~ '(y) 
cannot equal both x, and x,. There must be only one x for each y. 


To be invertible over an interval, g must be steadily increasing or steadily decreasing. 


x=sin7!y 


x 2 1 wl % 


Aig. 4.5 Inverse exists (one x for each y}. No inverse function (two x’s for one y). 


THE DERIVATIVE OF g™' 


It is time for calculus. Forgive me for this very humble example. 


EXAMPLES (ordinary multiplication) The inverse of y = g{x) = 3x is x =fly) = 4). 


This shows with special clarity the rule for derivatives: The slopes dy/dx=3 and 
dx/dy = 4 muttiply to give 1. This rule holds for all inverse functions, even if their 
slopes are not constant. It is a crucial application of the chain rule to the derivative 


of fl g(x)) = x. 
4C (Derivative of inverse function) From f(g{x))=x the chain rule gives 
S'(ex)ie'(x) = 1, Writing y = g(x) and x = fy), this rule looks better: 


dx i (1) 


dy dyjdx’ 
The slope of x = g~ ‘{y} times the slope of y = g(x) equals one. 


This is the chain rule with a special feature. Since {({g(x))= x, the derivative of both 
sides is 1. If we know g’ we now know f’. That rule will be tested on a familiar 
example. In the next section it leads to totally new derivatives. 


4.3 Inverse Functions and Their Derivatives 


EXAMPLE 6 The inverse of y= x is x = y'/°, We can find dx/dy two ways: 


: _ ax 1 a, saad ee ere 
directly: ars y indirectly: ae 32 
The equation (dx/dy)(dy/dx) = 1 is not ordinary algebra, but it is true. Those deriva- 
tives are limits of fractions. The fractions are (Ax/Ay)(Ay/Ax) = 1 and we let Ax > 0. 


fig. 4.6 Graphs of inverse functions: x =4y is the mirror image of y = 3x. 


Before going to new functions, I want to draw graphs. Figure 4.6 shows y= J/x 
and y= 3x. What is special is that the same graphs also show the inverse functions. 
The inverse of y= ./x is x= y?. The pair x= 4, y=2 is the same for both. That is 
the whole point of inverse functions—if 2 = g(4) then 4=g~'(2). Notice that the 
graphs go steadily up. 

The only problem is, the graph of x = g™ '(y) is on its side. To change the slope 
from 3 to }, you would have to turn the figure. After that turn there is another 
problem—the axes don’t point to the nght and up. You also have to look in a mirror! 
(The typesetter refused to print the letters backward. He thinks it’s crazy but it’s not.) 
To keep the book in position, and the typesetter in position, we need a better idea. 

The graph of x = 4y comes from turning the picture across the 45° line. The y axis 
becomes horizontal and x goes upward. The point (2, 6) on the line y = 3x goes into 
the point (6,2) on the line x= yy. The eyes see a reflection across the 45° line 
(Figure 4.6c). The mathematics sees the same pairs x and y. The special properties of 
g and g ' allow us to know two functions—and draw two graphs—at the same 
time.t The graph of x = g~1(y) is the mirror image of the graph of y = g(x). 


EXPONENTIALS AND LOGARITHMS 


I would like to add two more examples of inverse functions, because they are so 
important. Both examples involve the exponential and the logarithm. One is made up 
of linear pieces that imitate 2*; it appeared in Chapter 1. The other is the true function 
2*, which is not yet defined—and it is not going to be defined here. The functions b* 
and log,y are so overwhelmingly important that they deserve and will get a whole 
chapter of the book (at least). But you have to see the graphs. 

The slopes in the linear model are powers of 2. So are the heights y at the start of 
each piece. The slopes 1, 2, 4, ... equal the heights 1, 2,4, ... at these special points. 

The inverse is a discrete model for the logarithm (to base 2). The logarithm of 1 is 
0, because 2° = 1. The logarithm of 2 is 1, because 2! = 2. The logarithm of 2/ is the 
exponent j. Thus the model gives the correct x =log,y at the breakpoints y= 
1,2, 4, 8, .... The slopes are 1, 3,4, 4, ... because dx/dy = 1/(dy/dx). 


tI have seen graphs with y= g(x) and also y=g~4{x). For me that is wrong: it has to be 
x=g '(y). If y=sin x then x =sin™*y. 
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The model is good, but the real thing is better. The figure on the right shows the 
true exponential y= 2. At x= 0, 1,2,... the heights y are the same as before. But 
now the height at x = 4 is the number 21/7, which is w The height at x = .10 is the 
tenth root 2'/!° = 1.07.... The slope at x = 0 is no longer 1—it is closer to Ay/Ax = 
.07/.10. The exact slope is a number ¢ (near .7) that we are not yet prepared to reveal. 

The special property of y= 2* is that the slope at all points is cy. The slope is 
proportional to the function, The exponential solves dy/dx = cy. 

Now look at the inverse function—the logarithm. Its graph is the mirror image: 


If y = 2* then x = logoy. If 2'/!° = 1.07 then log, 1.07 ~ 1/10. 


What the exponential does, the logarithm undoes—and vice versa. The logarithm of 
2* is the exponent x. Since the exponential starts with slope c, the logarithm must 
start with slope 1/c. Check that numerically, The logarithm of 1.07 is near 1/10. The 
slope is near .10/.07. The beautiful property is that dx/dy = I/cy. 


x= log, y 


—_+——_+ + 
4 


Aig. 4.7 Piecewise linear models and smooth curves: y = 2* and x = log, y. Base 6 = 2. 


I have to mention that calculus avoids logarithms to base 2. The reason lies in that 
mysterious number c. It is the “natural logarithm” of 2, which is .693147...—and 
who wants that? Also 1/.693147... enters the slope of log, y. Then (dx/dyXdy/dx)= 1. 
The right choice is to use “natural logarithms” throughout. In place of 2, they are 
based on the special number e: 


y=e is the inverse of x = In y. (2} 


The derivatives of those functions are sensational—they are saved for Chapter 6. 
Together with x” and sin x and cos x, they are the backbone of calculus. 


Note It is almost possible to go directly to Chapter 6. The inverse functions x = 
sin”'y and x =tan™'y can be done quickly. The reason for including integrals first 
(Chapter 5) is that they solve diflerential equations with no guesswork: 


dy dx 1 dy 
—_—= —_— = + = —_= = + ‘ 
7 ene Pra leads to [ax {3 or x=Iny+C 
integrals have applications of all kinds, spread through the rest of the book. But do 
not lose sight of 2* and e*. They solve dp/dx = cy—the key to applied caiculus. 


THE INVERSE OF A CHAIN A( gG0) 


The functions g(x) =x — 2 and A(y) = 3y were easy to invert. For g_' we added 2, 
and for k~ ' we divided by 3. Now the question is: If we create the composite function 
z= h(g(x)), or z = 3{x — 2), what is its inverse? 


4.3 inverse Functions and Their Derivatives 169 
Virtually all known functions are created in this way, from chains of simpler 


functions. The problem is to invert a chain using the inverse of each piece. The answer 
is one of the fundamental rules of mathematics: 


4D The inverse of z = A(g(x)) is a chain of inverses in the opposite order: 


x= g~ (ho 1(2)}. (3) 
h~* is applied first because h was applied last: g~ *(h~ '(h(e(x)))) = x. 


That fast equation looks like a mess, but it holds the key. In the middle you see 
h-! and h. That part of the chain does nothing! The inverse functions cancel, to leave 
g '(g(x)). But that is x. The whole chain collapses, when g~' and hk! are in the 
correct order—which is opposite to the order of A(g(x)}. 


EXAMPLE 7) 2 = h(g(x)) = 3(x — 2) and x=g"!{h7-3(z)) = 424 2. 


First h~' divides by 3. Then g"' adds 2. The inverse of hog is g”'ch™'. It can be 
found directly by solving z = 3(x — 2). A chain of inverses is like writing in prose—we 
do it without knowing it. 


EXAMPLES Invert z=./x—2 by writing z? =x — 2 and then x =z? + 2, 


The inverse adds 2 and takes the square—but not in that order. That would give 
(z + 2)°, which is wrong. The correct order is z? + 2. 

The domains and ranges are explained by Figure 4.8. We start with x22. 
Subtracting 2 gives y20. Taking the square root gives z 20. Taking the square 
brings back y 2 0. Adding 2 brings back x > 2—-which is in the original domain of g, 


T=H=Aleqy) veg dol) r=2v-[ ate) 


yee) 2240) yeh!) vag Oo yoy c=y-l yertl ve 1y 


Fig. 4.8 The chain g~ '(h7 '(h(g(x)))) = x is one-to-one at every step. 


EXAMPLE 9 Inverse matrices (4B)°'=B~'A™' (this linear algebra is optional). 


Suppose a vector x is multiplied by a squarc matrix B: y = g(x) = Bx. The inverse 
function multiplies by the inverse matrix: x = g~'(y)= Bo 'y. It is like multiplication 
by B=3 and B”! = 1/3, except that x and y are vectors. 

Now suppose a second function multiplies by another matrix A: z = hi{g(x)) = ABx. 
The problem is to recover x from z. The first step is to invert A, because that came 
last: Bx=A~‘z. Then the second step multiplies by B~' and brings back x= 
B™'A~'z, The product B ‘A ‘ inverts the product AB. The rule for matrix inverses 
is like the rule for function inverses— in fact it is a special case. 

I had better not wander too far from calculus. The next section introduces the 
inverses of the sine and cosine and tangent. and finds their derivatives. Remember 
that the ultimate source is the chain rulc. 
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4.3 EXERCISES 


Read-throngh questions 


The functions g(x) =x—4 and f{y)=y+4are_ 9 func- 
tions, because f{e(x)J=_ 6. Also e(f{y))=_¢_. The 
notation isf=eg-7andg=_ 4. The composition_e __ is 
the identity function. By definition x = g~ 1(y) if and only if 
y=__1__. When » is in the range of g, itisinthe  g of 
g *. Similarly x isin the oh of g when it is in the __| 
of g~‘. If g has an inverse then g(x,)__|] g(x.) at any two 
points. The function g must be steadily __k_ or steadily 

I. 


The chain rule applied to f(9(x)) = x gives (df/dy)(_m__)= 
n_. The slope of g~' times the slope of g equals __© 
More directly dx/dy=1/_ Pp. For y=2x+1 and x= 
4(y — 1), the slopes are dy/dx=_ 4 and dx/dy~_r_. 
For y=x?andx=__s_, the slopesaredy/dx=__t and 

dx/dy=_ 4. Substituting x? for y gives dx/dy=__v 


Then (dx/dy)(dy/dx}= _#__. 


The graph of y = g(x) is also the graph of x=__x__, but 
with x across and y up. For an ordinary graph of g~', take 
the reflection in the line ___y__. If (3, 8) is on the graph of g, 
then its mirror image (__2 _) is on the graph of g~'. Those 
particular points satisfy 8= 23 and 3=_ A 


The inverse of the chain z = A(g{x)) is the chainx=__B 
If g(x) = 3x and A(y}=y? thenz=_C¢__. Its inverse is x = 
D__, which is the composition of _E and _ F 


Solve equations t-10 for x, to find the inverse function x = 
g ‘(y). When more than one x gives the same jy, write 
“no inverse.” 


1 y=3x~-—6 2 y=Ax+B 
3 y=x’-1 4 y=xj/(x— 1} [solve xy — y =x] 
§ y=l4+x7! 6 y=|(2| 
Ty=x~t 8 y=2x4(x| 
9 y=sinx 10 y=x) [draw graph] 
1! Solving yoo gives xy—ay=1 orx= dg Now 
solve that equation for y. 
1 1 
12 Solving y= =~ gives xy~y=x-+1orx="). Draw 


the graph to see why fand f—! are the same. Compute. dy/dx 
and dx/dy. 


13 Suppose fis increasing and f{2) = 3 and f(3) = 5. What can 
you say about f ~ +(4)? 


14 Suppose f{2) = 3 and /(3) =5 and /(5)= 5. What can you 
say about f~?? 


t5 Suppose f{2}=3 and f(3)=5 and f(5)=0. How do you 
know that there is no function f~*? 

16 Vertical Ene test. If no vertical line touches its graph twice 
then f{x) is a function (one y for each x). Hortzontal ine 
test. If no horizontal line touches its graph twice then f(x) is 
invertible because 


17 If f(x) and g(x) are increasing, which two of these might 
not be increasing? 

Sx) + a(x) Six)gtx) F(a(x)) fo) 1/fix) 
18 If y=1/x then x=1/y. If y=1—x then x=1—y. The 
graphs are their own mirror images in the 45° line. Construct 
two more functions with this property f=f~? or fU/{x))} =x. 
19 For which numbers mt are these functions invertible? 

(a)y=mx+b (b)y=mx+x° (c) y= mx + sin x 

20 From its graph show that y = |x| + cx is invertible if ¢ > f 


and also if c < — 1. The inverse of a piecewise linear function 
is plecewise 


In 21-26 find dy/dx in terms of x and dx/dy in terms of y. 


2 y=x5 22 y=1/(x—-1) 
23 y=x?-1 24 y=1/x3 

x ax+b 
Pes ee a 


27 “Hf dy/dx = 1/y then dx/dy = and x= 


28 If dx/dy = l/y then dy/dx = (these functions are 
y=e and x=In y, soon to be honored properly). 


29 The slopes of f(x) =4x° and g(x} = — I/x are x? and 1/x?. 
Why isn’t f= g~ #2 What is g~?? Show that g(g7!)' = 1. 


30 At the points x;, x;, x3 a piecewise constant function 
jumps to y;, yz, y3. Draw its graph starting from y(0)=0. 
The mirror image its piecewise constant with jumps at the 
points to the heights . Why isn’t this the 
inverse function? 


In 31-38 draw the graph of y = 9(x). Separately draw its mirror 
image x = g '()). 


31 y=5x-10 32 y=cosx,O0<x<n 

33 y=1f(x+ 1) 34 y= {x|—2x 

3 y=1O 36 y=\/1-x*,0<x<1 
37 y=2™* 3 y=1//1—x7,0<x<1 


In 39-42 find dx/dy at the given point. 
39 y=sin x atx=2/6 40 y=tan x atx=2/4 


41 y=sin x? at x=3 42 y=x—sinx atx=0 


44 inverses of Trigonometric Functions A741 


43 If y is a decreasing function of x, then x is a 
function of y. Prove by graphs and by the chain rule. 


44 If f(x) > x for all x, show that f~'(y) < y. 
45 True or false, with example: 
(a) If f(x) is invertible so is h(x) = (lx). 


(b) If f(x) is invertible so is h(x) = fUf{x)). 
(c) f~ '(y) has a derivative at every y. 


In the ehains 46-51 write down o({x) and /(y) and their inverses. 
Then find x = g~'(f~*{z)). 


46 z= 5(x—4) aT z= (x7 
48 z=(6+x)> 49 7=64+x° 
50 z=44x+4)4+4 51 z=log(10") 


52 Solving /(x} =0 is a large part of applied mathematics. 
Express the solution x” in terms of f~ !: x* = ; 
§3 (a) Show by example that d?x/dy? is not 1/(d7y/dx?), 


(b) If y is in meters and x is in seconds, then d7y/dx? is in 
and d2x/dy? is in 


4.4 Inverses of Trigonometric Functions 


54 Newton’s method solves f(x*)=0 by applying a linear 
approximation to f~!: 

J~*0) =f" (y) + Gf /dy)(O — y). 
For y = f(x} this is Newton’s equation x* + x + 
55 If the demand is t/(p+ t)? when the price is p, then the 
demand is y when the price is . If the range of prices 
is p > 0, what is the range of demands? 
56 If dF/dx=f{x) show that the derivative of G(y)= 
yf Xy)— FU) is f-*00). 
57 For each number y find the maximum value of yx — 2x‘. 


This maximum is a function G(y). Verify that the derivatives 
of G{y} and 2x* are inverse functions. 


58 (for professors only) If G{y)} is the maximum value of 
yx — F(x), prove that F(x) is the maximum value of xy — G{y). 
Assume that /(x)=dF/dx is increasing, like 8x° in 
Problem 57. 


59 Suppose the richest x percent of people in the world have 
10./x percent of the wealth. Then y percent of the wealth is 
held by percent of the people. 


Mathematics is built on hasic functions like the sine, and on basic ideas like the 
inverse. Therefore if is totally natural to invert the sine function. The graph of x= 
sin” ‘y is a mirror image of y = sin x. This is a case where we pay close attention to 
the domains, since the sine goes up and down infinitely often. We only want one piece 


of that curve, in Figure 4.9. 


For the hold line the domain is restricted. The angle x lies between — 17/2 and + 7/2. 
On that interval the sine is increasing, so each y comes from exactly one angle x. If 
the whole sine curve is allowed, infinitely many angles would have sin x = 0. The sine 


sin! y + cos! y = + 


Ag. 4.9 Graphs of sin x and sin” +y. Their slopes are cos x and t/,/1— y?. 
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function could not have an inverse. By restricting to an interval where sin x is increas- 
ing, we make the function invertible. 
The inverse function brings y back to x. It is x =sin”'y (the inverse sine): 


x=sin ‘y when y= sin x and |x| < 2/2. (1) 


The inverse starts with a number y between —1 and 1. It produces an angle x = 
sin” 'y—the angle whose sine is y. The angle x is between — 7/2 and 2/2, with the 
required sine. Historically x was calied the “arc sine” of y, and arcsin is used in 
computing. The mathematical notation is sin~'. This has nothing to do with 1/sin x. 

The figure shows the 30° angle x = 7/6. Its sine is y=}. The inverse sine of } is 1/6. 
Again: The symbol sin™'(1) stands for the angle whose sine is 1 {this angle is x = 
7/2). We are seeing g~ 1(g{x}) = x: 


sin” ‘(sin x)= x for — 5 <x<t sin(sin-'y)= y for -l<y<l. 


EXAMPLE 1 (important) If sin x = y find a formula for cos x. 


Solution We are given the sine, we want the cosine. The key to this problem must 
be cos?x = | — sin?x. When the sine is y, the cosine is the square root of 1 — y?: 


cos x = cos(sin™1y)=./1—- y’. (2) 


This formula is crucial for computing derivatives. We use it immediately. 
THE DERIVATIVE OF THE INVERSE SINE 


The calculus problem is to find the slope of the inverse function f{y)= sin” 'y. 
The chain rule gives (slope of inverse function) = tslope of original function). 
Certainly the slope of sin x is cos x. To switch from x to y, use equation (2): 
=sin x gives dy =cos x so that Le ae (3} 
7 & dx dy cosx (/{—y? 


This derivative 1/./1— y* gives a new v-f pair that is extremely valuable in calculus: 
velocity u(t)= 1/./1— 2? ‘distance f(t)=sin™'t. 


Inverse functions will soon produce two more pairs, from the derivatives of tan” 'y 
and sec” 'y. The table at the end lists all the essential facts. 


EXAMPLE 2 The slope of sin™'y at y=1 is infinite: 1/./1— y? = 1/0. Expiain. 
At y= 1 the graph of y =sin x is horizontal. The slope is zero. So its mirror image 
is vertical. The slope 1/0 is an extreme case of the chain rule. 


Question What is d/dx (sin-'x)? Answer 1/./f—x?. I just changed letters. 
THE INVERSE COSINE AND ITS DERIVATIVE 


Whatever is done for the sine can be done for the cosine. But the domain and range 
have to be watched. The graph cannot be allowed to go up and down. Each y from 
—1 to 1 should be the cosine of only one angle x. That puts x between 0 and x. Then 
the cosine is steadily decreasing and y = cos x has an inverse: 


cos (cos x)= x and cos(cos~!y) = y. (4) 


(0, 1) 


4.4 Inverses of Trigonometric Functions 


The cosine of the angle x = 0 is the number y= 1. The inverse cosine of y = 1 is the 
angle x = 0. Those both express the same fact, that cos 0 = 1. 

For the slope of cos~'y, we could copy the calculation that succeeded for sin“ 'y. 
The chain rule could be applied as in (3). But there is a faster way, because of a 
special relation between cosy and sin™'y. Those angles always add to a right angle: 


cos~'y+sin~'y=2/2. (5) 


Figure 4.9c shows the angles and Figure 4.10c shows the graphs. The sum is 7/2 (the 
dotted line), and its derivative is zero. So the derivatives of cos~'y and sin™'y must 
add to zero. Those derivatives have opposite sign. There is a minus for the inverse 
cosine, and its graph goes downward: 


The derivative of x=cos 'y is dx/dy=—1/,./1—y?. (6) 


(10/2, 0) 


(—1,—1/2) 


Fig. 4.10 The graphs of y=cos x and x =cos~'y. Notice the domain 0 <x <2. 


Question How can two functions x=sin~'y and x= —cos~'y have the same 
derivative? 
Answer sin ‘'y must be the same as —cos~'y + C. Equation (5) gives C = 2/2. 


THE INVERSE TANGENT AND ITS DERIVATIVE 


The tangent is sin x/cos x. The inverse tangent is not sin” 'y/cos~'y. The inverse 
function produces the angle whose tangent is y. Figure 4.11 shows that angle, which 
is between — 72/2 and 7/2. The tangent can be any number, but the inverse tangent 
is in the open interval — 1/2 <x < 1/2. (The interval is ‘““open”’ because its endpoints 
are not included.) The tangents of 2/2 and —7/2 are not defined. 

The slope of y = tan x is dy/dx = sec?x. What is the slope of x = tan” 'y? 


d | 1 | 
By the chain rule = =—- = ——_,_ = —_. (7) 


df 


4E The derivative of f(y) =tan~ 'y is as = 
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4 Derivatives by the Chain Rule 


-3 -2 -1 (00) Ts 2 
Aig. 4.44 x=tan~'y has slope 1/(1 + y’). x=sec™ 'y has slope 1/|p|./y? — 1. 


EXAMPLE 3 The tangent of x = 7/4 is y = 1. We check slopes. On the inverse tangent 
curve, dx/dy = 1/(1 + y?)= 4. On the tangent curve, dy/dx = sec?x. At 2/4 the secant 
squared equals 2. The slopes dx/dy = 4% and dy/dx = 2 muitiply to give 1. 


Important Soon will come the following question. What function has the derivative 
1/(1 + x”)? One reason for reading this section is to learn the answer. The function 
is in equation (8)—if we change letters. It is f(x) = tan” ‘x that has slope 1/(1 + x”). 


Csex 


i secx : 
sinx lanx 


cos x | cot x 


Ag. 4.42 cos?x+sin?x=1 and L+tan?x=sec?x and 1 +cot?x =csc*x. 


INVERSE COTANGENT, INVERSE SECANT, INVERSE COSECANT 


There is no way we can avoid completing this miserable list! But it can be painiess. 
The idea is to use 1/(dy/dx) for y= cot x and y=sec x and y= csc x: 
- 1 -1 
epee an Ba and ee (9) 
dy csc*x dy secxtanx dy cscxcotx 
In the middle equation, replace sec x by y and tan x by + ./y? — 1. Choose the sign 
for positive slope (compare Figure 4.11). That gives the middle equation in (10): 


The derivatives of cot” 'y and sec” *y and csc” ‘y are 
—1 


y=—-— SL feaety=$—__. (10) 
h/yy-1 dy bi/y?- 1 


Note about the inverse secant When y is negative there is a choice for x = sec” ‘y. 
We selected the angle in the second quadrant (between 1/2 and 2). Its cosine is 
negative, so its secant is negative. This choice makes scc”1y=cos~+(1/y), which 
matches sec x = 1/cos x. It also makes sec” 'y an increasing function, where cos” ‘y 
is a decreasing function. So we needed the absoiute value |p| in the derivative. 


= -1 


d gt Se, EN d 
Fria Try ies 


44 Inverses of Trigonometric Functions 


Some mathematical tables make a different choice. The angle x could be in the 
third quadrant (between —x and — 2/2). Then the slope omits the absolute value. 
Summary For the six inverse functions it is only necessary to learn three derivatives. 
The other three just have minus signs, as we saw for sin~*y and cos~*y. Each inverse 
function and its “cofunction” add to 2/2, so their derivatives add to zero. Here are 
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the six functions for quick reference, with the three new derivatives. 


function f(y) inputs y 
sin-‘y, cos” 1y ly <1 
tan~+y, cot” 'y all y 


outputs x slope dx/dy 


mR 1 
|- > 4] [0, x] + ag 


qr 1 
(- oy 3) (0, m) + j +y2 


* 
sectyeety let 0a7%]- 5,5] 


1 
+ 
bh/y?- 1 


If y=cos x or y=sin x then |y| < 1. For y=sec x and y = csc x the opposite is true; 
we must have |y| 2 1. The graph of sec” y misses all the points -1<y< 1. 

Also, that graph misses x = x/2—-where the cosine is zero. The secant of 2/2 would 
be 1/0 (impossible). Similarly csc~ 1y misses x = 0, because y = csc O cannot be 1/sin 0. 
The asterisks in the table are to remove those points x = 2/2 and x= 0. 

The column of derivatives is what we need and use in calculus. 


4.4 EXERCISES 


Read-through questions 


The relation x=sin~'y means that 9 is the sine of 
6. Thus x is the angle whose sine is_¢ _. The number 
y lies between ds and __@__. The angie x lies between 
ft and _g_. {If we want the inverse to exist, there 
cannot be two angles with the same sine.} The cosine of the 
angle sin™'y is h__. The derivative of x=sin™'y is 
dxjdy=__J_. 


The relation x =cos~'y means that yequals_ | _. Again 
the number y lies between __ kk and __!__., This time the 
angle x lies between om and ____ (so that each y comes 
from only one angle x). The sum sin~'y+cos"‘y=_ 0. 
(The angles are called__p__, and they addtoa__q_angle.} 


Therefore the derivative of x=cos~'y is dx/dy=__t__, the 
same as for sin™'yexeept fora 8 ___ sign. 


The relation x=tan~'y means that y=_t_. The 
number y lies between _u and _v¥__. The angle x lies 
between _w_and__x__. The derivative is dx/dy=__¥ 


Since tan™'y+cot™'y=__2 __, the derivative of cot~'y is 
the same except fora _A__ sign. 


The relation x =sec”'y means that __8 . The number y 
never lies between C and_ D__. The angle x lies between 
E and Ff _, but neveratx=_ &_. The derivative of 


x=sec”'yis dx/dy=_H__ 


In 1-4, find the angles sin~'y and cos”'y and tan™'y in 


radians. 
ly=0 2y=-I Zy=1 4y= 3 
5 We know that sinz =0, Why isn’t 7 =sin~ ‘0? 
6 Suppose sin x = y. Under what restriction is x =sin™ ‘y? 


7 Sketch the graph of x =sin™'y and locate the points with 
slope dx/dy = 2. 


8 Find dx/dy if x =sin~' 4y. Draw the graph. 


9 If y=cos x find a formula for sin x. First draw a right 
triangle with angle x and near side y—what are the other 
two sides? 


10 If y=sin x find a formula for tan x. First draw a right 
triangle with angle x and far side y—what are the other sides? 


11 Take the x derivative of sin~ ‘(sin x) = x by the chain rule. 
Check that d(sin™'y)/dy = 1/,/1 — y? gives a correct result. 


12 Take the y derivative of cos(cos—!y) = y by the chain rule. 
Check that d(cos~'y)/dy = —1/,/1 — y? gives a correct result. 


13 At y=Oand y =1, find the slope dx/dy of x = sin='y and 
x=cos~'y and x=tan~‘y. 


14 Atx=Oand x = 1, find the slope dx/dy of x =sin~'y and 
x=cos”'y and x =tan7'y. 
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15 True or false, with reason: 
(a) (sin™ 'y? + (cos”'yP = 1 
{b) sin” 'y = cos” 'y has no solution 
(c) sin” 'y is an increasing function 
(d) sin™'y is an odd function 
(e} sin”! y and —cos”'y have the same slope—so they are 
the same. 


(f) sin{cos x) = cos{sin x} 
16 Find tan(cos” (sin x)} by drawing a triangle with sides 
sin x, cos x, f. 
Compute the derivatives in 17-28 (using the letters as given), 
(7 w=sin7'x 18 u=tan~ ‘2x 
20 z=sin™ ‘(cos x) 
22 z=(sin™‘x)"! 


24 z={1+x7)tan~'x 


19 z=sin” ‘(sin 3x) 
21 2 =(sin~)xy 

23 z= /i—y? sin” 'y 
25 x=sec” {y+ 1) 


27 u=sin'y/cos"' /t — y? 


28 u=sin™'y+cos"'y+tan™'y 


26 u=sec™ (sec x?) 


29 Draw a right triangle to show why tan” *y+cot™'y=n/2, 
3 Draw a right triangle to show why tan” 'y =cot™ ‘{1/y). 
31 If y=tan x find sec x in terms of y. 

32 Draw the graphs of y=cot x and x =cot™ 'y. 


33 Find the slope dx/dy of x =tan~'y at 
{a} y= —3 (b} x =0 {c) x= — 2/4 


34 Find a function u(t} whose slope satisfies u’ + t?u’ = 1. 
35 What is the second derivative d*x/dy? of x = sin” ‘y? 
3% What is d7u/dy? for u=tan™'y? 


Find the derivatives in 37-44. 
37 y=see 5x 38 x =sec™ !2y 
39 u=sec™ '(x") 

41 tan y=(x-1)/(x + 1) 


43 y=sec™' /x? 44 


45 Differentiate cos” 1(t/y) to find the slope of sec” 'y in a 
new way. 


40 u=sec™ '(tan x) 
42 z=(sin x)(sin~ ‘x) 


44 z=sin(cos” 'x}—cos(sin”'x} 


46 The domain and range of x =esc~'y are : 
47 Find a function u(y) such that du/dy = 4//1 — y?. 
48 Solve the differential equation dujdx = 1/(t + 4x”). 
49 If dujdx = 2/./1 — x? find u(1)— u{0). 


50 (recommended) With u(x) =(x — 1)/(x + 1), find the deriv- 
ative of tan” ‘ix). This is also the derivative of . So 
the difference between the two functions is a 


§1 Find u(x} and tan’ ‘u(x) and tan” 'x at x =O and x = 0. 
Conclusion based on Problem 50: tan” ‘u{x) —tan™?x equals 
the number 


§2 Find wx) and tan~'u(x) and tan™'x as x = — oo. Now 
tan ‘w(x)— tan” 'x equals . Something has hap- 
pened to tan” ‘u{x), At what x do u(x} and tan” 'u(x) change 


instantly? 
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CHAPTER 5 


Integrals 


Eee = 5.1 The idea of the Integral 14 


This chapter is about the idea of integration, and also about the technique of integ- 
ration. We explain how it is done in principle, and then how it is done in practice. 
Integration is a problem of adding up infinitely many things, each of which is infini- 
tesimally small. Doing the addition is not recommended. The whole point of calculus 
is to offer a better way. 

The probiem of integration is to find a limit of sums. The key is to work backward 
from a lmit of diflerences (which is the derivative). We can integrate v(x) if it turns 
up as the derivative of another function f(x). The integral of v = cos x is f= sin x. The 
integral of e =x is f= 4x7. Basically, f(x) is an ‘‘antiderivative’, The list of f*s will 
grow much longer (Section 5.4 is crucial). A selection is inside the cover of this book. 
If we don't find a suitable f(x), numerical integration can still give an excellent answer. 

I could go directly to the formulas for integrals, which allow you to compute areas 
under the most amazing curves. (Area is the clearest example of adding up inknitely 
many infinitely thin rectangles, so it always comes first. It is certainly not the only 
problem that integral calculus can solve.) But Lam really unwilling just to write down 
formulas, and skip over all the ideas. Newton and Leibniz had an absolutely brilliant 
intuition, and there is no reason why we can't share it. 

They started with something simple. We will do the same. 


SUMS AND DIFFERENCES 


Integrals and derivatives can be mostly explained by working (very briefly) with sums 
and differences. Instead of functions, we have n ordinary numbers. The key idea is 
nothing morc than a basic fact of algebra. In the limit as n — ac, it becomes the basic 
fact of calculus. The step of “going to the limit” is the essential difference between 
algebra and calculus! It has to be taken, in order to add up infinitely many 
infinitesimals— but we start out this side of it. 
To see what happens before the limiting step, we need twa sets of n numbers. The 
first set will be ¢,,¢2,....0,. where t suggests velocity. The second set of numbers 
will be f,.f3,....f,. where f recalls the idea of distance. You might think d would 
be a better symbol for distance, but that is needed for the dx and dy of calculus. ‘rand 
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5 Integrale 


A first example has n= 4: 
04, 02, 03, vg = 1, 2 3,4 Sis fas Safe = 1, 3, 6, 10. 


The relation between the v’s and f's is seen in that example. When you are given 
1, 3, 6, 10, how do you produce 1, 2, 3, 4? By taking differences. The difference 
between 10 and 6 is 4. Subtracting 6 — 3 is 3. The difference f, —f, = 3-1 is v2 =2. 
Each pv is the diflerence between two f’s: 


u; is the difference f;— f,-,. 


This is the discrete form of the derivative. I admit to a small difficulty at j = 1, from 
the fact that there is no fo. The first v should be f; — fo, and the natural idea is to 
agree that fo is zero. This need for a starting point will come back to haunt us (or 
help us) in calculus. 

Now look again at those same numbers—but start with v. From v = 1, 2, 3, 4 how 
do you produce f= |, 3, 6, 10? Ry taking sums. The first two v’s add to 3, which is f. 
The first three v’s add to f, = 6. The sum of all four v’s is 1 +2+3+4= 10. Taking 
sums is the opposite of taking differences. 

That idea from algebra is the key to calculus. The sum f; involves ali the numbers 
v, +9, ++ +0;. The diflerence v; involves only the two numbers f;— f;_,. The fact 
that one reverses the other is the “Fundamental Theorem.” Calculus will change sums 
to integrals and differences to derivatives—but why not let the key idea come through 
now? 


SA Fundamental Theorem of Catcutus (before limits): 


If each v,=f,—f,-,, them v, +0, + +0, =f, fo. 


The differences of the f’s add up to f, —fy. All f’s in between are canceled, leaving 
only the last f, and the starting fg. The sum “telescopes”: 


vy tog touyt- toy, =(f, -fot+(h —fit Us f+ + - Sn) 


The number f, is canceled by —/f,. Similarly —f, cancels f, and —f; cancels fy. 
Eventually f, and —fo are left. When fQ is zero, the sum is the final f,. 
That completes the algebra. We add the v’s by finding the f’s. 


Question How do you add the odd numbers 1+ 3+ 5+ -- + 99 (the v's)? 
Answer They are the differences between 0, 1, 4,9, .... These f’s are squares. By the 
Fundamental Theorem, the sum of 50 odd numbers is (50). 


The tricky part is to discover the right f’s! Their differences must produce the v’s. 
In calculus, the tricky part is to find the right f(x). Its derivative must produce v(x). 
It is remarkable how often f can be found—more often for integrals than for sums. 
Our next step is to understand how the integral is a limit of sums. 


SUMS APPROACH INTEGRALS 


Suppose you start a successful company. The rate of income is increasing. After 
x years, the income per year is ,/x million dollars. In the first four years you reach 
JI P 2, WE? and /4 million dollars. Those numbers are displayed in a bar graph 
(Figure 5.1a, for investors). I realize that most start-up companies make losses, but 
your company is an exception. If the example is too good to be true, please keep 
reading. 


5.1 The idea of the Integral 


Income 
per year 


3 4 
Fig. §.4 Total income = total area of rectangles = 6.15. 


The graph shows four rectangles, of heights V1, Ja; J/3, /4. Since the base of 
each rectangle is one year, those numbers are also the areas of the rectangles, One 
investor, possibly weak in arithmetic, asks a simple question: What is the total income 
for all four years? There are two ways to answer, and I will give both, 

The first answer is \/1+./2+./3+./4. Addition gives 6.15 million dollars. 
Figure 5.1b shows this total—which is reached at year 4. This is exactly like velocities 
and distances, but now v is the mscome per year and fis the total income. Algebraically, 
fy is still oy + + 0;. 

The second answer comes from geometry. The total income is the total area of the 
rectangles. We are emphasizing the correspondence between addition and area. That 
point may scem obvious, but it becomes important when a second investor (smarter 
than the first) asks a harder question. 


Here is the problem. The incomes as stated are false. The company did not make 
a million dollars the first year. Alter three months, when x was 1/4, the rate of income 
was only Jx = 1/2. The bar graph showed Jt = | for the whole year, but that was 
an overstatement. The income in three months was not more than 1/2 times 1/4, the 
rate multiplied by the time. 

All other quarters and years were also overstated. Figure 5.2a is closer to reality, 
with 4 years divided into 16 quarters. It gives a new estimate for total income. 

Again there are two ways to find the total. We add J/1/4 + J+ e+ /16/4, 
remembering to multiply them ali by 1/4 (because each rate applies to 1/4 year). 
This is also the area of the 16 rectangles. The area approach is better because the 1/4 
is automatic. Each rectangle has base 1/4, so that factor enters each area. The total 
area is now 5.56 million dollars, closer to the truth. 

You see what is coming. The next step divides time into weeks. Alter one week the 
rate ./x is only ./1/52. That is the height of the first rectangle—its base is Ax = 
1/52. There is a rectangle for every week. Then a hard-working investor divides time 
into days, and the base of each rectangle is Ax = 1/365. At that point there are 
4 x 365 = 1460 rectangles, or 1461 because of leap year, with a total area below 54 
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million dollars. The calculation is elementary but depressing—adding up thousands 
of square roots, each multiplied by Ax from the base. There has to be a better way. 


The better way, in fact the best way, is calculus. The whole idea is to allow for 
continuous change. The geometry problem is to find the area under the square root 
curve, That question cannot be answered by arithmetic, because it involves a fimit. 
The rectangles have base Ax and heights (AX, /2Ax, Apsa Ae There are 4/Ax 
rectangles—more and more terms from thinner and thinner rectangles. The area is 


the limit of the sum as Ax - 0. 


This limiting area is the ‘integral.’ We are looking for a number below 5}. 


Algebra (area of n rectangles): Compute e; +: +, by finding f's. 
Key idea: If tc; =f, —f;-., then the sum is f, — fo. 

Calculus (area under curve): Compute the limit of Ax[v{Ax) + v(2Ax) + --], 
Key idea: If (x) = dfidx then area = integral to he explained next. 


5.4 EXERCISES 


Read-through questions 


The problem of summation ts 1o add uv, ~ - + &,. {tis solved 
if we find f°s such that v,=__o _. Then e, + -- ~ ¢, equals 
b_. ~The cancellation in (f; -—fo)+(fo-ff)t0+ 


(f,—f,-1) leaves only — ¢ . Taking sums is the _ dof 


taking differences. 


The differences between 0,1,4.9 are ty.t.,¢;=__e _ 
For f; _ j? the difference between f,, and f, is r;y9=__t 


From this pattern |] + 3+ 354-7" + 19 equals _g . 


For functions, finding the integral is the reverse of __h 
If the derivative of f(x) is x(x}, then the _ i of ax) is f{x). 
If cx) = 10x then f{x)=__] . Thisisthe__k ofa triangle 
with base x and height (0x. 

Integrals begin with sums. The trtangle under c= 10x out 
tox=4hasarea_ |. I is approximated by four rectangles 
of heights 10, 20, 30, 40 and area m_. It is better approxi- 
mated by eight rectangles of heights nn and area __ 6 
For # rectangles covering the triangle the area is the sum of 

p_. As a-~ « this sum should approach the number 


q__. That is the integral of v= 10x from 0 to 4. 
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Problems 1-6 are about sums /, and differences «;. 


1 With v= 1, 2, 4, 8, the formula for p; is 
Find f,,/5./3,.f4 starting from fo =0. What is f5? 


(not 24). 


2 The saine c=1,2,4.8.... are the differences between 
f= 1.2.4.8, 16... Now fo=1 and f,= 24. (a) Check that 
25 — 2+ equals vs. (b) What is 1+2+4+8+ 16? 


3 The differences between f= 1, 1/2, 1/4, 1/8 are v= 
~ 1/2, —1/4, —1/8. These negative v's do not add up to these 
positive f's. Verify that ry +e. +63 =f4 —fo is still true. 


4 Any constant C can be added to the antiderivative fix) 
because the of a constant is zero. Any C can be 
added to fo.f,... because the between the f’s is 
not changed. 


5 Show that f,=r/i(r — has f;-f;-, =r’ |. Therefore the 
geometric scries [+r+--+r/7' adds up to 
(remember to subtract fo). 


6 The sums f;=(r/—1t)i(r—1) also have f,—-fj_y =r. 
Now fy =_____. Therefore 1 +r +--+ 1/7! adds up to 
f; The sum 1+r+-" +r" equals 


7 Suppose e(x) = 3 for x < 1 and e(x) = 7 for x > L. Find the 
area f(x) from 0 to x, under the graph of ¢(x). (Two pieces.) 


8 fe=1, -2,3, —4,.... write down the f’s starting from 
fo =0. Find formulas for xv; and f; when j ts odd and j is even, 


Problems 9—16 are about the company earning Vix per year. 


9 When time ts divided into weeks there are 4 x 52 = 208 
rectangles. Write down the first area, the 208th area, and the 
jth area, 


10 How do you know that the sum over 208 weeks is smaller 
than the sum over 16 quarters? 


11 A pessimist would use Vx at the beginning of cach time 
period as the income rate for that period. Redraw Figure 5.1 
(both parts) using heights \0, 1. \2, .3. How much lower 
is the estimate of total income? 


12 The same pessimist would redraw Figure 5.2 with heights 
0, 4/14, .... What is the height of the last rectangle? How 
much does this change reduce the total rectangular arca 5.56? 


13 At every step from years to weeks to days to hours, the 
pessimist’s area goes and the optimist’s area goes 
. The difference between them ts the area of the last 


14 The optimist and pessimist arrive at the same limit as 
years are divided into weeks, days, hours, seconds. Draw the 
x curve between the rectangles to show why the pessimist 
is always too low and the optimist is too high. 


15 (Important) Let f(x) be the area under the /x curve, above 
the interval from 0 to x. The area to x + Ax is f(x + Ax), The 
extra area is Af= , This is almost a rectangle with 
base and height Rf So Af/Ax is close to 

As Ax +0 we suspect that dfidx = 


16 Draw the we curve from x =Q to 4 and put triangles 
below to prove that the area under it is more than 5. Look 
left and right from the point where Jt =, 


Problems 17-22 are about a company whose expense rate 
u(x) = 6 — x is decreasing. 


. The total 
. This is the area 


17 The expenses drop to zero at x= 
expense during those years equals 
of 


18 The rectangles of heights 6, 5,4, 3,2,1 give a total 
estimated expense of . Draw them enclosing the 
triangle to show why this total is too high. 


19 How many rectangles (enclosing the triangle} would you 
need before their areas are within 1 of the correct triangular 
area? 


20 The accountant uses 2-year intervals and computes e = 
5,3, 1 at the midpoints (the odd-numbered years). What is 
her estimate, how accurate Is it, and why? 


21 What is the area f(x) under the line v{x) = 6 — x above the 
interval from 2 to x? What is the derivative of this f(x)? 


22 What is the area f(x) under the line w(x) = 6 — x above the 
interval fron: x to 6? What is the derivative of this f(x)? 


23 With Ax = 1:3, find the area of the three rectangles that 
enclose the graph of (x) = x?. 


24 Draw graphs of bax and v= x? from 0 to I. Which 
areas add to !? The same is true for ¢ = x? and c = 


25 From x to x+Ax, the area under c= x? is Af This 
is almost a rectangle with base Ax and height . So 
Af‘Ax is close to . In the limit we find dfjdx = x* 
and f(x) = 


26 Compute the area of 208 rectangles under i{x} = x from 
x=Otox=4. 


182 3 Integrals 
Re §8=865.2 Antiderivaive: Ea 


The symbol J was invented by Leibniz to represent the integral. It is a stretched-out 
S, from the Latin word for sum. This symbol is a powerful reminder of the whole 
construction: Sum approaches integral, § approaches |, and rectangular area 
approaches curved area: 
curved area = { v(x) dx =f ./x dx. (1) 
The rectangles of base Ax lead to this limit—the integral of /x. The “dx” indicates 
that Ax approaches zero. The heights v; of the rectangles are the heights u(x) of the 
curve. The sum of », times Ax approaches “the integral of v of x dx.” You can imagine 
an infinitely thin rectangle above every point, instead of ordinary rectangles above 
special points. 
We now find the area under the square root curve. The “dimits of integration” are 
0 and 4, The lower limit is x = 0, where the area begins. (The start could be any point 
x =a.) The upper limit is x = 4, since we stop after four years. (The finish could be 
any point x = b.) The area of the rectangles is a sum of base Ax times heights ./x. 
The curved area is the limit of this sum. That limit is the integral of /x from 0 to 4: 


x=a 
im, Ie /Ax)(Ax) + (./2Ax)(Ax) + + (/4(Ax) | = | /x dx. (2) 
x x=0 
The outstanding problem of integral calculus is still to be solved. What is this limiting 
area? We have a symbol for the answer, involving { and \/x and dx—but we don’t 
have a number. 


THE ANTIDERIVATIVE 


I wish I knew who discovered the area under the graph of Jx. It may have been 
Newton. The answer was available earlier, but the key idea was shared by Newton 
and Leibniz. They understood the parallels between sums and integrals, and between 
differences and derivatives. I can give the answer, by following that analogy. I can’t 
give the proof (yet)—it is the Fundamental Theorem of Calculus. 

In algebra the difference f; ~ f;_, is v;. When we add, the sum of the v’s is f, — fo- 
dn caicutus the derivative of f(x) is u(x). When we integrate, the area under the v(x) 
curve is f(x) minus f(0). Our problem asks for the area out to x = 4; 


5B (Discrete vs. continuous, rectangles vs. curved areas, addition vs. 
integration) The integral of v(x) is the difference in f(x): 


If dfjdx = \/x then area = (77% \/x dx = f4)— (0). (3) 


What is f(x)? Instead of the derivative of /x, we need its “antiderivative.” We have 
to find a function f(x) whose derivative is a5 x It is the opposite of Chapters 2-4, and 
requires us to work backwards. The derivative of x" is nx"~!1—now we need the 
antiderivative. The quick formula is f(x) = x"*1/(n + 1)—we aim to understand it. 


Solution Since the derivative lowers the exponent, the antiderivative raises it. We 
go from x?/? to x?/?. But then the derivative is (3/2)x!/?. It contains an unwanted 
factor 3/2. To cancel that factor, put 2/3 into the antiderivative: 


S(x) = 4x3? has the required derivative (x)= x!!? = ./x. 
q 


5.2 Antiderivatives 
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fig. 5.3 The integral of xx) = Jx is the exact area 16/3 under the curve. 


There you see the key to integrals: Work backward from derivatives (and adjust). 
Now comes a number—the exact area. At x = 4 we find x? = 8. Multiply by 2/3 
to get 16/3. Then subtract /(0) = 0: 
uae cht RRS FF Cy i ee 
i Vx dx= 34 ts 300°” = 38) = 


=0 


16 
~— 4 
; 4 
The total income over four years is 16/3 = 5§ million dollars. This is f{4) — (0). The 
sum from thousands of rectangles was slowly approaching this exact area 5}. 

Other areas The income in the first year, at x = 1, is 3(1)*/? = 4 million dollars. 
(The false income was | million dollars.) The total income after x years is $x°/, 
which is the antiderivative f(x). The square root curve covers 2/3 of the overall rectangle 
it sits in. The rectangle goes out to x and up to ,/x, with area x*/?, and 2/3 of that 
rectangle is below the curve, (1/3 is above.) 


Other antiderivatives The derivative of x* is 5x*. Therefore the antiderivative of x* 
is x°/5. Divide by 5 (or n + 1) to cancel the 5 (or n + £) from the derivative. And don’t 
allown+1=0: 


The derivative (x) =x" has the antiderivative f(x) = x"*'/(n + 1). 


EXAMPLE 4 The antiderivative of x? is 4x*. This is the area under the parabola 
wx) = x”. The area out to x = 1 is $(1)° — $(0), or 1/3. 


Remark on ./x and x? The 2/3 from /x and the 1/3 from x? add to 1. Those are 
the areas below and above the ./x curve, in the corner of Figure 5.3. If you turn the 
curve by 90°, it becomes the parabola. The functions y = af and x = y” are inverses! 
The areas for these inverse functions add to a square of area I. 


AREA UNDER A STRAIGHT LINE 


You already know the area of a triangle. The region is below the diagonal line v = x 
in Figure 5.4. The base is 4, the height is 4, and the area is 3(4)(4) = 8. Integration is 
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Fig. 5.4 Triangular area 8 as the limit of rectangular areas 10, 9, 84, .... 


not required But if you allow calculus to repeat that answer, and build up the integral 
fix) = 3x? as the limiting area of many rectangies, you will have the beginning of 
nehing important. 

The four rectangles have area 1+ 2+3+4=10. That is greater than 8, because 
the triangle is inside. 10 is a first approximation to the triangular area 8, and to 
improve it we need more rectangles. 

The next rectangles will be thinner. of width Ax = 1/2 instead of the original 
Ax = 1. There will be eight rectangles instead of four. They extend above the line, 
so the answer is still too high. The new heights are !/2, 1, 3/2, 2, 5/2, 3, 7/2, 4. The 
total area in Figure 5.4b is the sum of the base Ax = 1/2 times those heights: 


area=3(¢+1+43+2+-- + 4)=9 (which is closer to 8). 


Question What is the area of 16 reclangles; Their viene are }. 5... 4. 
Answer With base Ax = 4 the area is #4 +44+-°°+4)= 


The effort of doing the addition is increasing. A formula for the sums is needed, and 
will be established soon. (The next answer would be 84.) But more important than 
the formula is the idea. We are carrying out a limiting process, one step at a time. The 
area of the rectangles is approaching the area of the triangle. as Ax decreases. The 
same limiting process will apply to other areas, in which the region is much more 
complicated. Therefore we pause to comment on what is important. 


Area Under a Curve 

What requirements are imposed on those thinner and thinner ea iste It is not 
essential that they all have the same width. And it is not required that they cover the 
triangle completely. The rectangles could lic below the curve. The limiting answer 
will still be 8 even if the widths Ax are unequal and the rectangles fit inside the 
triangle or across it. We only impose two rules: 

i. The largest width Ax,,,, Must approach zcro. 

2. The top of each rectangle must touch or cross the curve. 


The area under the graph is defined to be the limit of these rectangular areas, if that 
limit exists. For the straight line, the limit does exist and equals 8. That Jimit is 
independent of the particular widths and heights—as we absolutely insist it should 
be. 

Section 5.5 allows any continuous u(x). The question wili be the same— Does the 
limit exist? The answer will be the same— Yes. That limit will be the integral of v(x). 
and it will be the area under the curve, It will be f(x). 


5.2 Antiderivatives 


EXAMPLE 2 The triangular area from 0 to x is }(base)(height) = $(x)(x). That is 
Six) = 4x7. Its derivative is o(x) =x. But notice that 4x? + 1 has the same derivative. 
So does f= 4x? + C, for any constant C. There is a “constant of integration” in f(x), 
which is wiped out in its derivative v(x). 


EXAMPLE 3 Suppose the velocity is decreasing: o(x) = 4— x. If we sample v at x= 
1, 2, 3, 4, the rectangles lie under the graph. Because v is decreasing, the right end of 
each interval gives v,j;,. Then the rectangular area 3+ 2+ 1+0=6 is less than the 
exact area 8. The rectangles are inside the triangle, and eight rectangles with hase 4 
come closer: 

rectangular area = $(35 + 3+ °° +4+0)=7. 


Sixteen rectangles would have area 74, We repeat that the rectangles need not have 
the same widths Ax, but it makes these calculations easier. 

What is the area out to an arbitrary point (like x =3 or x= 1)? We could insert 
rectangles, but the Fundamental Theorem offers a faster way. Any antiderivative of 
4— x will give the area. We look for a function whose derivative is 4 — x. The derivative 
of 4x is 4, the derivative of 4x? is x, so work backward: 


to achieve df/dx = 4— x choose f(x) = 4x — 4x7. 


Calculus skips past the rectangles and computes f(3)= 74. The area between x = 1 
and x = 3 is the difference 1; — 34 = 4, In Figure 5.5, this is the area of the trapezoid. 


The f-curve flattens out when the v-curve touches zero. No new area is being added. 
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Fig. 5.5 The area is Af= 74 — 34 = 4. Since v(x) decreases, f(x) bends down. 


INDEFINITE INTEGRALS AND DEFINITE INTEGRALS 
We have to distinguish two different kinds of integrals. They both use the antideriva- 
tive f(x). The definite one involves the limits 0 and 4, the indefinite one doesn’t: 
The indefinite integral is a function f(x) = 4x — 4x?. 
The definite integral from x = 0 to x = 4 is the umber f(4) — f(0). 


The definite integral is definitely 8. But the indefinite integral is not necessarily 
4x — 4x. We can change f(x) by a constant without changing its derivative (since the 
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Read-through questions 


Integration yields the 9 under a curve y = o{x). It starts 
from rectangles with base > and heights x) and areas 


§ Integrals 


derivative of a constant is zero). The following functions are also antiderivatives: 
Jix)=4x—4x7 41, fxhs4x—$27-9, 0 f= 4x-fx7 + C. 


The first two are particular examples. The last is the general case. The constant C 
can be anything (including zero), to give all functions with the required derivative. 
The theory of calculus will show that there are no others. The indefinite integral is 
the most general antiderivative (with no limits): 


indefinite integral {(x)= feos dx =4x—4x?+C. (5} 


By contrast, the definite integral is a number. It contains no arbitrary constant C. 
More that that, it contains no variable x. The definite integral is determined by the 
function v(x) and the limits of integration (also known as the endpoints). It is the area 
under the graph between those endpoints. 

To see the relation of indefinite to definite, answer this question: What is the definite 
integral between x=1 and x=3? The indefinite integral gives /(3)= 73+ C and 
f(1) = 34 + C. To find the area between the limits, subtract f at one limit from f at the 
other limit: 


(22, e00 dx = (3) - f= (4 FC) - BEE CVS 4. (6) 


The constant cancels itseif! The definite integral is the difference between the values 
of the indefinite integral. C disappears in the subtraction. 

The difference (3) — f(1) is like f, — fo. The sum of v; from | to n has become ‘“‘rhe 
integral of u(x) from | te 3.” Section 5.3 computes other areas from sums, and 5.4 
computes many more from antiderivatives. Then we come back to the definite integral 
and the Fundamenta! Theorem: 


b bog 
| u(x) ix=| af dx = f(b) — fla). (7) 
qa dx 


a 


5.2 EXERCISES 


definite integral {/, 1x) dx =f(1) — (0). 


a 5 2 
c_. As Ax 0 the area v,Ax+-- +v,Ax becomes the cee ais mney 
d___ of e(x}. The symbol for the indefinite integral of v(x) is 3 Liy’x for x7 '"?) 4 (./x}* (or x") 
e P z 
5 xls +(2x)'3 6 xhid igs 
The problem of integration is solved if we find f(x) such 7 2sin x +sin 2x & sec?x + ] 
that . Then fis the _9@ _ of x, and (3 u(x} dx equals : . 
h i__. The limits of integration are__i__. This 9 x cos x (by experiment) 10 x sin x (by experiment} 
isa integral, whichisa__! and not a function f(x). Il sin x cos x 12 sin?x cos x 
The example v(x) =x has f{x)=__m_. It also has fix) = 13 0 (find atl f) 14 —I (find all f) 


. The area under e(x) from 2to6is 9 _. Theconstant 
is canceled in computing the difference _e = minus _9._. Beas if fof oe hencahe definite: siniol 
If e(x} = x® then fix) = __1 ——— pT AGH BF 


The sum ¢, +-°+8,=f,~—fo leads to the Fundamental 


the 


tytoo +e, 15 


integral is f(b) — f(a). Finding the _¥__under the integral. The difference of f's is divided by 


t-graph is the opposite of finding the _ * _ of the /-graph. approach the derivative. 


Find an antiderivative {(x) for x(x) in 1-14. Then compute the 


15 If dfidx = v(x) then the definite integral of v(x) from a to 


16 The areas include a factor Ax, the base of cach rectangle. 
Theorem i o(x)dx=__s The __t__ integral is f(x) and So the sum of z’s is multiplied by to approach the 
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17 The areas of 4, 8, and 16 rectangles were 10, 9, and 84, 
containing the triangle out to x = 4. Find a formula for the 
area Ay of N rectangles and test it for N =3 and N=6. 


48 Draw four rectangles with base 1 below the py = x line, and 
find the total area. What is the area witb N rectangles? 


19 Draw y=sin x from 0 to x. Throe rectangles (base 7/3} 
and six rectangles (base 1/6) contain an arch of the sine func- 
tion, Find the areas and guess the limit. 


20 Draw an example where three lower rectangles under a 
curve (heights m,, m3, m3) have less area than two rectangles. 


21 Draw y= t/x? for 0<x <1 with two rectangles under it 
(base 1/2), What is their area, and what is the area for four 
rectangles? Guess the limit. 


22 Repeat Problem 2! for y = 1/x. 


23 (with calculator) For n(x) = 1)fx take enough rectangles 
over 0<x <1 to convince any reasonable professor that the 
area is 2. Find f(x) and verify that {1} -—f{0) = 2. 


24 Find the area under the parabola v= x? from x=0 to 


x=4, Relate it to the area 16/3 below x. 


25 For pv, and vp, in the figure estimate the areas f(2) and f{4). 
Start witb f[0) = 0. 


2 2 
By (x) 
1 1 P(x) 
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5.3. Summation versus Integration 


v4 (x) 


Di) 


0 10 


26 Draw y = v{x) so that the area f{x) increases until x = 1, 
stays constant to x = 2, and decreases to /{3) = I. 


27 Describe the indefinite integrals of v, and v,. Do the areas 
increase? Increase then decrease? ... 


28 For v,(x) find the area (4) — (1). Draw f(x). 


29 The graph of B(t) shows the birth rate: births per unit time 
at time r. Dit) is the death rate. In what way do these numbers 
appear on the graph? 

1. The change in population from 1 =0 to t = 10. 

2. The time T when the population was largest. 

3. The time t* when the population tncreased fastest. 


30 Draw the graph of a function y,(x) whose area function 
is v4(x). 


31 If v2(x) is an antiderivative of y.(x), draw y.(x). 


32 Suppose v(x) increases from v(0} = 0 to v(3)= 4, The area 
under y= v(x) plus the area on the left side of x =v7'(y) 
equals 
33 True or false, when f(x) is an antiderivative of v(x). 

(a) 2f(x) is an antiderivative of 2v(x) (try examples) 

(b) f(2x) is an antiderivative of v(2x)} 

(c) f{x) + | is an antiderivative of r{(x) + 1 

(d) f{x +1} is an antidertvative of v(x + 1). 

(e) (fx) is an antiderivative of (v(x))*. 


This section does integration the hard way. We find explicit formulas for f, = 
v, + ~ +,. From areas of rectangles, the limits produce the area f(x) under a curve. 
According to the Fundamental Theorem, df/dx should return us to o{(x)—and we 


verify in each case that it does. 


May I recail that there is sometimes an easier way? Hf we can find an f(x) whose 
derivative is v(x), then the integral of v is f, Sums and limits are not required, when f 
is spotted directly. The next section, which explains how to look for f(x}, will displace 
this one. (If we can’t find an antiderivative we fall back on summation.) Given a 
successful f, adding any constant produces another f—since tbe derivative of tbe 
constant is zero. The right constant achieves /(0) = 0, with no extra effort. 
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This section constructs /(.x) from sums. The next section searches for antiderivatives. 
THE SIGMA NOTATION 


In a section about sums, there has to be a decent way to express them. Consider 
1? + 2? + 324 4. The individual terms are v;=j?. Their sum can be written in sum- 
mation notation, using the capital Greek letter Z (pronounced sigma): 


r-% 
12+ 2? + 3? + 4? is written } j?. 
jr 
Spoken aloud. that becomes “the sum of j* from j = 1 to 4.” [t equals 30. The limits 
on j (written below and above £) indicate where to start and stop: 
rn 9 
tote = Yor, and yt +¥9= Dy. {1) 
f=) k=3 
The & at the end of (1) makes an additional point. There is nothing special about the 
letter j. That is a “dummy variable.” no better and no worse than & (or i). Dummy 
variables are only on one side (the side with £), and they have no effect on the sum. 
The upper limit n is on both sides. Here are six sums: 


A 4 

kale S at ey ¥ ¢ Iy=-1+1-1+1=0 

k=1 j=. 

QO ? 
(4-W=1+3-54+74+9=8 Ye = eo | only one term | 

del i=0 

¥ j2=| aningless? Fett t= =? [infin A tae 
if =| meaningless? 2 7” aha = 2 | infinite series” 


The numbers | and n or } and 4 (or 0 und «) are the fower limit and upper limit. 
The dummy variable i or j or & is the index of summation. I hope it seems reasonable 
that the infinite series 1 + 5 + 4 + + adds to 2. We will come back to it in Chapter 10.7 

A sum like £}_,6 looks meaningless, but it is actually 6~6+-" +6=6n, 
It follows the rules. In fact Ee _/° is not meaningless either. Every term is j° and 
by the same rules. that sum is 4)*. However the i was probably intended to be j. 
Then the sum is 1+4+9+ 16= 30. 


Question What happens to these sums when the upper limits are changed to n? 
Answer The sum depends on the stopping point vn. A formula is required (when 
possible). Integrals stop at x. sums stop at 7, and we now look for special cases when 


fix) or f, can be found. 


A SPECIAL SUMMATION FORMULA 


How do you add the first 100 whole numbers’? The problem is to compute 


LO0 
Dd LEP 2e enh 98-09 + 100 = 2 


fet 


+Zeno the Greek believed it was impossible to get anywhere. since he would only go halfway 
and then half again and half again. Infinite sertes would have changed his whole life. 


5.3 Summation versus Integration 


If you were Gauss, you would sce the answer at once, (He solved this problem at a 
ridiculous age, which gave his friends the idea of getting him into another class.) His 
solution was to combine 1 + 100, and 2 + 99, and 3 + 98, always adding to 101. There 
are fifty of those combinations. Thus the sum is (50)(101) = 5050. 

The sum from | to n uses the same idea. The first and last terms add to n+ 1. The 
next terms n— ! and 2 also add to n+ 1. If 7 is even (as 100 was) then there are 4n 
parts. Therefore the sum is $n times 1+ 1: 


n 
¥ jRit2t~ + @—N+n= Sant by (2) 
j= 
The important term is 4n*, but the exact sum is $n? + 4n. 

What happens if n is an odd number (like n = 99)? Formula (2) remains true. The 
combinations | + 99 and 2+ 98 still add to n+ 1= 100. There are 399) = 495 such 


pairs, beeause the middle term (which is 50) has nothing to combine with. Thus 
1+2+-+++99 equais 494 times 100, or 4950. 


Remark That sum had to be 4950, because it is 5050 minus 100. The sum up to 99 
equals the sum up to 100 with the last term removed. Our key formula f, —f,-1 =o 
has turned up again! 

EXAMPLE Find the sum 101 + 102+ --- + 200 of the second hundred numbers. 


First solution This is the sum from 1 to 200 minus the sum from [ to 100: 


200 200 100 
Ue Aa De See Se (3} 
101 i i 


The middie sum is $(200)(201) and the last is $(100)(101). Their difference is 15050. 
Note! 1 left out “j=” in the limits, It is there, but not written. 


Second solution The answer 15050 is exactly the sum of the first hundred numbers 
(which was $050) plus an additional 10000. Believing that a number like 10000 ean 
never turn up by accident, we look for a reason. It is found through changing the 
limits of summation: 


200 100 
>, jis the same sum as }° (k + 100). (4) 
jrio. k=1 


This is important, to be able to shift limits around. Often the lower limit is moved 
to zero or one, for convenience. Both sums have 100 terms (that doesn’t change). The 
dummy variable j is replaced by another dummy variable k. They are related by 
j=k+ 100 or equivalently by k =j— 100, 

The variable must change everywhere—in the lower limit and the upper limit as 
well as inside the sum. If starts at 101, then k =j — 100 starts at 1. If j ends at 200, 
k ends at 100. Ifj appears in the sum. it is replaced by k + 100 (and if j* appeared it 
would become (k + 100)?). 

From equation (4) you see why the answer is 15050. The sum 1 + 2+ --- + 100 is 
5050 as before. 100 is added to each of those 100 terms. That gives 10000. 


EXAMPLES OF CHANGING THE VARIABLE (and the limits) 


2' cquals 


Me 


2/-' (here i=j-— 1). Both sums are 1+2+4+4+8 
1 


TiMe 


0 


" n-3 
>. v, equals }) tj43 (here /=j+3 and j=i— 3). Both sums are v3 + ~- +b. 
i=3 j=0 
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Why change n to n — 3? Because the upper limit is i=n. So j+3=n and j=n-— 3. 

A final step is possible, and you will often see it. The new variable j can be changed 
back te i. Dummy variables have no meaning of their own, but at first the result 
looks surprising: 


$ & 6 
¥ 2 equals ¥ 2/7? equals ¥ 2'-?. 
i=0 j= i=4 


With practice you might do that in one step, skipping the temporary letter j. Every 
ion the left becomes i— 1 on the right. Then i= 0, ..., 5 changes to i= 1, ..., 6. (At 
first two steps are safer.) This may seem a minor point, but soon we will be changing 
the limits on integrals instead of sums. Integration is parallel to summation, and it 
is better to see a “change of variable” here first. 


Note about 1+ 2+: +n. The good thing is that Gauss found the sum n(n + 1). 
The bad thing is that his method looked too much like a trick. I would like to show 
how this fits the fundamental rule connecting sums and differences: 


ifv,tvgte t+o,=f, then v, =f, —f,-3. (5) 


Gauss says that f, is ¢n{m + 1). Reducing n by 1, his formula for f,_; is t(n— 1)n. The 
difference {, ~f,-, should be the last term n in the surn. 


fa ~Sp-n = han + Ide Dn = he tn nt nan. (6) 


This is the one term », =n that is included in f, but not in f,_,. 

There is a deeper point here. For any sum f,, there are two things to check. The 
f’s must begin correctly and they must change correctly. The underlying idea is 
mathematical induction: Assume the staternent is true below n. Prove it for n. 


Goal: To prove that 1+ 2+ -:+n=4n(n+ 1). This is the guess f.. 
Proof by induction. Check f, (it equals 1). Check f, —f,,-, (it equals n). 


For n= 1 the answer 4n(n+ 1)=4-1-2 is correct. For n=2 this formula $-2-3 
agrees with 1+ 2. But that separate test is not necessary! If f, is right, and if the 
change {, — f,-, is right for every n, then {, must be right. Equation (6) was the key 
test, to show that the change in f’s agrees with v. 

That is the logic behind mathematical induction, but I am not happy with most 
of the exercises that use it. There is absolutely no excitement. The answer is given by 
some higher power {like Gauss), and it is proved correct by some lower power {like 
us). It is much better when we lower powers find the answer for ourselves.t Therefore 
I will try to do that for the second problem, which is the sus of squares. 


THE SUM OF /? AND THE INTEGRAL OF x? 


An important calculation comes next. Jt is the area in Figure 5.6. One region is made 
up of rectangles, so its area is a sum of n pieces. The other region lies under the 
parabola v = x*. It cannot be divided into rectangles, and calculus is needed. 

The first problem is to find f, = 12+ 2? +3* + -- +n?. This is a sum of squares, 
with f, = 1 and f, = 5 and f, = 14. The goal is to find the pattern in that sequence. 
By trying to guess f, we are copying what will soon be done for integrals. 

Calculus looks for an f(x) whose derivative is x(x). There fis an antiderivative (or 


+The goal of real teaching is for the student to find the answer. And also the probiem. 
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(nAx)? 


Area 
1274+ 22432=14 


(12+... +7) (Av 


1 2 3=n Ax 1 2 3=nAx 
Fig. 5.6 Rectangles enclosing v = x? have area (47° + $n? + gn)(Ax)* & YnAx)> = 4x3. 


an integral), Aigebra looks for f,’s whose differences produce »,. Here f, could be 
called an antidifference (better to call it a sum). 

The best start is a good guess. Copying directly from integrals, we might try 
f, =4n°. To test if it is right, check whether f, —f,-, produces v, = n?: 


4m? —4(n— 1) = 40? — (nn — 3? + Sn In? at 5. 
We see n*, but also —n + 4. The guess 4n° needs correction terms. To cancel 4 in the 
3 3 


difference, I subtract $1 from the sum. To put back n in the difference, I add 
1+2+---+n=4n(n+ 1) to the sum. The new guess (which should be right) is 


f= yn? t gala t 1) — fa = Gnd + gn? + gn. (7) 


To check this answer, verify first that /, = 1. Also f, =5 and f, = 14. To be certain, 
verify that f, —f,-, =. For calculus the important term is 37°: 


The sum > j? of the first n squares is : n> plus corrections ; n* and : n. 
j=l 

In practice $n° is an excellent estimate. The sum of the first 100 squares is approxi- 
mately 4(100)*, or a third of a million. If we need the exact answer, equation (7) is 
available: the sum is 338,350. Many applications (example: the number of steps to 
solve 100 linear equations) can settle for $n?. 

What is fascinating is the contrast with calculus. Calculus has no correction terms! 
They get washed away in the limit of thin rectangles. When the sum is replaced by 
the integral (the area), we get an absolutely clean answer: 


The integral of u = x? from x =0 to x =n is exactly 4n°. 


The area under the parabola, out to the point x = 100, is precisely a third of a million. 
We have to explain why, with many rectangles. 

The idea is to approach an infinite number of infinitely thin rectangles. A hundred 
rectangles gave an area of 338,350. Now take a thousand rectangles. Their heights 
are (qh), (5), .... because the curve is v= x?. The base of every rectangle is 


Ax = 75, and we add heights times base: 
area of rectangles = Layee + ye tot 1000\"/ 1 
8 10) \10/ " \10/ \10 to J \10}" 


Factor out (75). What you have left is 17+ 2? + +; + £0007, which fits the sum of 
squares formula. The exact area of the thousand rectangles is 333,833.5. I could try 
to guess ten thousand rectangles but I won’t. 


Main point: The area is approaching 333,333.333.... But the calculations are getting 
worse. It is time for algebra—which means that we keep “Ax” and avoid numbers. 
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The interval of length 100 is divided into n pieces of length Ax. (Thus x = 100/Ax.) 
The jth rectangle meets the curve v = x’, so its height is (jAx)?. Its base is Ax, and 
we add areas: 


area = (Ax)*(Ax) + (2Ax)*(Ax} + + + (nAxP(Ax) = = (jAx)?(Ax), (8) 


Factor out (Ax)*, leaving a sum of n squares. The area is (Ax) times f,, and n= ae 


“aeal loo)’, 1 100! 
*) 3\ Ax 2\ Ax 6 


This equation shows what is happening. The leading term is a third of a million, 
as predicted. The other terms are approaching zero’ They contain Ax, and as the 
rectangles get thinner they disappear. They only account for the small corners of 
rectangles that lic above the curve. The vanishing of those corners will eventually be 
proved for any continuous functions—the area from the correction terms goes to 
zero-—but here in equation (9) you see it explicitly. 

The area under the curve came from the central idea of integration: 100/Ax rectan- 
gles of width Ax approach the limiting area = $(100)°. The rectangular area is Xv, Ax. 
The exact area is felx )dx. In the limit X becomes § and v; becomes ix) and Ax 
becomes dx. 


£00 1 
( )] = — 1007+ = 100% Ax)+ > = 1O0AX (9) 
Ax 3 


That completes the calculation for a parabola. it used the formula for a sum of 
squares, which was special. But the underlying idea is much more general. The limit 
of the sums agrees with the antiderivative: The antiderivative of t(x) = x? is fix) =4x°. 
According to the Fundamental Theorem, the area under r(x) 1s f(x): 


19° ex) dx = f(100) — f{0) = 4(100). 


O 
That Fundamental Theorem is not yet proved! I mean it is not proved by us. Whether 
Leibniz or Newton managed to prove it, I am not quite sure. But it can be done, 
Starting from sums of differences, the difficulty is that we have too many limits at 
once. The sums of r;Ax are approaching the integral, The differences Af:Ax approach 
the derivative. A real proof has to separate those steps, and Seetion 5.7 will do it. 
Proved or not, you are seeing the main point. What was true for the numbers /, 
and rv; is true in the limit for e(x) and f(x}. Now e{x) can vary continuously, but it 1s 
still the slope of fix). The reverse of slope is area. 


(1424344)? =154274+3 ey, 
Proof without words by Roger Nelsen (Mathematics Magazine 1990). 


Finally we review the area under r= x. The sum of | ~ 2+ +--+ is $n? + 4n. This 
gives the area of n= 4/Ax rectangles, going out to x= 4. The heights are jAx, the 
bases are Ax, and we add areas: 


4idx 4\? 1/4 
ps ( jAx) (ania? 5 (x) + 3() |-8+ 20 (10) 
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With Ax =1 the area is 1+ 2+3+4=10. With eight rectangles and Ax =4, the 
area was 8 + 2Ax = 9, Sixteen rectangles of width 4 brought the correction 2Ax down 
to 4. The exact area is 8. The error is proportional to Ax. 


Important note There you see a question in applied mathematics. If there is an error, 
what size is it? How does it behave as Ax ~» 0? The Ax term disappears in the limit, 
and {Ax) disappears faster. But to get an error of 10°° we need eight million 
rectangles: 


2Ax = 2+ 4/8,000,000 = 10-°. 


That is horrifying! The numbers 10, 9, 84, 84, ... seem to approach the area 8 in a 
satisfactory way, but the convergence is much too slow. It takes twice as much work 
to get one more binary digit in the answer—which is absolutely unacceptable. Some- 
how the Ax term must be removed. If the correction is (Ax)? instead of Ax, then a 
thousand rectangles will reach an accuracy of 107°. 

The problem is that the rectangles are unbalanced. Their right sides touch the graph 
of v, but their left sides are much too high. The best is to cross the graph in the middle 
of the interval—this is the midpeint rule. Then the rectangle sits halfway across the 
line v = x, and the error is zero. Section 5.8 comes back to this rule—and to Simpson’s 
rule that fits parabolas and removes the (Ax)* term and is built into many calculators. 

Finally we try the quick way, The area under v= x is f= 4x", because df/dx is v. 
The area out to x = 4 is $(4)? = 8. Done. 


l 


unbalanced centered 

Error rectangles Error rectangles 
1/4 1/4 
1/9 9 


t+ $4 $$ 4 4 tt Work 
] 4 9 1 2 


Fig. 5.7 Endpoint rules: error ~ 1/(work) ~ 1/n. Midpoint rule is better: error ~ 1/(work)’. 


Work 


Optional: pth powers Our sums are following a pattern. First, | + --- + nis $n? plus 
gn. The sum of squares is 4n* plus correction terms. The sum of pth powers is 


17+ 2? 4+ +n? = ~ i n’*) plus correction terms. (il) 


The correction involves lower powers of n, and you know what is coming. Those 
corrections disappear in caiculus. The area under pv = x? from 0 to n is 


nett. (12) 


x= 


a nidx 
x? dx= lim JAx)PAXx) = 
| 0 jm Gj vt ) pt+l 
Calculus doesn’t care if the upper limit 7 is an integer, and it doesn’t care if the power 
p is an integer. We only need p + 1 > 0 to be sure n”*' is genuinely the leading term. 
The antiderivative af vp = x? is f= x?*"}(p+ 1). 

We are close to interesting experiments, The correction terms disappear and the 
sum approaches the integral. Here are actual numbers for p= 1, when the sum and 
integral are easy: S, = 1+ +n and I, =[x dx = $n. The difference is D, = yn. The 
thing to watch is the relative error E, = D,/I,: 


n S, I, D,=S,-1, En=0,/I, 
100 5050 5000 50 O10 
200 20100 20000 100 005 
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The number 20100 is $(200)(201). Please write down the next line n = 400, and please 
find a formula for E,. You can guess E, from the table, or you can derive it from 
knowing S, and [,. The formula should show that E, goes to zero. More important, 
it should show how quick (or slow) that convergence will be. 

One more number—a third of a million—was mentioned earlier. It came from 
integrating x? from 0 to 100, whicb compares to the sum $;o9 of 100 squares: 


np S, =} D=S-I E=DiI 
100 2 «=: 338350«S «3333334 50164 01505 
200 2 2686700 26666662 200334 «= .0075125 


These numbers suggest a new idea, to keep n fixed and change p. The computer can 
find sums without a formula! With its help we go to fourth powers and square roots: 


n Pp S=+e tne f=nPttipti) D=S-1  E,,=DiI 
1004 2050333330 4(100)° 50333330 0.0252 
100 4 671.4629 4(100)>/ 4.7963 0.0072 


In this and future tables we don’t expect exact values. The last entries are rounded 
off, and the goal is to sce the pattern. The errors E, , are sure to obey a systematic 
rule—they are proportional to 1/n and to an unknown number C(p) that depends 
on p. | hope you can push the experiments far enough to discover C(p). This is not 
an exercise with an answer in the back of the book— it is mathematics. 


5.3 $s 


Read-through questions 


The Greek letter __© indicates summation. In £{ p, the 
dummy variable is__> _. The limits are __¢ __, so the first 
term is __4 and the last term is _@ . When v,=j this 
sum equals _t . For #= 100 the leading term is _@__. 
The correction term is _4 _. The leading term equals the 
integral of y= x from 0 to 100, which is wntten _! _. The 


sum is the total__! of 100 rectangles. The correction term 
is the area between the __&« andthe __! 


The sum Z/_, i? is the same as Ej, __™ and equals 


«_. Thesum £7_,v,isthesameas © _v,, 4 and equals 
P_. For f,=Zj_, v, the difference f, —f,-1 equals _4 


The formula for 17 +27+--+n? isf,=__! _. To prove 
it by mathematical induction, check f, =__§ and check 
Sa—Sa-1 =__+__. The area under the parabola v = x? from 
x=Otox=9is_ ¥_. This is close to the area of __¥ 
rectangles of base Ax. The correction terms approach zero 
very _ 


4 5 
1 Compute the numbers }° 1/n and 2, (i —3). 
n=1 is 


3 A 
2 Compute YU? —j) and y 1/24. 
j= jet 


3 Evaluate the sum 7 2! and %, 2 
4 Evaluate y (—1)'i and 3 (— 197. 
iS i 

5 Write these sums in sigma notation and compute them: 

24446454100 14345+-4+199 1-44+4-4 
6 Express these sums in sigma notation: 

Bp — Pag + Vy — Vg OW + VgWe to + O,W, Dy +23 + Ys 
7 Convert these sums to sigma notation: 


_ 2m. 4x ; 
ag + a,x + °° +4,x* sin + sin + + ain 2n 


i} jtm—-jy 


yn nA oe x A a n . 
aror=(*\a+(" a Thy +(")p ey bi 


Loo 1000 
9 With electronic help compute ¥ 1/f and ¥ Up. 
7 t 


' 
8 The binomial formula uses coefficients (") tees 


19 10 
10 On a computer find Powys times pa 1/j! 
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11 Simplify ¥ (a, +b)? + ¥ (@,—by to ¥° 
f=1 i=1 1 


n 2 LJ 
12 Show tat ( a) # > a? and Y abt ¥ a, ¥ . 
i=1 t 


® 10 f | 1 
13 “Tele: ” the sums 2k. 2*-1) and — =}. 
SCOPE Ps ( ) an pr a 7) 
All but two terms cancel, 


# 12 
14 Simplify the sums py Uj —fp-1) and 2, Uj41 Fp. 
= < 


1t 


ai 


8 6 9 
15 True or false: (a) & v= 2 Vi-2 =) 2 y= 2d D2 
16 3 i<s 


6 a 

s=0 and 2 ie Py —- 2 
17 The antiderivative of d7f/dx? is dffdx. What is the sum 
(2-2 +fo+a— 2 th) t+ +o~ Wa + for 
18 Induction. Verify that 174+2?+-4+n? is fl= 
nin +1}(2n+18)/6 by checking that f, is correct and 
to —-f,- os n*, ; 
19 Prove by induction: 1+ 3+4- +(2n—1)=n7. 
20 Verify that 13+ 27+ --- +7 is f,=477(n + 1)? by check- 
ing f, and f, —f,-1. The text has a proof without words. 


21 Suppose f, has the form an+bn?+cn?. If you know 
ii=1, R=5, fy= 14, turn those into three equations for 
a, b,c. The solutions a =3, b=4, c=4 give what formula? 


22 Find q in the formula 1° + -- +29 = qn? + correction. 


23 Add n= 400 to the table for S, = 1 + --- + and find the 
relative error E,. Guess and prove a formula for E,,. 


24 Add n= SO to the table for S, = 17 + - + n? and compute 
Eso. Find an approximate formula for £,. 


25 Add p=4 and p=3 to the table for Syo.,= 
1? + --- + 100°. Guess an approximate formula for Ey99.p- 


26 Guess C(p) in the formula E,,, = C(p)/n. 


27 Show that |1 — 5| <|1]+|—5], Always |e, + v2| < Je,| + le2| 
unless 
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28 Let S be the sum 1 + x + x” + -+ of the (infinite) geometric 
series. Then xS =x+x?4x7+- is the same as § minus 

. Therefore § = . None of this makes sense 
if x = 2 because 


2 3 3 
29 The double sum } [3 ira is v= } (1+) plus 
#=1|jS1L f=1 


v2= }, (24+), Compute v, and », and the double sum. 
j= 


re 
30 The double sum } (z wu) is (Wy) + Wi,2 + #13) + 
i 


3/2 
The double sum J} (3 Ww; :) is 
i 
(Wa + 21) + (W12 + W2,2} + 


31 Find the flaw in the proof that 2"=1 for every 
n=0,1,2,.... For »=0 we have 2°=1. If 2"=1 for every 
n<N, then 2% = 2"~'.2%-ligN-2 = 1/1 =1, 


32 White out all terms to see why the following are true: 
3 3 2/3 2 3 
Faye ¥(5 uos)=($H)(4) 
1 1 i=1 \i=l t 1 


33 The average of 6, 11, 4 is 5=4(64+11+4). Then 
(6—p+(11-—)+(4-a)= . The average of 
V1, ---, 0, S b= . Prove that &(v;—v)=0. 


A 2 A fn 

34 The Schwarz inequality is (5 ab <(§ a? ) (5 Ht) 
i i 

Compute both sides if a, =2, a,=3, b,=1, b, =4. Then 


compute both sides for any 4,, a2, b;, b2. The proof in 
Section 11.1 uses vectors. 


. Compare. 


35 Suppose n rectangles with base Ax touch the graph of v(x) 
at the points x = Ax, 2Ax, ..., nAx. Express the total rectan- 
gular area in sigma notation. 


% If 1/Ax rectangles with base Ax touch the graph of u(x} 
at the Jeft end of each interval (thus at x =0, Ax, 2Ax,...) 
express the total area in sigma notation. 


1jax (JA — a4 
37 The sum Ax . SGAx) wv )Ax)} 
Pa 


s 
In the limit this becomes [} dx = 


equals 


This section integrates the easy way, by looking for antiderivatives. We leave aside 
sums of rectangular areas, and their limits as Ax +0. Instead we search for an f{{x) 
with the required derivative v(x). In practice, this approach is more or less indepen- 
dent of the approach through sums—but it gives the same answer. And also, the 
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search for an antiderivative may not succeed. We may not find f. In that case we go 
back to rectangles, or on to something better in Section 5.8. 

A computer is ready to integrate v, but not by discovering /. It integrates between 
specified limits, to obtain a aumber (the definite integral). Here we hope to find a 
function (the indefinite integral). That requires 2 symbolic integration code like 
MACSYMA or Mathematica or MAPLE, or a reasonably nice v(x), or both. An 
expression for f(x) can have tremendous advantages over a list of numbers. 

Thus our goal is to find antiderivatives and use them. The techniques will be further 
developed in Chapter 7—this section is short but good. First we write down what 
we know. On each line, f(x) is an antiderivative of v(x) because df /dx = v(x). 


Known pairs Function v(x) Antiderivative {{x) 


Powers of x — Mt )it+c 


n= —1 is not included, because n+1 would be zero. v= x‘ will lead us to f= In x. 


Trigonometric functions cos x sinx+C 
sin x —cos x+C 

sec? x tanx+C 

esc? x —cotx+C 

sec x tan x sec x +C 

csc x cot x —osc x +C 

Inverse functions 1} J1i-x? sin-'x+C 

1/(1 + x?) tan 'x+C 


1/|x{./x? -— 1 sec ix +C 


You recognize that each integration formula came directly from a differentiation 
formula. The integral of the cosine is the sine, because the derivative of the sine is 
the cosine. For emphasis we list three derivatives above three integrals: 


d d d fx"*} i 
Fa rd ae 1 =(55) =x 
tt 
| ea=e [ tarax4e | va +C 
nt] 


There are two ways to make this list longer. One is to find the derivative of a new 
F(x). Then f goes in one column and v = df/dx goes in the other column.t The other 
possibility is to use rules for derivatives to find rules for integrals. That is the way to 
extend the list, enormously and easily, 


RULES FOR INTEGRALS 
Among the rules for derivatives, three were of supreme importance. They were /inear- 


ity, the product rufe, and the chain rule. Everything flowed from those three, In the 


tWe will soon meet e*, which goes in both columns. It is f(x} and also s(x). 
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reverse direction (from t to f) this is still true. The three basic methods of differential 
calculus also dominate integral calculus: 


linearity of derivatives — linearity of integrals 
product rule for derivatives > integration by parts 
chain rule for derivatives — integrals by substitution 
The easiest is linearity, which comes first. Integration by parts will be left for 


Section 7.1. This section starts on substitutions, reversing the chain rule to make an 
integral simpler. 


LINEARITY OF INTEGRALS 


What is the integral of v(x)— w(x)? Add the two separate integrals. The graph of 
u+w has two regions below it, the area under « and the area from v to v+w. 
Adding areas gives the sum rule. Suppose f and g are antiderivatives of v and w: 


sum rule: ft+g isan antiderivative of ctw 
constant rule: cf is an antiderivative of ce 
linearity: af+ bg is an antiderivative of ay + bw 


This is a case of overkill. The first two rules are special cases of the third, so logically 
the last ruic is enough, However it is so important to deal quickly with constants— 
just “factor them out” —that the rule cu+cf is stated separately. The proofs come 
from the linearity of derivatives: (af+ bgY equals af'+ bg’ which equals av + bw. 
The rules can be restated with integral signs: 


sum rule: f [ eve) + w(x) | dx = [ v(x) dx + [ w(x) dx 
constant rule: J ce(x) dx =c j u(x) dx 
linearity: j [ at(x}— bw(x) | dx =a J u(x) dx+ 6 f w(x) dx 


Nete about the constant in {(x)-~ C. All antiderivatives allow the addition of a con- 
stant. For a combination like az{x)+ bw(x), the antiderivative is af(x) + be(x) + C. 
The constants for each part combine into a single constant. To give all possible antide- 
rivatives of a function, just remember to write “+ C” after one of them. The real 
problem is to find that one antiderivative. 


EXAMPLE 4 The antiderivative of c= x? +x°7 is fHe3t(x MWCD+C. 


EXAMPLE 2 The antiderivative of 6 cos +7 sin? is 6sint—7cost+C. 


I-sinx  1-sinx 


EXAMPLE 3 Rewrite = sec? x — sec x tan x. 


af 4s Pas | — 3 
1+ sin x 1—sin“x cos’ x 


The antiderivative is tan x — sec x + C, That rewriting is done by a symbolic algebra 
code (or by you). Differentiation is often simple, so most people check that df:dx = v(x}. 


Question How to integrate tan? x? 
Method Write itassec’x-1. Answer tanx—x+C. 
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INTEGRALS BY SUBSTITUTION 


We now present the most valuable technique in this section—-sudstitution. To see the 
idea, you have to remember the chain rule: 


f(g(x}) has derivative _f'(e(x})(dg/dx) 
sin x? has derivative (cos x”}(2x) 
(x3+1)> has derivative 5(x>+ 1)*(3x7} 


If the function on the right is given, the function on the left is its antiderivative! There 
are two points to emphasize right away: 


1. Constants are no problem—they can always be fixed. Divide by 2 or 15: 
1 
| x cos (x?) dx = 5 sin(x?)+C | x?(x3 + tt dx = id +1P+C 


Notice the 2 from x*, the 5 from the fifth power, and the 3 from x?. 

2. Choosing the inside function g (or u) commits us to its derivative: 
the integral of 2x cos x? is sin x? +C (g=x?, dg/dx = 2x) 
the integral of cos x? is (failure) (no dg/dx) 
the integral of x? cos x? is (failure) (wrong dg/dx) 


To substitute g for x”, we need its derivative. The trick is to spot an inside function 
whose derivative is present. We can fix constants like 2 or 15, but otherwise dg/dx 
has to be there. Very often the inside function g is written u. We use that letter to state 
the substitution rule, when f is the integral of v: 


[ sy $ te = Faony + 6 (1) 


EXAMPLE 4 f sin x cos x dx = 3(sin x)? +C u=sin x (compare Example 6) 
EXAMPLES [ sin?x cos x dx=4(sinx+C u=sinx 

EXAMPLE 6 { cos x sin x dx = — }(cos x)? +C u=cos x (compare Example 4} 
EXAMPLE 7 f tan* x sec*x dx=4(tanx))+C wu=tanx 

The next example has u = x* — 1 and du/dx = 2x. The key step is choosing u: 
EXAMPLE 8 {x dx//x?-1= /x?-1+C€ fx fe? = 1 de = 40x? - 17 + 
A shift of x (to x + 2) or a multiple of x (rescaling to 2x) is particularly easy: 


EXAMPLES 9-40 f(x +2)? dx=4(x+2)4+C =f cos 2x dx =} sin 2x + C 


You will soon be able to do those in your sleep. Officially the derivative of {x + 2)* 
uses the chain rule. But the inside function u = x + 2 has dujdx = 1. The “1” is there 
automatically, and the graph shifts over—as in Figure 5.8b. 

For Example 10 the inside function is u = 2x, Its derivative is du/dx = 2. This 


5.4 Indefinite Integrals and Substitutions 


0 2 0 | 
Fig. 5.8 Substituting u=x +1 and u=2x and u=x?. The last graph has half of du/dx = 2x. 


required factor 2 is missing in f cos 2x dx, but we put it there by multiplying and 
dividing by 2. Check the derivative of 4 sin 2x: the 2 from the chain rule cancels the 
4. The rule for any nonzero constant is similar: 


| v(x + c) dx =f(x + c) and | v(cx) dx = - Sf (cx). (2) 
Squeezing the graph by c divides the area by c. Now 3x + 7 rescales and shifts: 


EXAMPLE 14 =f cos(3x+7) dx=}sin(3x+7)+C ff (3x+7)? dx=$-43xt+7P+C 


Remark on writing down the steps When the substitution is complicated, it is a good 
idea to get du/dx where you need it. Here 3x? + 1 needs 6x: 


7 
7x(3x? + 1)* dx = : (3x? + 1)*6x dx=— | u* ai dx 
6 6 dx 
Tu? 7 (3x? + 1)9 
: scree koe Sn 
Now integrate 65 Cc 6 5 +C (3) 


Check the derivative at the end. The exponent 5 cancels 5 in the denominator, 6x from 
the chain rule cancels 6, and 7x is what we started with. 


Remark on differentials n place of (du/dx) dx, many people just write du: 
f (3x? + 1)*6x dx = [ u* du=4ue+C. (4) 


This really shows how substitution works. We switch from x to u, and we also switch 
from dx to du. The most common mistake is to confuse dx with du. The factor du/dx 
from the chain rule is absolutely needed, to reach du. The change of variables (dummy 
variables anyway!) leaves an easy integral, and then u turns back into 3x? + 1. Here 
are the four steps to substitute u for x: 


1. Choose u(x) and compute du/dx 

2. Locate v(u) times du/dx times dx, or v(u) times du 
3. Integrate f v(u) du to find f(u) + C 

4. Substitute u(x) back into this antiderivative f. 


EXAMPLE 42 f(cos \/x)dx/2,/x=f cos u du= sin u+C =sin \/x+C 


(put in u) (integrate) (put back x) 


The choice of u must be right, to change everything from x to u. With ingenuity, 
some remarkable integrals are possible. But most will remain impossible forever. The 
functions cos x? and 1/,/4— sin? x have no “elementary” antiderivative. Those integ- 
rals are well defined and they come up in applications—the latter gives the distance 
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around an ellipse. That can be computed to tremendous accuracy, but not to perfect 


accuracy. 


The exercises concentrate on substitutions, which need and deserve practice. We 
give a nonexample—{ (x? + 1)? dx does not equal 4(x? + 1)?—to emphasize the need 
for du/dx. Since 2x is missing, u = x? + 1 does not work. But we can fix up x: 


; ; d 1 1 
[ sin ax | sin w= ~ 7 co ut C=——cosmx+tC. 
n T 4 


4 RCISES 
Read-through questions 25 dy/dx = l/y 26 dy/dx = x/y 
Finding integrals by substitution is the reverse of the 9 27 d*y/dx? =1 28 d>y{/dx° = 1 
rule. The derivative of (sin x}> is__© __. Therefore the antide- 29 d?y/dx? =—y W dyjdx = ey 


rivativeof © is 4 . To compute f (1 + sin x)? cos x dx, 
substitute w=_@ . Then du/dx=_ ft so substitute 
du=__9 _. In terms of u the integral is f_h =_ 1! 

Returning to x gives the final answer. 


The best substitutions for f tan (x + 3) sec?(x + 3)dx 
and [(x7+1)'°xdx are u=_ 1 and wu=_k_. Then 
du=__! and __™_. The answers are_" and 9 
The antiderivative of vdvjdx is _P . [2x dx/(1+x’) 
leads to {_& , which we don’t yet know. The integra! 


J dx/(1 + x?) is known immediately as __' 


Find the indefinite integrals in 1-20. 
1§ /24+xdx (add+C) 25 /3—xdx (always+C) 
3 f (xt 1)" dx 4 f(x+1jp "dx 
5 f (x7 4 1x dx 6 § /1—3x dx 
7 [ cos? x sin x dx 8 { cos x dx/sin® x 
9 [ cos? 2x sin 2x dx 
W far /1-? 
3 [Pay /1+e 
15 f (1+ /x) dx//x 


17 [ sec x tan x dx 


10 f cos? x sin 2x dx 
12 fe /1—# ae 
14 fe /1—e ae 
16 § (1-+x3)/x dx 
18 f sec? x tan? x dx 


19 [ cos x tan x dx 20 j sin? x dx 


In 21-32 find a function y(x} that solves the differential 
equation. 
2 dyjdx =x? + \/x 


23 dy/dx = /t —2x 


22 dy/dx=y? (try y=cx") 


24 dy/dx =t/./1—2x 


31 d*y/dx? = \/x 32 (dy/dx)? = \/x 

33 True or false, when f is an antiderivative of v: 
(a) § v(u(x)) dx =f(u(x)) + C 
(b) J o7(x) dx =4 (x) + 
(c} J vf(x\(du/dx) dx =f(u(x))4+C 
(d) [ ofx)(dv/dx) dx =4f?(x) + C 

34 True or faise, when f is an antiderivative of v: 
(a) J f(x)(do{dx) dx =4f7(x)+C 
(b) J vf{v(x))(do/dx) dx =f{v(x))+C 
(c} Integral is inverse to derivative so f(v(x)) =x 
(d) Integral is inverse to derivative so { (df /dx) dx = f(x) 


35 If df/dx=v(x) then fv(x—1}dx= and 
J o(x/2) dx = ; 
M If dffdx=wx) then [v(2x-thdx= and 
f o(x?)x dx = : 
x? | x? dx 
37 = = 
1+x? ae [5S 


3B f(x? + 1)? dx is not $(x? + 1)° but 
39 [ 2x dx/{x? + lis J dy which will soon be In u. 
40 Show that [2x3 dx/1+x?P=[(u—-'dujwe=_ 


41 The acceleration d*f/dt? =9.8 gives f(t)= (two 
integration constants). 


42 The solution to d* y/dx* = 0 is {four constants). 


43 If f(t) is an antiderivative of v(t), find antiderivatives of 
(a) v(t +3) (b) v{t)+3 (Cc) 3v(t) (d) v(3e). 
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Eee §65.5 The Definite Integral 


The integral of v(x) is an antiderivative {(x) plus a constant C. This section takes 
two steps. First, we choose C. Second, we construct f(x). The object is to define the 
integrai—in the most frequent case when a suitable f(x) is not directly known. 

The indefinite integral contains “+ C.” The constant is not settled because f(x) + C 
has the same slope for every C. When we care only about the derivative, C makes 
no difference. When the goal is a numher—a definite integral—C can be assigned a 
definite value at the starting point. 

For mileage traveled, we subtract the reading at the start. This section does the 
same for area. Distance is f(f) and area is f(x)—while the definite integral is 
f(b) — f(a). Don’t pay attention to t or x, pay attention to the great formula of integral 
caiculus: 


b 5 
| v(t) dt = | v(x) dx = f(b) — f(a). (L) 


Viewpoint 1: When { is known, the equation gives the area from a to b. 
Viewpoint 2: When f is not known, the equation defines f from the area. 


For a typical p(x), we can’t find f(x) by guessing or substitution. But still v(x) has an 
“area” under its graph—and this yields the desired integral {(x). 

Most of this section is theoretical, leading to the definition of the integral. You 
may think we should have defined integrals before computing them—which is logi- 
cally true. But the idea of area (and the use of rectangles) was already pretty clear in 
our first examples. Now we go much further. Every continuous function v(x) has an 
integral (also some discontinuous functions). Then the Fundamental Theorem com- 
pletes the circle: The integral leads back to df/dx = v(x). The area up to x is the 
antiderivative that we couldn't otherwise discover. 


THE CONSTANT OF INTEGRATION 


Our goal is to turn f(x)+ C into a definite integral—the area between a and b. The 
first requirement is to have area = zero at the start: 


J(a)+ C=starting area=0 so C=—/ff(a). (2) 
For the area up to x (moving endpoint, indefinite integral), use t as the dummy variable: 
the area from a to x is [* v(t) dt =f(x)—f(a) (indefinite integral) 
the area from a to b is {? v(x) dx=f(b)—f(a) (definite integral) 
EXAMPLE 4 The area under the graph of 5(x + 1)* from a to b has f(x)=(x + 1)°: 
§ 5c + 1) dx = (x + 1) [= 4 8 - (2+ 


The calculation has two separate steps—first find f(x), then substitute b and a. After 
the first step, check that df/dx is v. The upper limit in the second step gives plus f(b), 
the lower limit gives minus f(a). Notice the brackets (or the vertical bar): 


feok=fo-f@ Ah=8-1 — [cos x] = cos 2-1. 


Changing the example to f(x)=(x + 1)° — 1 gives an equally good antiderivative— 
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and now f(0}=0. But f(b) — f(a) stays the same, because the —! disappears: 
[eet 8-1 P= (b+ DS- )- (a+ 1) - N=O+ NF - (a4 DS. 


EXAMPLE 2 When v = 2x sin x? we recognize {= — cos x”. The area from 0 to 3 is 


. 3 
J 2x sin x? dx = — cos f= —cos 9+ cos 0. 


The upper limit copies the minus sign. The lower limit gives —{— cos 0), which is 
+cos 0. That example shows the right form for solving exercises on definite integrals. 


Example 2 jumped directly to f(x) = —cos x?. But most problems involving the 
chain rule go more slowly—by substitution. Set u=x?, with du/dx = 2x: 


3 3 Pe * 
| 2x sin x? dx | sin ue dx | sin u du. (3) 
dx . 


7) 0 


We need new limits when u replaces x?. Those limits on u are a” and b7. (In this case 
a* = 0? and b* = 3* = 9.) If x goes from a to b, then u goes from u(a) to u(b). 


6 du u{d} 
| {u(x)) a dx = | , v(u) du = f(u(b)) — f(u(a)). (4) 


1 6 du u* ty 6* 54 
EXAMPLE 3 24 §)>x dx = oS ee Pe ee, 
{_.« + 5)’x dx iz 5 “| go. 3 
In this case u = x? + $. Therefore du/dx = 2x (or du = 2x dx for differentials). We have 
to account for the missing 2. The integral is fu*. The limits on u=x?+5 are 
u(0)= 07+ 5 and u(1)= 17+ 5. That is why the w-integral goes from 5 to 6. The 
alternative is to find f(x} = ¢(x? + 5)* in one jump (and check it). 


EXAMPLE 4 So sin x? dx = 7? (no elementary function gives this integral). 


If we try cos x’, the chain rule produces an extra 2x—no adjustment will work. Does 
sin x? still have an antiderivative? Yes! Every continuous v(x) has an f(x). Whether 
f(x) has an algebraic formula or not, we can write it as { v(x) dx. To define that 
integral, we now take the limit of rectangular areas. 


INTEGRALS AS LIMITS OF “RIEMANN SUMS” 


We have come to the definition of the integral. The chapter started with the integrals 
of x and x’, from formulas for 1+ --- + mand 17+ «.- +n”, We will not go hack to 
those formulas. But for other functions, too irregular to find exact sums, the rectangu- 
lar areas also approach a limit. 

That limit is the integral. This definition is a major step in the theory of calculus. 
It can be studied in detail, or understood in principle. The truth is that the definition 
js not so painful—you virtually know it already. 


Problem Integrate the continuous function v(x) over the interval [a, 5]. 
Step 1 Split [a, b] into n subintervals (a, x, J, (x1, 2]. -.., [x,-4, D1. 


The “‘meshpoints” x,, xz, ... divide up the interval from a to b. The endpoints are 
X9 =a and x,=b. The length of subinterval k is Ax, =x, — x,—,. In that smaller 
interval, the minimum of v(x) is m,. The maximum is M,. 
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Now construct rectangles. The “/ower rectangle” over interval k has height m,. The 
“upper rectangle” reaches to M,. Since v is continuous, there are points Xj, ANd Xmax 
where v = m, and v= M, (extreme value theorem). The graph of v(x) is in between. 

Important: The area under v(x) contains the area “‘s” of the lower rectangles: 


(i v(x) dx >m, Ax, + m,Ax, +. --- +m,Ax, = s. (5) 
The area under v(x) is contained in the area “S” of the upper rectangles: 
f° v(x) dx <M, Ax, + M, Ax, + »» +M,Ax, = S. (6) 


The lower sum s and the upper sum S were computed earlier in special cases—when 
v was x or x? and the spacings Ax were equal. Figure 5.9a shows why s < area <S. 


my, M, 


Xp Bo x k+l a X, b 


Fig. 5.9 Area of lower rectangles =s. Upper sum S includes top pieces. Riemann sum S* is in between. 


Notice an important fact. When a new dividing point x’ is added, the lower sum 
increases. The minimum in one piece can be greater (see second figure) than the 
original m,. Similarly the upper sum decreases. The maximum in one piece can be 
below the overall maximum. As new points are added, s goes up and S comes down. 
So the sums come closer together: 


‘tr & < iS’ a: S. (7) 


I have left space in between for the curved area—the integral of v(x). 

Now add more and more meshpoints in such a way that Ax,,,, +0. The lower 
sums increase and the upper sums decrease. They never pass each other. If v(x) is 
continuous, those sums close in on a single number A. That number is the definite 
integral—the area under the graph. 


DEFINITION The area A is the common limit of the lower and upper sums: 

s>Aand SA as Ax,,,, > 0. (8) 
This limit A exists for all continuous v(x), and also for some discontinuous functions. 
When it exists, A is the ‘Riemann integral” of v(x) from a to b. 


REMARKS ON THE INTEGRAL 


As for derivatives, so for integrals: The definition involves a limit. Calculus is built 
on limits, and we always add “‘if the limit exists.” That is the delicate point. I hope 
the next five remarks (increasingly technical) will help to distinguish functions that 
are Riemann integrable from functions that are not. 


Remark I The sums s and S§ may fail to approach the same limit. A standard 
example has V(x) = 1 at all fractions x = p/q, and V(x) = 0 at all other points. Every 
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interval contains rational points (fractions) and irrational points (nonrepeating deci- 
mals). Therefore m, =0 and M,= 1. The lower sum is always s=0. The upper sum 
is always S = b— a(the sum of 1’s times Ax’s). The gap in equation (7) stays open. This 
function V(x) is not Riemann integrable. The area under its graph is not defined (at 
least by Riemann—see Remark 5). 


Remark 2 The step function U(x) is discontinuous but still integrable. On every 
interval the minimum m, equals the maximum M,—except on the interval containing 
the jump. That jump interval has m,=0 and M, = 1. But when we multiply by Ax,, 
and require Ax,,,, > 0, the difference between s and S goes to zero. The area under 
a step function is clear—the rectangles fit exactly. 


Remark 3 With patience another key step could be proved: [fs A and S > A for 
one sequence of meshpoints, then this limit A is approached by every choice of mesh- 
points with Ax,,,, > 0. The integral is the lower bound of all upper sums S, and it is 
the upper bound of all possible s—provided those bounds are equal. The gap must 
close, to define the integral. 

The same limit A is approached by “in-between rectangles.”’ The height v(x*) can 
be computed at any point x in subinterval k. See Figures 5.9c and 5.10. Then the 
total rectangular area is a “Riemann sum” between s and S: 


* = y(x*)Ax, + vo(x¥)Ax, + --- + v(x*)Ax,. (9) 


We cannot tell whether the true area is above or below S*. Very often A is closer to 
S* than to s or S. The midpoint rule takes xf in the middle of its interval (Figure 5.10), 
and Section 5.8 will establish its extra accuracy. The extreme sums s and S are used 
in the definition while S* is used in computation. 


rc 


right mid min 


Fig. 5.40 Various positions for xf in the base. The rectangles have height v(xf). 


Remark 4 Every continuous function is Riemann integrable. The proof is optional (in 
my class), but it belongs here for reference. It starts with continuity at x*: “For any 
é there isa ...."” When the rectangles sit between x* — 6 and x* + 6, the bounds M, 
and m, differ by less than 2e. Multiplying by the base Ax,, the areas differ by less 
than 2e(Ax,). Combining all rectangles, the upper and lower sums differ by less than 
2e(Ax, + Ax, + --- + Ax,) = 2e(b — a). 

As ¢—0 we conclude that S comes arbitrarily close to s. They squeeze in on a 
single number A, The Riemann sums approach the Riemann integral, if v is continuous. 

Two problems are hidden by that reasoning. One is at the end, where S and s come 
together. We have to know that the line of real numbers has no “holes,” so there is 
a number A to which these sequences converge. That is true. 


Any increasing sequence, if it is bounded above, approaches a limit. 


The decreasing sequence S, bounded below, converges to the same limit. So A exists. 
The other problem is about continuity. We assumed without saying so that the 
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width 26 is the same around every point x*. We did not allow for the possibility that 
6 might approach zero where v(x) is rapidly changing—in which case an infinite 
number of rectangles could be needed. Our reasoning requires that 


v(x) is uniformly continuous: 6 depends on & but not on the position of x*. 


For each ¢ there is a 6 that works at all points in the interval. A continuous function 
on a closed interval is uniformly continuous. This fact (proof omitted} makes the 
Teasoning correct, and v(x) is integrable. 

On an infinite interval, even v = x? is not uniformly continuous. It changes across 
a subinterval by (x* + 5)? — (x* — 5)? = 4x*6. As x* gets larger, 5 must get smaller— 
to keep 4x*d below ec. No single 6 succeeds at all x*. But on a finite interval [0, 5], 
the choice 6 = ¢/4b works everywhere—so v = x? is uniformly continuous. 


Remark § If those four remarks were fairly optional, this one is totally at your 
discretion. Modern mathematics needs to integrate the zero-one function V(x) in the 
first remark. Somehow V has more 0’s than 1’s. The fractions (where V(x) = 1) can 
be put in a list, but the irrational numbers (where V(x) = 0) are “uncountable.” The 
integral ought to be zero, hut Riemann’s upper sums all involve M, = 1. 

Lebesgue discovered a major improvement. He allowed infinitely many subintervals 
(smaller and smaller). Then all fractions can be covered with intervals of total width 
e. (Amazing, when the fractions are packed so densely.) The idea is to cover 1/q, 2/q, 
..., 4/g by narrow intervals of total width 2/2’. Combining all q = 1, 2, 3, ..., the total 
width to cover all fractions is no more than e($+}4+4+t ---)=6. Since Vix)=0 
everywhere else, the upper sum S is only e. And since e was arbitrary, the “Lebesgue 
integral” is zero as desired. 


That completes a fair amount of theory, possibly more than you want or need— 
but it is satisfying to get things straight. The definition of the integral is still being 
studied by experts (and so is the derivative, again to allow more functions). By 
contrast, the properties of the integral are used hy everybody. Therefore the next 
section turns from definition to properties, collecting the rules that are needed in 
applications. They are very straightforward. 
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Read-through questions 
In fF oft) dt =f(x) +, the constant C equals _a__. Then 


at x=a the integralis_b . At x = b the integral becomes 
¢__. The notation {(x)]; means __¢_. Thus cos x)* equals 
e__. Also [cos x + 3], equals _ ¢_, which shows why 
the antiderivative includes an arbitrary _g _. Substituting 
u=2x—1 changes ff \/2x—1 dx into _h (with limits 
on wu). 


The integral f° v(x) dx can be defined for any | func 


tion v{x), even if we can’t find a simple __| __. First the mesh- 
points x,,X2,... divide [a,b] into subintervals of length 
Ax, =__k__. The upper rectangle with base Ax, has height 


M,=__t_. The upper sum §$ is equal to _m_. The lower 


sum sis .n_. The © __ is between s and S. As more 
meshpoints are added, $__p and s_q_. If S and s 


approach thesame__r__, that defines the integral. The inter- 
mediate sums S*, named after __s__, use rectangles of height 
v(x). Here xf is any point between __t_, and S*¥=__u 
approaches the area. 
If v(x) =df/dx, what comstants C make 1-10 true? 

1 J) o(x) dx =f(b)+C 

2 ff o(x) dx =f(4)+C 

3 §? v(t) dt=-f(X)+C 

4 (ee v(sin x) cos x dx=f(sin b)+C 

5 fi ot) dt =f(t)+C (careful) 

6 dffdx = v(x) +C 

795 (x? — 2x dx =fE, wh de. 
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8 fF v(t) dt =f(x*) + € 
9 §* (—x) dx= C (change —. to t; also dx and limits) 
10 fF v(x) dx =C J, v(2t) dt. 


Choose u(x) in 11-18 and change Limits. Compute the integral 
in 11-16. 


11 fy (x? + 1)19x dx 
13 fr’* tan x sec? x dx 
18 f%’* sec? x tan x dx 


17 f? dx/x (take w= 1/x) 


12 f5” sin’ x cos x dx 
14 f5 xt? dx (lake u= x?) 


16 5 x dx//1— x? 


18 [} x3(1—x) dx (@=1—x) 


With Ax =} in 19-22, find the maximum M, and minimum 
m, amd upper and lower sums S and s. 


19 {5 (x? + It dx 20 f} sin 2nx dx 
21 f3 x dx 22 fi, xdx. 


23 Repeat 19 and 20 with Ax =} and compare with the cor- 
Tect answer, 


24 The difference § —s in 2! is the area 2° Ax of the far right 
rectangle. Find Ax so that S < 4.001. 


25 If v(x) is increasing for a < x < b, the difference S — s is the 
area of the rectangle minus the area of the 
rectangle. Those areas approach zero. So every increasing 
function on [a,b] is Riemann integrable. 


5.6 Properties of the Integral and Average Value 


26 Find the Riemann sum S* for V(x) in Remark 1, when 
Ax = 1/n and each x#¥ is the midpoint. This S* is well-behaved 
but stil} V(x) is not Riemann integrable. 


27 W(x) equals 1 at x=4,4,4,..., and elsewhere W(x) =0. 
For Ax =,01 find the upper sum S. Is W(x) integrable? 
28 Suppose M(x) is a multistep function with jumps of 4, 4, 
4, ... at the points x =4, 3, 4, .... Draw a rough graph with 
M(0)=0 and M(1)= 1. With Ax =} find § and s. 
29 For M(x) in Problem 28 find the difference S —s (which 
approaches zero as Ax +0). What is the area under the 
graph? 
30 If dffdx = —v(x) and f(1)=0, explain f(x)=f! v(t) de. 
31 (a) If df/dx = +0(x) and (0) =3, find f(x). 

(b) If df/dx = +(x) and f(3)=0, find f(x). 


32 In your own words define the integral of {x} from a to b. 


33° True or false, with reason or example. 
(a) Every continuous v(x) has an antiderivative f(x). 
(b) If v(x) is not continuous, S and s approach different 
limits. 
(c) If S and s approach A as Ax +0, then all Riemann 
sums $* in equation (9) also approach A. 
(d) If oy (x)+02{x)=03(x), their upper sums satisfy 
$,+58,=53. 
{e) If v,(x)+ vz(x} =v5{x), their Riemann sums at the 
midpoints xf satisfy Sf +S} =S4. 

’ (f) The midpoint sum is the average of S and s. 

(g) One xf in Figure 5.10 gives the exact area. 


The previous section reached the definition of r p(x) dx. But the subject cannot stop 
there. The integral was defined in order to be used. Its properties are important, and 
its applications are even more important. The definition was chosen so that the 
integral has properties that make the applications possible. 

One direct application is to the average value of v(x). The average of n numbers is 
clear, and the integral extends that idea—it produces the average of a whole contin- 
uum of numbers v(x). This develops from the last rule in the following list (Property 
7), We now collect together seven basic properties of definite integrals. 

The addition rule for { (p(x) + w(x)] dx will not be repeated—even though this 
property of linearity is the most fundamental. We start instead with a different kind 
of addition. There is only one function v(x), but now there are two intervals. 

The integral from a to b is added to its neighbor from b to c. Their sam is the integral 
from a to c. That is the first (not surprising) property in the list. 


5.6 Properties of the Integral and Average Value 


Property 1 Areas over neighboring intervals add to the area over the combined 
interval: 
J v(x) dx + ff v(x) dx = ff v(x) dx. (1) 


This sum of areas is graphically obvious (Figure 5.11a). It also comes from the formal 
definition of the integral. Rectangular areas obey (1)—with a meshpoint at x = b to 
make sure. When Ax,,,, approaches zero, their limits also obey (1). All the normal 
rules for rectangular areas are obeyed in the limit by integrals. 

Property 1 is worth pursuing. It indicates how to define the integral when a= b. 
The integral ‘‘from b to b” is the area over a point, which we expect to be zero. It is. 


Property 2 i v(x) dx = 0. 


That comes from Property 1 when c = b. Equation (1) has two identical integrals, so 
the one from b to b must be zero. Next we see what happens if c = a—which makes 
the second integral go from b to a. 

What happens when an integral goes backward? The “lower limit” is now the larger 
number b. The “upper limit” a is smaller. Going backward reverses the sign: 


Property 3 ff v(x) dx = — f° v(x) dx = f(a) — f(b). 


Proof When c=a the right side of (1) is zero. Then the integrals on the left side 
must cancel, which is Property 3. In going from b to a the steps Ax are negative. That 
justifies a minus sign on the rectangular areas, and a minus sign on the integral 
(Figure 5.11b). Conclusion: Property 1 holds for any ordering of a, b, c. 


0 3 i) 2 
EXAMPLES | 1? dt = — ~— | dt=-1 | B wg 
x 3 1 2 t 


Property 4 For odd functions {*_, v(x) dx =0. “Odd” means that v(— x) = — v(x). 
For even functions f*, v(x) dx =2 9 v(x) dx. “‘Even’’ means that v(— x) = + v(x). 


The functions x, x*, x5, ... are odd. If x changes sign, these powers change sign. The 
functions sin x and tan x are also odd, together with their inverses. This is an impor- 
tant family of functions, and the integral of an odd function from —a to a equals zero. 
Areas cancel: 


ft. 6x5 dx = x® iin = a® —(—a)®=0. 


If v(x) is odd then f(x) is even! All powers 1, x”, x*,... are even functions. Curious 
fact: Odd function times even function is odd, but odd number times even number is 
even. 

For even functions, areas add: f* , cos x dx = sin a — sin(— a) = 2 sin a. 


Fig. 5.144 Properties 1-4: Add areas, change sign to go backward, odd cancels, even adds. 
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The next properties involve inequalities. If v(x) is positive, the area under its graph 
is positive (not surprising). Now we have a proof: The lower sums s are positive and 
they increase toward the area integral. So the integral is positive: 


Property5 If v(x)>0 for a<x<b then E v(x) dx > 0. 


A positive velocity means a positive distance. A positive v lies above a positive area. 
A more general statement is true. Suppose v(x) stays between a lower function |(x) 
and an upper function u(x). Then the rectangles for v stay between the rectangles for | 
and u. In the limit, the area under v (Figure 5.12) is between the areas under | and u: 


Property 6 If I(x) < v(x) < u(x) for a< x <b then 
JP I(x) dx < J? v(x) dx < f° u(x) dx. (2) 


EXAMPLE1 cost<1 = fjcostdt<fjldt => sinx<x 


EXAMPLE2 I <sec?t => fo ldt<fjsecttdt > x<tanx 


1 
5 <1 leads to tan™'x <x. 


EXAMPLE 3 Integrating +x 


All those examples are for x > 0. You may remember that Section 2.4 used geometry 
to prove sin h< h<tan h. Examples 1—2 seem to give new and shorter proofs. But I 
think the reasoning is doubtful. The inequalities were needed to compute the deriva- 
tives (therefore the integrals) in the first place. 


Fig. 5.12 Properties 5~7: v above zero, v between | and u, average value (+ balances —). 


Property 7 (Mean Value Theorem for Integrals) If v(x) is continuous, there is a 
point c between a and b where v(c) equals the average value of v(x): 


b 
v(c) = — | v(x) dx = “average value of v(x).” (3) 
This is the same as the ordinary Mean Value Theorem (for the derivative of f(x)): 
S'(o= LO) = ‘average slope of f.” (4) 


With f’ = v, (3) and (4) are the same equation. But honesty makes me admit to a flaw 
in the logic. We need the Fundamental Theorem of Calculus to guarantee that 
f(x) = fF v(t) dt really gives f’ =v. 

A direct proof of (3) places one rectangle across the interval trom a to b. Now raise 
the top of that rectangle, starting at v,,;, (the bottom of the curve) and moving up to 
Umax (the top of the curve). At some height the area will be just right—equal to the 
area under the curve. Then the rectangular area, which is (b — a) times v(c), equals 
the curved area 5 v(x) dx. This is equation (3). 


5.6 Properties of the Integral and Average Value 


2 


vin =x? v(x) = sin? x 


Fig. 5.43 Mean Value Theorem for integrals: area/(b — a) = average height = v{c) at some c. 


That direct proof uses the intermediate value theorem: A continuous function u(x) 
takes on every height between v,,;, and »,,,,. At some point {at two points in 
Figure 5.12c) the function v(x) equals its own average value. 


Figure 5.13 shows equal areas above and below the average height v{c) = v,,¢. 


EXAMPLE 4 The average value of an odd function is zero (between —1 and 1): 


Luifs F aoe 1 | 0 td 1 1 
= x dx= =-- = == 
22] 4 4], 4 4 b-a 2 


For once we know c. It is the center point x = 0, where v(c) = v,,,. = 0. 


EXAMPLE 5 The average value of x? is 4 (between 1 and —1): 


Lt aap ct 1) oe ee 

— x=— =-=-— _ — = - = = 

Deg 6 }-1 6 6) 3 b-a 2 
Where does this function x* equal its average value +? That happens when c? = 4, so 
e can be either of the points 1/./3 and —1/,/3 in Figure 5.13b. Those are the Gauss 


points, which are terrific for numerical integration as Section 5.8 will show. 


EXAMPLE 6 The average value of sin* x over a period (zero to n) is 4: 


: esas eee fae rte ee 
Tt }o an 9 «2 b-a & 


The point ¢ is 7/4 or 3x/4, where sin?c = 4. The graph of sin? x oscillates around its 
average value 3. See the figure or the formula: 


sin? x = 4— 4} cos 2x. (5) 


The steady term is 4, the oscillation is — $ cos 2x. The integral is f(x) = 4x — } sin 2x, 
which is the same as x — 5 sin x cos x, This integral of stn? x will be seen again. Please 
verify that df/dx = sin? x. 


THE AVERAGE VALUE AND EXPECTED VALUE 
The ‘‘average value” from a to 6 is the integral divided by the length b — a. This 


was computed for x and x* and sin’ x, but not explained. It is a major application 
of the integral, and it is guided by the ordinary average of nm numbers: 


a 


sb 1 
aye = v(x) dx comes from Baye = —(¥, Hep + -- +2,). 
n 


integration is paratlel to summation’ Sums approach integrals. Discrete averages 
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approach continuous averages. The average of 4, $, } is §. The average of 4, 2, 3, 4, 
$ is 3. The average of m numbers from 1/n to n/n is 


toe ( Ft + ow tla EE {7) 


The middie term gives the average, when n is odd. Or we can do the addition. As 
m— oo the sum approaches an integral (do you see the rectangles?). The ordinary 
average of numbers becomes the continuous average of v(x) = x: 


a+1 1 7 1 1 
CS ATE GS =- te =) 
on > 5 and { x dx 5 (no aca } 


In ordinary language: “The average value of the numbers between 0 and 1 is 4.” Since 
a whole continuum of numbers lies between 0 and 1, that statement is meaningless 
until we have integration. 

The average value of the squares of those numbers is (x”),. = J x? dx/(b— a) =4. 
If you pick a number randomly between 0 and |, its expected value is 4 and its expected 
square is 5. 

To me that sentence is a puzzle. First, we don’t expect the number to be exactly 
4—-so we need to define “expected value.” Second, if the expected value is $, why is 
the expected square equal to 4 instead of ? The ideas come from probability theory, 
and calculus is leading us to continuous probability. We introduce it briefly here, and 
come back to it in Chapter 8. 


PREDICTABLE AVERAGES FROM RANDOM EVENTS 


Suppose you throw a pair of dice. The outcome is not predictable. Otherwise why 
throw them? But the average over more and more throws is totally predictable. We 
don’t know what will happen, but we know its probahility. 

For dice, we are adding two numbers between | and 6. The outcome is between 2 
and 12. The probability of 2 is the chance of two ones: (1/6)(1/6) = 1/36. Beside each 
outcome we can write its probability: 


(se) (3s) *{3e) (5) (56) *(3e) *(Ge) 9( $s) (Ge) (Ge) #2(3) 


To repeat, one roll is unpredictable. Only the probabilities are known, and they add 
to 1. (Those fractions add to 36/36; all possibilities are covered.) The total from a 
million rolls is even more unpredictable—it can be anywhere between 2,000,000 and 
12,000,000. Nevertheless the average of those million outcomes is almost completely 
predictable. This expected value is found by adding the products in that line above: 


Expected value: multiply (outcome) times (probability of outcome) and add: 
2,6 12 20 30 42 40 36 30 22° 12 
A+++ S44 544545454 = 
36 36 36 36 6 6 IE — 9H — 86 — IK 3: 
If you throw the dice 1000 times, and the average is not between 6.9 and 7.1, you get 
an A. Use the random number generator on a computer and round off to integers. 


Now comes continuous probability. Suppose all numbers between 2 and 12 are 
equally probable. This means all numbers—not just integers. What is the probability 
of hitting the particular number x = x? It is zero! By any reasonable measure, x has 


5.6 Properties of the Integral and Average Value 


no chance to occur. In the continuous case, every x has probability zero. But an 
interval of x’s has a nonzero probability: 


the probability of an outcome between 2 and 3 is 1/10 
the probability of an outcome between x and x + Ax is Ax/10 


To find the average, add up each outcome times the probability of that outcome. 
First divide 2 to 12 into intervals of length Ax =1 and probability p= 1/10. If we 
round off x, the average is 63: 


(i)(a)--n(G)-« 


Here all outcomes are integers (as with dice). It is more accurate to use 20 intervals 
of length 1/2 and probability 1/20. The average is 62, and you see what is coming. 
These are rectangular areas (Riemann sums). As Ax -+ 0 they approach an integral. 
The probability of an outcome between x and x + dx is p(x) dx, and this problem has 
p(x) = 1/10. The average outcome in the continuous case is not a sum but an integral. 


12 120 dy 2 712 
expected value E(x) = | xp(x) dx -| x—= 3 = 
. 2 i 2 10 20 |, 


That is a big jump. From the point of view of integration, it is a limit of sums. From 
the point of view of probability, the chance of each outcome is zero but the probability 
density at x is p(x)= 1/10. The integral of p{x) is 1, because some outcome must 
happen. The integral of xp(x) is x,,.= 7, the expected value. Each choice of x is 
random, but the average is predictable. 

This completes a first step in probability theory. The second step comes after more 
calculus. Decaying probabilities use e~* and e~*\—then the chance of a large x is 
very smali. Here we end with the expected values of x” and 1/./x and I/x, for a 
random choice between 0 and I (so p(x) = !): 


Val aie pa te ee ee ee 
Bix) | dx = —— e(+)- , aed E(*) ie oc(!) 


A CONFUSION ABOUT “EXPECTED” CLASS SIZE 


A college can advertise an average class size of 29, while most students are in large 
classes most of the time. I will show quickly how that happens. 

Suppose there are 95 classes of 20 students and 4 classcs of 200 students. The total 
enroliment in 100 classes is 1900 + 1000 = 2900. A random professor has expected 
class size 29. But a random student sees it differently. The probability is 1900/2900 
of being in a small class and 1000/2900 of being in a large class. Adding class size 
times probability gives the expected class size for the student: 


1900 1000 . 
—— } + —— |= 
(20) (S500) (200) (503) 82 students in the class 


Similarly, the average waiting time at a restaurant seems like 40 minutes (to the 
customer). To the hostess, who averages over the whole day. it is 10 minutes. If you 
came at a random time it would be 10, but if you are a random customer it is 40. 

Traffic problems could be eliminated by raising the average number of people per 
car to 2.5, or even 2. But that is virtually impossible. Part of the problem is the 
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difference between (a) the percentage of cars with one person and (b) the percentage 
of people alone in a car. Percentage (b) is smaller. In practice, most people would be 


in crowded cats. See Problems 37-38. 


5.6 EXERCISES 


Read-through questions 


The integrals [? u(x) dx and [> v(x) dx add to __@_. The 
integral spe (x) dx equals _© _. The reason is _¢_. If 
v{x) <x then So v(x) dx<_ 4d “The average value of me on 
the interval 1 <x <9 is defined by _@ _. It is equal to ufc) 
at a point x=c which is _! . The rectangle across this 
interval with height »(c) has the same area as _ 9. The 


average value of #{x)=x+1 on the interval Lt <x <9 is 
h 


If x is chosen from 1, 3, 5, 7 with equal probabilities 4, its 
expected value (average) is__!_. The expected value of x? 
is__| . If x is chosen from 1, 2, ..., 8 with probabilities ;, 
its expected value isk . If x is chosen from 1 <x <9, the 
chance of hitting an integer is! . The chance of falling 
between x and x + dx is p(x)dx=__™__. The expected value 


E(x) is the integral _" . It equals _ 9 


In 1-6 find the average value of v(x} between a and }, and find 
all points c where v,,, = v{c). 

1 v=x*,a=-1,b=1 2 v=x°,g=—-1,b=1 
4 v= /x,a=0,b=4 


6 v=(sin x)’, a= —7, b=n. 


3 y=cos*x,a=0,b=2 
§ v=1/x?,a@=1,b=2 
7 Atx=8, F(x)= fF v(t) dt + fF oft) dt is : 
8 [i xdx+ fi xdx—fixdx= 


Are 9-16 true or false? Give a reason or an example. 
9 The minimum of [7 v(t) dt is at x = 4. 
10 The value of [**? p(t) dt does not depend on x. 
11 The average value from x =0 to x = 3 equals 
Avy. on OS x 1) 4 4(0,,, OD 1 <x <3), 


12 The ratio (f(b) —f(a)}/(b — a) is the average value of f(x) 
ona<x<b. 


13 On the symmetric interval -—1 <x <1, o(x)—»v,,, is an 
odd function. 


14 If {x} < v(x) < u(x) then di/dx <du/dx < dujdx. 


15 The average of v(x) from 0 to 2 plus the average from 2 
to 4 equals the average from 0 to 4. 


16 (a) Antiderivatives of even functions are odd functions. 
(b) Squares of odd functions are odd functions. 


17 What number é gives {? (x(x) — 2) dx = 


18 If {(2)=6 and f{(6)=2 then the average of df/dx from 
x=2tox=6is 


19 (a) The averages of cosx and |cos x| from 0 to x are 


(b) The average of the numbers 2,,...,0,is__ Ss than 
the average of |v,|, ..., |v,|- 
20 (a) Which property of integrals proves J} v(x) dx < 
Jp let) dx? 
(b) Which property proves —J5 v(x) dx < ‘; |w(x)}| dx? 
Together these are Property 8: |[} v(x} dx| <5 |v(x)| dx. 
21 What function has »,,, (from 0 to x) equal to 4 (x) at all 
x? What functions have v,,, = v(x) at all x? 
22 (a) If v(x} is mereasing, explain from Property 6 why 
JG v(t) dt < xv(x) for x > 0, 
(b) Take derivatives of both sides for a Seaid proof. 
23 The average of v(x}=1t/(1+.x?) on the interval [0, 5) 


approaches as b> oo, The average of V(x)= 
x?/(1 + x?} approaches ; 


24 If the positive numbers v, approach zero as n > oo prove 
that their average (v, + --- +v,)/n also approaches zero. 


2§ Find the average distance from x=a to points in the 
interval 0 < x < 2. Is the formula different if a< 2? 


26 (Computer experiment} Choose random numbers x 
between O and | until the average value of x? is between .333 
and .334. How many values of x? are above and below? If 
possible repeat ten times. 


27 A point P ts chosen randomly along a semicircle (see 
figure: equat probability for equal arcs). What is the 
average distance y from the x axis? The radtus is 1. 
28 A point Q is chosen randomly between —1 and 1. 
(a) What is the average distance Y up to the semicircle? 
(b) Why is this different from Problem 27? 


Buffon needle 


y<cos@ y > cosd 


P 


§.7_ The Fundamental Theorem and Its Applications 


29 (A classic way to compute x} A 2” needle is tossed onto 
a floor with boards 2” wide. Find the probability of falling 
across a crack. (This happens when cos @ > y = distance from 
midpoint of needle to nearest crack. In the rectangle 
0<8<n/2,0<y< 1, shade the part where cos @ > y and find 
the fraction of area that is shaded.) 


3% If Buffon’s needle has length 2x instead of 2, find the 
probability P(x) of falling across the same cracks. 


31 If you roil three dice at once, what are the probabilities of 
each outcome between 3 and 18? What is the expected value? 


32 If you choose a random point in the square 0O<x< 1, 
0< y< 1, what is the chance that its coordinates have y* < x? 


33 The voltage V(t) = 220 cos 2t/60 has frequency 60 hertz 
and amplitude 220 volts. Find ¥,,, from 0 to ¢. 


aye 


34 (a) Show that v,,.,(x) = 4(v(x} + v(—x)) is always even. 
(b) Show that 2 4¢() = 4(v(x) — o(—x)) is always odd. 


35 By Problem 34 or otherwise, write (x + 1)° and I/(x + 1) 
as an even function plus an odd function. 


3% Prove from the definition of df/dx that it is an odd func- 
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37 Suppose four classes have 6, 8, 10, and 40 students, averag- 
ing . The chance of being in the first class is 
. The expected class size (for the student) is 


vo-a(§)-a(8)"(@)@(8)-—— 


38 With groups of sizes x), ...,x, adding to G, the average 
size is . The chance of an individual belonging to 
grouplis ss. The expected size of his or her group is 
E(x) = x(x, /G) + «+ + x4(x,/G). *Prove £} x?/G 2 G/n. 
39 True or false, 15 seconds each: 

(a) If f(x) < g(x) then df/dx < dg/dx. 

(b) If df dx < dg/dx then f(x) < g(x). 

(c) xv(x) is odd if v(x) is even. 

(d) If << Wy. on all intervals then v(x) < w(x) at all 

points. 

x* for x<3 

—x? forx>3 


2x for x <3 
oer ne a then Sts) =| 


Thus f5 v(x} dx = (4) —f(0} = —16. Correct the mistake. 
41 If v(x) =|x—2| find f(x). Compute J? v(x) dx. 


40 HH ufx) -{ 


tion if ffx) is even. 


5.7 The Fundamental Theorem and its Applications 


When the endpoints are fixed at a and 4, we have a definite integral. When the upper 
limit is a variable point x, we have an indefinite integral. More generally: When the 
endpoints depend in any way on x, the integral is a function of x. Therefore we can 
find its derivative. This requires the Fundamental Theorem of Calculus. 

The essence of the Theorem is: Derivative of integral of v equals v. We also compute 
the derivative when the integral goes from a(x) to b(x}—both limits variable. 

Part 2 of the Fundamental Theorem reverses the order: integral of derivative of f 
equals {+ C. That will follow quickly from Part 1, with help from the Mean Value 
Theorem. It is Part 2 that we use most, since integrals are harder than derivatives. 

After the proofs we go to new applications, beyond the standard problem of area 
under a curve. Integrals can add up rings and triangles and shells—not just rectangles. 
The answer can be a volume or a probability—not just an area. 


THE FUNDAMENTAL THEOREM, PART 4 


Start with a continuous function v. Integrate it from a fixed point a to a variable 
point x. For each x, this integral f(x) is a number. We do not require or expect a 
formula for f(x)—it is the area out to the point x. It is a function of x! The Fundamen- 
tal Theorem says that this area function has a derivative (another limiting process). 
The derivative df{/dx equals the original v(x). 


42 Why are there equal areas above and below »,,,? 


214 


8S Integrals 


SC (Fundamental Theorem, Part 1) Suppose v(x) ia a continuous function: 


If fix)=§F oft) dt then df/dx = v(x). 


The dummy variable is written as t, so we can concentrate on the limits. The value 
of the integral depends on the limits @ and x, not on ¢. 
To find df/dx, start with Af= f(x + Ax) — f(x) = difference of areas: 


Af= fF" v(t) dt — fF v(t) de = JE" v(t) dt. (1) 


Officially, this is Property 1. The area out to x + Ax minus the area out to x equals 
the small part from x to x + Ax. Now divide by Ax: 


xtAx 
ae AS | v{t) dt = average value = v(c). (2) 


This is Property 7, the Mean Value Theorem for integrals. The average value on this 
short interval equals v(c}. This point c is somewhere between x and x + Ax (exact 
position not known), and we let Ax approach zero. That squeezes ¢ toward x, so v(c) 
approaches v(x)—remember that p is continuous. The limit of equation (2) is the . 
Fundamental Theorem: 


df 
Re as and p(c)— v(x) so a v(x). (3) 
If Ax is negative the reasoning still holds. Why assume that v(x) is continuous? 
Because if p is a step function, then f(x) has a cormer where df/dx is not v(x). 
We could skip the Mean Value Theorem and simply bound pv above and below: 


for tr between x and x + Ax: Vnin& v(t) <0pax 
integrate over that interval: UminAx= Af <vpy,,Ax (4) 
divide by Ax: Umin SAS /AX S Vinax 


As Ax +0, vyin and vag, approach v(x). In the limit df/dx again equals v(x). 


area under v(x) fix+ Ay) 
= increase in f(x) Af= vax 
FW) 
Ax 
{| area ~ 
vQAxr 
X X+AX x X+tAr 


Ag. 5.44 Fundamental Theorem, Part 1: (thin area Af)/(base length Ax) > height v(x). 


Graphical meaning The f-graph gives the area under the v-graph. The thin strip in 
Figure 5.14 has area Af. That area is approximately v(x) times Ax. Dividing by its 
base, Af/Ax is close to the height p(x). When Ax — 0 and the strip becomes infinitely 
thin, the expression ‘close to” converges to “equals.” Then df/dx is the height at v(x). 


DERIVATIVES WITH VARIABLE ENDPOINTS 


When the upper limit is x, the derivative is p(x). Suppose the lower limit is x. The 
integral goes from x to b, instead of a to x. When x moves, the lower limit moves. 


5.7 The Fundamental Theorem and its Applications 


The change in area is on the left side of Figure 5.15. As x goes forward, area is removed. 
So there is a minus sign in the derivative of area: 


$5 ft . 4 
The derivative of 2(x) -| v(t} dt és Ae = — v(x). (5} 


. 


The quickest proof is to reverse b and x, which reverses the sign (Property 3): 


a(x) = -| v(t) dt so by Part 1 dg = — of{x). 
b dx 


o(h(x)} 
or} v{a(x)) gain 
lose lose Wb Ab 
Oj Ax U(autx)) Aa 
x 
x o¥+An b atx) a(x+ Ax) d(x) b+ Ax) 


Fig. 5.45 Area from x to 6 has dg/dx = — u(x), Area u(b)db is added, area x{a)da is lost 


The general case is messier but not much harder (it is quite useful}. Suppose both 
limits are changing. The upper limit b(x) is not necessarily x, but it depends on x, 
The lower limit a(x) can also depend on x (Figure 5.15b), The area A between those 
limits changes as x changes, and we want dA/dx: 


i) dA db da 
if A= {- v(t) dt then ar i(b(x)} a — p(a(x}) ae (6) 


The figure shows two thin strips, one added to the area and one subtracted. 

First check the two cases we know. When a= 0 and b= x, we have da/dx = 0 and 
db/dx = 1. The derivative according to (6) is v(x) times 1—the Fundamental Theorem. 
The other case has a = x and b = constant. Then the tower limit in (6) produces — (x). 
When the integral goes from a= 2x to b= x*, its derivative is new: 


EXAMPLE 4 A= fi. cos ¢ dt=sin x?— sin 2x 

dAjdx = (cos x*){(3x7) — (cos 2x)(2). 
That fits with (6), because db/dx is 3x? and da/dx is 2 (with minus sign). It also looks 
like the chain rule—which it is! To prove (6) we use the letters v and f: 


btx) 
A= | v(t) dt =f(b(x))— flax) (by Part 2 below) 


a(x) 
aA = f'(B(x}) ud — f'(a(x)} aa (by the chain rule) 
dx dx dx 


Since f’ =z, equation (6) is proved. In the next example the area turns out to be 
constant, although it seems to depend on x. Note that v(t) = 1/t so (3x) = L/3x. 


3x 
EXAMPLE2 A= Li has = 3)- an (2}= 0. 
ax ¢t dx 3x 2x 
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# dA 
Question A -| u(t} dt has aes u(x) + u(— x). Why does v(— x) have a plus sign? 


THE FUNDAMENTAL THEOREM, PART 2 


We have used a hundred times the Theorem that is now to be proved. It is the key 
to integration. “The integral of df/dx is {(x)+ C.” The application starts with v(x). 
We search for an f(x) with this derivative. If df/dx = e{x), the Theorem says that 


| v(x) dx = | a dx =f(x)+C. 


x 
We can't rely on knowing formulas for py and f -only the definitions of { and d/dx. 

The proof rests on one extremely special case: df /dx is the zero function, We casily 
find f(x) = constant. The problem is to prove that there are no other possibilities: f 
must be constant. When the slope is zero, the graph must be flat. Everybody knows 
this is true, but intuition is not the same as proof. 

Assume that df /dx = 0 in an interval. If f(x} is not constant, there are points where 
f(a) #f(b). By the Mean Value Theorem, there is a point ¢ where 

(c= ee (this is not zero because f(a} #f{b)). 
But f‘(c) #0 directly contradicts df/dx = 0. Therefore f{x) must be constant. 

Note the crucial role of the Mean Value Theorem. A local hypothesis (dfidx = 0 
at each point) yields a global conclusion {f= constant in the whole interval). The 
derivative narrows the field of view, the integra] widens it. The Mean Value Theorem 
connects instantaneous to average, local to global, points to intervals. This special 
case (the zero function) applies when A(x) and {{x) have the same derivative: 


Uf dAjdx = df idx on an interval, then A(x) = f(x) + C. (7) 


Reason: The derivative of A(x) -— f{x) is zero. So A(x) — f(x} must be constant. 
Now comes the big theorem. It assumes that v(x) is continuous, and integrates 


using f(x}: 


b 


a 


§D (Fundamental Theorem, Part 2) If v(x)= a then | v(x) dx = f(b) — fa). 


Proof The antiderivative is f(x). But Part | gave another antiderivative for the same 
x(x). It was the integral—constructed from rectangles and now called A(x): 
Cx 
A(x)= | v(t) dt alse has 


we 


a = ex) 
gee 


Since A‘'=t and f'= vt, the special casc in equation (7) states that 4(x) =f(x}+ C. 
That is the essential point: The integral from rectangles equals f(x) + C. 

At the lower limit the area integra] is A=0. So f{a)+ C=O. At the upper iimit 
f(b) + C = A(b). Subtract to find A(b), the definite integral: 


A(b) = §? v(x) dx = f(b) — fla). 


Calculus is beautiful its Fundamental Theorem is also its most useful theorem. 
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Another proof of Part 2 starts with f’ =v and looks at subintervals: 


(x1) —f(@ = v(xF)(x; — a) (by the Mean Value Theorem) 
IS (x2) —f (x1) = v(x¥)(x2 — x,) (by the Mean Value Theorem) 


S(b) —f (Xn 1) = v(x*)(b — x,-1) (by the Mean Value Theorem). 
The left sides add to f(b) — f(a). The sum on the right, as Ax > 0, is t v(x) dx. 


APPLICATIONS OF INTEGRATION 


Up to now the integral has been the area under a curve. There are many other 
applications, quite different from areas. Whenever addition becomes “continuous,” we 
have integrals instead of sums. Chapter 8 has space to develop more applications, but 
four examples can be given immediately—which will make the point. 

We stay with geometric problems, rather than launching into physics or engineering 
or biology or economics. All those will come. The goal here is to take a first step 
away from rectangles. 


EXAMPLE 3 (for circles) The area A and circumference C are related by dA/dr = C. 


The question is why. The area is mr. Its derivative 2zr is the circumference. By the 
Fundamental Theorem, the integral of C is A. What is missing is the geometrical 
reason. Certainly zr? is the integral of 2zr, but what is the real explanation for A = 
J C(r) dr? 

My point is that the pieces are not rectangles. We could squeeze rectangles under 
a circular curve, but their heights would have nothing to do with C. Our intuition 
has to take a completely different direction, and add up the thin rings in Figure 5.16. 


shell volume = 42r2Ar 


ring area = 27rAr 


Fig. 5.146 Area of circle = integral over rings. Volume of sphere = integral over shells. © 


Suppose the ring thickness is Ar. Then the ring area is close to C times Ar. This is 
precisely the kind of approximation we need, because its error is of higher order (Ar)’. 
The integral adds ring areas just as it added rectangular areas: 


0 0 


4-| car= | 2nur dr = nr’. 


That is our first step toward freedom, away from rectangles to rings. 
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The ring area AA can be checked exactly—it is the difference of circles: 
AA = n(r + Ar)? — ar? = 2nr Ar + n(Ar)?. 


This is CAr plus a correction. Dividing both sides by Ar > 0 leaves dA/dr = C. 

Finally there is a geometrical reason. The ring unwinds into a thin strip. Its width 
is Ar and its length is close to C. The inside and outside circles have different perime- 
ters, so this is not a true rectangle—but the area is near CAr. 


EXAMPLE 4 For a sphere, surface area and volume satisfy A = dV/dr. 


What worked for circles will work for spheres. The thin rings become thin shells. A 
shell goes from radius r to radius r + Ar, so its thickness is Ar. We want the volume 
of the shell, but we don’t need it exactly. The surface area is 4zr?, so the volume is 
about 4zr? Ar. That is close enough! 

Again we are correct except for (Ar). Infinitesimally speaking dV = A dr: 


v=( A ir=| eae at 
0 0 3 


This is the volume of a sphere. The derivative of V is A, and the shells explain why. 
Main point: Integration is not restricted to rectangles. 


EXAMPLE 5 The distance around a square is 4s. Why does the area have dA/ds = 2s? 


The side is s and the area is s*. Its derivative 2s goes only half way around the square. 
I tried to understand that by drawing a figure. Normally this works, but in the figure 
dA/ds looks like 4s. Something is wrong. The bell is ringing so I leave this as an 
exercise. 


EXAMPLE 6 Find the area under v(x) = cos !x from x =0 to x= 1. 


That is a conventional problem, but we have no antiderivative for cos~! x. We could 
look harder, and find one. However there is another solution—unconventional but 
correct. The region can be filled with horizontal rectangles (not vertical rectangles). 
Figure 5.17b shows a typical strip of length x = cos v (the curve has v=cos~' x). As 
the thickness Av approaches zero, the total area becomes | x dv. We are integrating 
upward, so the limits are on v not on x: 

‘ 2 

area = [5* cos v dv =sin |" =1. 

The exercises ask you to set up other integrals—not always with rectangles. Archi- 
medes used triangles instead of rings to find the area of a circle. 


Fig. 5.47 Trouble with a square. Success with horizontal strips and triangles. 
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5.7 EXERCISES 


Read-through questions 


The area f(x) = fi v(t) dt is a function of _a__. By Part 1 of 
the Fundamental Theorem, its derivative is _b__. In the 
proof, a small change Ax produces the area of a thin __c¢ 

This area Af is approximately _d__ times _e_. So the 


derivative of [* t? dt is __ 


The integral |? t? dt has derivative __g _. The minus sign 
is because _h__. When both limits a(x) and b(x) depend on 
x, the formula for df/dx becomes__|__ minus __j__. In the 
example {3 * t dt, the derivative is __k 


By Part 2 of the Fundamental Theorem, the integral of 
df/dx is __|__. In the special case when df/dx = 0, this says 
that __m__. From this special case we conclude: If dA/dx = 
dB/dx then A(x)=__n_. If an antiderivative of 1/x is Inx 
(whatever that is), then automatically Ee dx/x = __o 


The square 0<x<s, O<y<s has area A=_p_.Ifs 
is increased by As, the extra area has the shape of _q_. 
That area AA is approximately . So dA/ds=__s 


Find the derivatives of the following functions F(x). 
1 J} cos? r de 2 J} cos 3¢ dt 

3 fo trae 4 fo x"de 

5 fF wdu 6 f%2 v(u) du 


7 e v(t) dt (a “running average’’ of v) 


8 t | v(t) dt (the average of v; use product rule) 
0 


1 x l x+2 
9 + sin? ¢ dt 10 Al trdt 
x 0 2 x 


11 f5 Cf, v(u) dul dt 12 f§ (df/dt)? dt 
13 J5 v(t) de +f? v(t) de 14 f§ v(—2) de 
15 fi, sin ¢? dt 16 f=, sin t dt 


17% w(e)o(e) dt 18g) 5 de 
sin x S(x) 

19 | sin~'t dt 20 | 4 dt 
0 0 

21 True or false 


(a) If df/dx = dg/dx then f(x) = g(x). 

(b) If d?f/dx? = d*g/dx? then f(x) = g(x)+C. 

(c) If 3>x then the derivative of [} v(t) dt is —v(x). 

(d) The derivative of [} v(x) dx is zero. 
22 For F(x) =f?* sin t dt, locate F(x + Ax) — F(x) on a sine 
graph. Where is F(Ax) — F(0)? 


23 Find the function v(x) whose average value between 0 and 
x is cos x. Start from fj v(t) dt = x cos x. 


24 Suppose df/dx = 2x. We know that d(x?)/dx = 2x. How 


do we prove that f(x) =x?+C? 
25 If | nde v(t) dt = ii v(t) dt (equal areas left and right of 
zero), then v(x) is an function. Take derivatives to 
prove it. 
26 Example 2 said that [}* de/t does not really depend on x 
(or t!). Substitute xu for t and watch the limits on u. 
27 True or false, with reason: 

(a) All continuous functions have derivatives. 

(b) All continuous functions have antiderivatives. 

(c) All antiderivatives have derivatives. 


(d) A(x) = f3% dt/t? has dA/dx =0. 
Find {7 v(t) dt from the facts in 28-29. 
d(x") . x 
28 = a 
ax v(x) 29 i v(t) dt rs 


30 What is wrong with Figure 5.17? It seems to show that 
dA = 4s ds, which would mean A = { 4s ds = 2s?. 


31 The cube 0<x, y, z<s has volume V= . The 

three square faces with x = s or y=s or z=s have total area 

A= . If s is increased by As, the extra volume has 

the shape of . That volume AV is approximately 
. So dV/ds = ; 


32 The four-dimensional cube 0<x, y, z, t<s has hyper- 

volume H = . The face with x=s is really a 
. Its volume is V= . The total volume of 

the four faces with x=s, y=s, z=s, or t=s is 

When s is increased by As, the extra hypervolume is 

AH = . So dH/ds = ; 


33 The hypervolume of a four-dimensional sphere is H = 
4n?r*. Therefore the area (volume?) of its three-dimensional 
surface x? + y? +2? +1? =r? is P 
34 The area above the parabola y= x? from x =0 to x=1 
is }. Draw a figure with horizontal strips and integrate. 


35 The wedge in Figure (a) has area 2’ d0. One reason: It is 

a fraction d0/2x of the total area zr”. Another reason: It is 

close to a triangle with small base rd@ and height 

Integrating 4r? d0 from 0 =0 to 0= gives the area 
of a quarter-circle. 


36 A=, \/r? —x?* dx is also the area of a quarter-circle. 
Show why, with a graph and thin rectangles. Calculate this 
integral by substituting x =r sin @ and dx =r cos 0 dé. 
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37 The distance r in Figure (b) is related to @ by r= 
Therefore the area of the thin triangle is $r?7 d@ = 
Integration to # = gives the total area 4. 


38 The x and y coordinates in Figure (c) add to 
rcos@+rsin = . Without integrating explain why 


a ae 
o (cosé+sing)* ~ 


39 The horizontal strip at height y in Figure (d) has width dy 
and length x = . So the area up to y = 2 is ‘ 
What length are the vertical strips that give the same area? 


40 Use thin rings to find the area between the circles r= 2 
and r = 3. Draw a ptciure to show why thin rectangles would 
be extra difficult. 


(e) 
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41 The length of the strip in Figure (e) is approximately 
. The width is . Therefore the triangle has 
area [}_______da {do you get 49). 


42 The area of the ellipse in Figure (f) is Znr?. Its derivative 
is 4nr, But this is not the eorrect perimeter. Where does the 
usual] reasoning go wrong? 


43 The derivative of the integral of v{x) is v(x). What is the 
corresponding statement for sums and differences of the num- 
bers v,? Prove that statement. 


44 The integral of the derivative of f(x) is f(x}+ C. What ts 
the corresponding statement for sums of differences of f;? 
Prove that statement. 


45 Does d7f/dx? = a(x) lead to J} (f5 a(t) dt) dx =f(1)—f(0)? 
4 The mountain y = — x* +7 has an area A(t) above the x 
axis. As ¢ increases so does the area. Draw an xy graph of the 


mountain at t= !. What line gives dA/dt? Show with words 
or derivatives that d?A/de? > 0. 


This section concentrates on definite integrals. The inputs are y(x) and two endpoints 
a and b. The output is the integral 7. Our goal is to find that number 
° y(x) dx =f, accurately and in a short time. Normally this goal is achievable—as 
soon as we have a good method for computing integrals. 

Our two approaches so far have weaknesses. The search for an antiderivative 
succeeds in important cases, and Chapter 7 extends that range—but generally f(x) 
is not available. The other approach (by rectangles} is in the right direction but too 
crude. The height is set by p(x) at the right and left end of each small interval. The 
right and left rectangle rules add the areas (Ax times y): 


Ry = (AxWy¥, + yo t os Fy.) and Ly = (Ax)(yo + yy t + + Yy—1)- 


The vaiue of y(x) at the end of interval j is y,. The extreme left value yp = y(a) enters 
L,. With n equai intervals of length Ax =(b— a)/n, the extreme right value is y, = 
y(b). It enters R,. Otherwise the sums are the same—simple to compute, easy to 
visualize, but very inaccurate. 


This section goes from slow methods (rectangles) to better methods (trapezoidal 
and midpoint) to good methods (Simpson and Gauss). Each improvement cuts down 
the error. You could discover the formulas without the book, by integrating x and 
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x? and x*. The rule R, would come out on one side of the answer, and L, would be 
on the other side. You would figure out what to do next, to come closer to the exact 
integral. The book can emphasize one key point: 


The quality of a formula depends on how many integrals 
J 1 dx, J x dx, | x? dx, ..., it computes exactly. If | x? dx 
is the first to be wrong, the order of accuracy is p. 


By testing the integrals of 1, x, x?, ..., we decide how accurate the formulas are. 

Figure 5.18 shows the rectangle rules R, and L,. They are already wrong when 
y= x. These methods are first-order: p= 1. The errors involve the first power of 
Ax—where we would much prefer a higher power. A larger p in (Ax)? means a 
smaller error. 


Ax Ax Ax 
Fig. 5.148 Errors E and e in R, and L, are the areas of triangles. 


When the graph of y(x) is a straight line, the integral J is known. The error triangles 
E and e have base Ax. Their heights are the differences y;,,—y;. The areas are 
$(base)(height), and the only difference is a minus sign. (L is too low, so the error 
L—1 is negative.) The total error in R, is the sum of the E’s: 


R, — 1 = ZAx(y, — Yo) + ++ +4AX(Yqn— Ya-1) = 44X(Yq — Yo): (1) 
All y’s between yo and y, cancel. Similarly for the sum of the e’s: 
L, — 1 = — 3Ax(y, — Yo) = — 4AxLy(b) — y(a)]. (2) 


The greater the slope of (x), the greater the error—since rectangles have zero slope. 

Formulas (1) and (2) are nice—but those errors are large. To integrate y = x from 
a=0 to b= 1, the error is }Ax(1 — 0). It takes 500,000 rectangles to reduce this error 
to 1/1,000,000. This accuracy is reasonable, but that many rectangles is unacceptable. 

The beauty of the error formulas is that they are “asymptotically correct” for all 
functions. When the graph is curved, the errors don’t fit exactly into triangles. But 
the ratio of predicted error to actual error approaches 1. As Ax +0, the graph is 
almost straight in each interval—this is linear approximation. 

The error prediction }Ax[y(b) — y(a)] is so simple that we test it on y(x)= af ®: 


T= /x dx =3 n= I 10 100 1000 
error R, —1= .33 044 .0048 .00049 
error L, —I= 67 — .056 — .0052 — .00051 


The error decreases along each row. So does Ax = .1, .01, .001, .0001. Multiplying n 
by 10 divides Ax by 10. The error is also divided by 10 (almost). The error is nearly 
proportional to Ax—typical of first-order methods. 

The predicted error is }Ax, since here y(1)= 1 and y(0)=0. The computed errors 
in the table come closer and closer to Ax = .5, .05, .005, .0005. The prediction is the 
“leading term” in the actual error. 
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The table also shows a curious fact. Subtracting the last row from the row above 
gives exact numbers 1, .1, .01, and .001. This is (R,, — J) —(L, — 1), which is R, — L,. 
It comes from an extra rectangle at the right, included in R,, but not L,,. Its height is 
1 and its area is 1, .1, .01, .001. 

The errors in R,, and L,, almost cancel. The average T,, = 4(R,, + L,) has less error— 
it is the “trapezoidal rule.”’ First we give the rectangle prediction two final tests: 


n=1 n= 10 n=100 n= 1000 
(x? — x) dx: errors 1.7°107! 1.7-1073 1.7-1075 1.7-1077 
) 
J dx/(10+ cos 2mx); errors —1-10°? 2-10°-'* 1 “—" 


Those errors are falling faster than Ax. For y= x* — x the prediction explains why: 
y(0) equals y(1). The leading term, with y(b) minus y(a), is zero. The exact errors are 
4(Ax)?, dropping from 10~' to 10°? to 10° * to 10” ’. In these examples L, is identical 
to R, (and also to T,), because the end rectangles are the same. We will see these 
4(Ax)? errors in the trapezoidal rule. 

The last row in the table is more unusual. It shows practically no error. Why do 
the rectangle rules suddenly achieve such an outstanding success? 

The reason is that y(x) = 1/(10 + cos 27x) is periodic. The leading term in the error 
is zero, because y(0) = y(1). Also the next term will be zero, because y’(0) = y’(1). Every 
power of Ax is multiplied by zero, when we integrate over a complete period. So the 
errors go to zero exponentially fast. 


Personal note I tried to integrate 1/(10 + cos 2xx) by hand and failed. Then I was 
embarrassed to discover the answer in my book on applied mathematics. The method 
was a special trick using complex numbers, which applies over an exact period. 
Finally I found the antiderivative (quite complicated) in a handbook of integrals, and 
verified the area 1/ /99. 


THE TRAPEZOIDAL AND MIDPOINT RULES 


We move to integration formulas that are exact when y= x. They have second- 
order accuracy. The Ax error term disappears. The formulas give the correct area 
under straight lines. The predicted error is a multiple of (Ax)?. That multiple is found 
by testing y = x*—for which the answers are not exact. 

The first formula combines R, and L,,. To cancel as much error as possible, take 
the average 4(R,+L,). This yields the trapezoidal rule, which approximates 
J v(x) dx by T,: 


T, =4R, +4, = Ax(dyo + 1 + Y2 + -*+ + Yn—1 thn): (3) 
Another way to find T,, is from the area of the “trapezoid”’ below y = x in Figure 5.19a. 


T= a|4 (Yo+y) + + (Vp + YQ) + 


j p+ | j i+] 
Ax Ax Ax : Ax : - Ax , 
Fig. 5.49 Second-order accuracy: The error prediction is based on v = x?. 


53.8 Numerical integration 


The base is Ax and the sides have heights y,_, and y,. Adding those areas gives 
3(L, + R,) in formula (3)—the coefficients of y, combine into 4+ 4= 1. Only the first 
and Jast intervals are missing a neighbor, so the mle has $y, and 4y,. Because 
trapezoids (unlike rectangles) fit under a sloping line, T, is exact when y= x. 

What is the difference from rectangles? The trapezoidal rule gives weight 4Ax to 
Yo and y,. The rectangle rule R, gives fuil weight Ax to y, (and no weight to yo). 
R, — T, is exactly the leading error 4y, — 4yo. The change to T, knocks out that error. 


Another important formula is exact for y(x) = x. A rectangle has the same area as 

a trapezoid, if the height of the rectangle is halfway between y,, and y,, Ona straight 

line graph that is achieved at the midpoint of the interval. By evaluating y(x) at the 

halfway points 4Ax, Ax, JAx, ..., we get much better rectangles. This leads to the 
midpoint rule M,,: 

M,, = Ax(yy;2 + Vaj2 $00 + Yq-aj2)- (4) 

For Io x dx, trapezoids give 4{0)+ 1+2+3+4(4)=8. The midpoint rule gives 


4$+23+44+4=8, again correct. The rules become different when y = x”, because y,;2 
is no longer the average of yy and y,. Try both second-order rules on x?: 


1= J x? dx n= 1 10 100 
error T,—- [= 1/6 1/600 1/60000 
error M,- f= —1/12 — 1/1200 — 1/120000 


The errors fall by 100 when n is multiplied by 10. The midpoint rule is twice as good 
(—1/12 vs. 1/6). Since all smooth functions are close to parabolas (quadratic approxi- 
mation in each interval), the leading errors come from Figure 5.19. The trapezoidal 
error is exactly $(Ax)* when y(x) is x? (the 12 in the formula divides the 2 in y’): 


T- Te SO) - + = tured ]=Gerli- we} © 


M,— 1s ~ 3 (dx? yi- 6 ]=— 3g (Ox'['O)- yO] 6) 


For exact error formulas, change y'(b)— y‘(a) to (b — a)y’(c). The location of c is 
unknown (as in the Mean Value Theorem). In practice these formulas are not much 
used—they involve the pth derivative at an unknown location c. The main point 
about the error is the factor (Ax)?. 

One crucial fact is easy to overlook in our tests. Each value of y(x) can be extremely 
hard to compute. Every time a formula asks for y;, a computer calls a subroutine. The 
goal of numerical integration is to get below the error tolerance, while calling for a 
minimum number of values of y. Second-order rules need about a thousand values for 
a typical tolerance of 10°°. The next methods are better. 


FOURTH-ORDER RULE: SIMPSON 


The trapezoidal error is nearly twice the midpoint error (1/6 vs. —1/12}. So a 
good combination wil] have twice as much of M, as T,. That is Simpson’s rule: 


1 2 1 
“3 T.+ 5 My= ¢Ax| yo + Ayia +291 + ya + dyn t + 4yn—112 + Ya} (7) 


Multiply the midpoint values by 2/3= 4/6. The endpoint values are multiplied by 


5, 
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2/6, except at the far ends a and b (with heights yp and y,). This 1-4—2—4—2-4-1 
pattern has become famous. 

Simpson’s rule goes deeper than a combination of T and M. It comes from a 
parabolic approximation to y(x) in each interval. When a parabola goes through yo, 
Vij2s Yi, the area under it is $Ax(yo + 4y,,2 + y,). This is S over the first interval. All 
our rules are constructed this way: Integrate correctly as many powers 1, x, x7, ... as 
possible. Parabolas are better than straight lines, which are better than flat pieces. 
S beats M, which beats R. Check Simpson’s rule on powers of x, with Ax = 1/n: 


error if y= x? 0 
error if y= x? 0 
error ify=x*  8.33:10°% 833-1077 ~=8.33-1071! 


Exact answers for x? are no surprise. S, was selected to get parabolas right. But the 
zero errors for x* were not expected. The accuracy has jumped to fourth order, with 
errors proportional to (Ax)*. That explains the popularity of Simpson’s rule. 

To understand why x° is integrated exactly, look at the interval [—1, 1]. The odd 
function x* has zero integral, and Simpson agrees by symmetry: 


1 
| x* ix= jx] =0 and -[C9 +408 +19 ]=0. (8) 


=f 


=} 


Nj J+ j j+1 
a AxIN3 
Fig. 5.20 Simpson versus Gauss: E = c(Ax)*(yj, ; — yj") with cs = 1/2880 and cg = —1/4320. 


THE GAUSS RULE (OPTIONAL) 


We need a competitor for Simpson, and Gauss can compete with anybody. He 
calculated integrals in astronomy, and discovered that two points are enough for a 
fourth-order method. From —1 to 1 (a single interval) his rule is 


fh ye) dx x y(-1/,/3) + y(1/,/3). (9) 


Those “‘Gauss points” x = —1/,/3 and x= 1 //3 can be found directly. By placing 
them symmetrically, all odd powers x, x*, ... are correctly integrated. The key is in 
y= x’, whose integral is 2/3. The Gauss points — xg and + xg get this integral right: 
Zeiten )? + (x6)? ata and x wie 
3 G G) > G 3 G + Ja 
Figure 5.20c shifts to the interval from 0 to Ax. The Gauss points are 
(1+1 /./3) Ax/2. They are not as convenient as Simpson’s (which hand calculators 
prefer). Gauss is good for thousands of integrations over one interval. Simpson is 
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good when intervals go back to back—then Simpson also uses two }’s per interval. 
For y = x*, you see both errors drop by 10~* in comparing n= 1 to n= 10: 
f=, x* dx Simpson error = 8.33: 10° 8.33-1077 
Gauss error — 5,56° 1073 - 5.56°107? 


DEFINITE INTEGRALS ON A CALCULATOR 


It is fascinating to know how numerical integration is actually done. The points are 
not equally spaced! For an integral from 0 to 1, Hewlett-Packard machines might 
internally replace x by 3u? — 2u° (the limits on u are also 0 and 1). The machine 
remembers to change dx. For example, 


a1 


b é a | 
—= become ————— = 
\, /x Jo \/3u?—-2u3 Jo 


Algebraically that tooks worse—but the infinite value of I/\/x at x = 0 disappears 
at u=0. The differential 6(4—u*) du was chosen to vanish at u=Q and u=1. We 
don’t need y(x) at the cndpoints—where infinity is most common. In the wu variable 
the integration points are equally spaced—therefore in x they are not. 

When a difficult point is inside [a, b], break the interval in two pieces. And chop 
off integrals that go out to infinity. The integral of e *’ should be stopped by 
x = 10, since the tail is so thin. (It is bad to go too far.) Rapid oscillations are among 
the toughest— the answer depends on cancellation of highs and lows, and the calcula- 
tor requires many integration points. 


V3- Qu 


The change from x to u affects periodic functions. I thought equal spacing was 
good, since 1/(10 + cos 2xx} was integrated above to enormous accuracy. But there 
is a danger called affasing. If sin 82x is sampled with Ax = 1/8, it is always zero, A 
high frequency 8 is confused with a low frequency 0 (its ‘alias’ which agrces at the 
sample points). With unequal spacing the problem disappears. Notice how any integ- 
ration method can be deceived: 


Ask for the integral of »=0 and specify the accuracy. The calculator 
samples y at x,, ...,.x,. (With a PAUSE key, the x’s may be displayed.) 
Then integrate ¥(x)=(x—x,)* --(x—x,)*. That also returns the 
answer zero (now wrong), because the calculator follows the same steps. 


On the HP-28S you enter the function, the endpoints, and the accuracy. The 
variable x can be named or not (see the margin). The outputs 4.67077 and 4.7E-5 are 
the requested integral iF e* dx and the estimated error bound. Your input accuracy 
.00001 guarantees 


3: "EXPC(X)' Lp : 3: (CEXP)) 
2-{X 1 2} relative error in y= wee epee < 00001. roe a ear 

z compu : . 

1: .90001 org 1: .00001 


The machine estimates accuracy based on its experience in sampling }(x). If you 
guarantee e* within .QO0Q0000001, it thinks you want high accuracy and takes longer. 
In consulting for HP, William Kahan chose formulas using 1, 3, 7, 15, ... sample 
points. Each new formula uses the sampies in the previous formula. The calculator 
stops when answers are close. The fast paragraphs are based on Kahan’s work. 
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TI-81 Program to Test the Integration Methods L, R, T, M, S 


PrgmI:NUM INT :D/25H :A+JD>X :Disp "L, R, M, 
:Disp "A=" zA 7X sRtYYOR tT, S" 

z:Input A 2YTL :1S>CJ,N) :Disp L 

Disp "B=" 2134 :Goto 1 sDisp R 

z:Input B :9—>R :(L+R-Y¥1)D—>L : Disp M 

zLbl N :Q-—M :RDOR :Disp T 

:Disp "N=" :LblL 1 :MDOM :Disp § 

Input N X+H35X 2: CLRID /25T >Pause 

: (B-A) /NOD iM+Y¥4—>M 3: C2M+T)/34S5S :Goto N 


Place the integrand y{x) in the Y41 position on the Y= function edit screen. Execute 
this program, indicating the interval [A, B] and the number of subintervals N. Rules 
Land R and M use N evaluations of p(x). The trapezoidal rule uses N+ 1 and 
Simpson’s rule uses 2N + 1. The program pauses to display the results. Press ENTER 
to continue by choosing a different N. The program never terminates (only pauses). 
You break out by pressing ON. Don’t forget that 1S, Goto, ... are on menus. 
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Read-through questions 


To integrate p(x}, divide [a,b] into m pieces of length 
Ax=_a_. R, and L, place a _b over each piece, 
using the height at the right or ¢ endpoint: 
R, = Ax(y, + - +y,) and L,=_d_. These are _e 

order methods, because they are incorrect for p=__t__. The 
total error on [0,1} is approximately _g . For y=cos xx 
this leading term is_h _. For y =cos 2nx the error is very 


small because [0, 1] isa complete i 


A much better method is 7,=4R,+_] = 
Ax[4tyo +__k yy, + +_f__y,]. This _m_ rule is 

n__-order because the error for p=x is__o _. The error 
for y=x? fromatobis_p_. The _q __ tule is twice as 
accurate, using M,=Ax[_o i). 


Simpson’s method is S,=$M,+_ 8 .Itis__¢ _-order, 
because the powers __u_ are integrated correctly. The 
coefficients of yo, Yij2,¥; are _¥v times Ax. Over three 
intervals the weights are Ax/6 times 1~4-__w__. Gauss uses 

x points in each interval, separated by Ax/,/3. For a 
method of order p the error is nearly proportional to __y_. 


1 What is the difference L, — T,,? Compare with the ieading 
efror term in (2). 


2 If you cut Ax in half, by what factor is the trapezoidal 
error reduced (approximately)? By what factor is the error in 
Simpson’s rule reduced? 


3 Compute R, and L, for {} x?dx and = 1,2, 10. Either 
verify (with computer) or use (without computer) the formula 
134234 -- +P =) (nt 


4 One way to compute TJ, is by averaging 4(L,+R,). 
Another way is to add $yo+y, + «> +4y,. Which is more 
efficient? Compare the number of operations. 


5 Test three different rules on J =f) x* dx for n=2, 4, 8. 


6 Compute x to six places as 45 dx/(1 + x7), using any 
rule. 


7 Change Simpson's rule to Ax(tyo + 41,2 +441) in each 
interval and find the order of accuracy p. 


8 Demonstrate superdecay of the error when 1/(3 + sin x) is 
integrated from 0 to 27. 


9 Check that (Ax)*(yj4,—yj)/12 is the correct error for 
y=1 and y=x and p= x? from the first trapezoid (j = 9). 
Then it is correct for every parabola over every interval. 


10 Repeat Problem 9 for the midpoint error 
~(Ax)*(¥j41 — ¥))/24. Draw a figure to show why the rectan- 
gle M has the same area as any trapezoid through the mid- 
point (including the trapezoid tangent to p(x))}. 

11 In principle [°, sin? x dx/x? = 2. With a symbolic alge- 
bra code or an HP-28S, how many decimal places do you 
get? Cut off the integral to ft and test large and small A. 


12 These four integrals all equal x: 

Leal ete 

o Jx(t—-x) Joo * a. bx 
(a} Apply the midpoint rule to two of them until 
nm 3.1416. 


(b) Optional: Pick the otber two and find x ~ 3. 
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13 To compute in 2= {7 dx/x =.69315 with error less than 
O01, how many intervals should T, need? Its leading error is 
(Ax)* [y'(b) — y'(a)]/12. Test the actual error with y= 1/x. 


14 Compare T, with M, for { ./x dx and m= 1, 10, 100. The 
error prediction breaks down because y'(0) = a. 


15 Take f(x}=§} y(x) dx in error formula 3R to prove that 
§3* y(x) dx — y(0) Ax is exactly (Ax)? y'(c) for some point c. 


16 For the periodic function y(x) = 1/{2 + cos 6x) from —t 
to !, compare T and S and G for n= 2. 


17 For I= {j, ./t — x? dx, the leading error in the trapezoi- 
dal cule is . Try n= 2, 4, 8 to defy the prediction. 


18 Change to x =sin 6, ./1—x? =cos 6, dx =cos @ dé, and 
repeat T, on §* cos?6 d9. What is the predicted error after 
the change to @? 


19 Write down the three equations Ay(0)}+ By(4) + Cyl) = J 


for the three integrals I = {5 1 dx, [5 x dx, {} x? dx. Solve for - 


A, B, C and name the rule. 


20 Can you invent a rule using Ayo + By + Cyij2+ 
Dys,4 + Ey, to reach higher accuracy than Simpson’s? 


21 Show that T, is the only combination of L, and R, that 
has second-order accuracy, 


22 Calculate f e~** dx with ten intervals from 0 to 5 and 0 
to 20 and 0 to 400. The integral from 0 to 0 is $./n. What 
is the best point to chop off the infinite integral? 


23 The graph of p(x) = 1/{x? + 107 '°) has a sharp spike and 
a long tail. Estimate { y dx from Typ and Tyo9 (don’t expect 
much), Then substitute x = 10-5 tan 6, dx = 107% sec? 8 a0 
and integrate 10° from 0 to 2/4. 


24 Compute fj |x—2)dx from T, and compare with the 
divide and conquer method of separating {9 |x — | dx from 
fe Ix —n] dx. 


25 Find a,b,c so that p=ax?+bx+c equals 1, 3,7 at 
x=0, 4, 1 {three equations), Check that $-1+#-3+2+7 
equals {5 y dx. 


26 Find c in S—} =c(Ax)*[y"(1) — y"(O)] by taking y= x* 
and Ax = I. 


27 Find ¢ in G—I=c(Ax)*(y"()—y”"(-1)) by taking 
y=x*, Ax =2, and G=(~-1/,/3)* + (1//3)*. 


28 What condition on y(x) makes L,=R,=T, for the 
integral f° y(x) dx? 


29 Suppose (x) is concave up. Show from a picture that the 
trapezoidal answer is too high and the midpoint answer is 
too low. How does y” > 0 make equation (5) positive and (6) 
negative? 
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CHAPTER 6 


Exponentials and Logarithms 


This chapter is devoted to exponentials like 2* and 10° and above ail e*. The goal is 
to understand them, differentiate them, integrate them, solve equations with them, 
and invert them (to reach the logarithm). The overwhelming importance of e* makes 
this a crucial chapter in pure and applied mathematics. 

In the traditional order of calculus books, e* waits until other applications of the 
integral are complete. I would like to explain why it is placed earlier here. I believe 
that the equation dy/dx = y has to be emphasized above techniques of integration. 
The laws of nature are expressed by differential equations, and at the center is e. Its 
applications are to life sciences and physical sciences and economics and engineering 
(and more—wherever change is influenced by the present state). The model produces 
a differential equation and I want to show what calculus can do. 

The key is always b™*" = (b")(b"). Section 6.1 applies that rule in three ways: 


1. to understand the logarithm as the exponent; 
2. to draw graphs on ordinary and semilog and log-log paper; 
3. to find derivatives. The slope of 5* will use b** 4 = (b*)(b*), 


Pee 866.1 AnOvenicn 


There is a good chance you have met logarithms. They turn multiplication into 
addition, which is a lot simpler. They are the basis for slide rules (not so important) 
and for graphs on jog paper (very important). Logarithms are mirror images of 
exponentials—and those I know you have met. 

Start with exponentials. The numbers 10 and 10° and 10° are basic to the decimal 
system. For completeness I also include 10°, which is “ten to the zeroth power” or 
1. The logarithms of those numbers are the exponents. The logarithms of 1 and 10 and 
100 and 1000 are 0 and 1 and 2 and 3. These are logarithms “to base 10,” because 
the powers are powers of 10. 


Questlon When the base changes from 10 to 5, what is the logarithm of 1? 
228 Answer — Since b° = 1, log,! is always zero. To base b, the logarithm of b” is n. 


6.4 An Overview 


Negative powers are also needed. The number 10* is positive, but its exponent x can 
be negative. The first examples are 1/10 and 1/100, which are the same as 10°! and 
10°-*. The logarithms are the exponents —1 and —2: 


1000 = 10° and log 1000 = 3 
1/1000=10°3. —s and_— log 1/1000 = —3. 


Multiplying 1000 times 1/1000 gives 1 = 10°. Adding logarithms gives 3 + (—3)=0. 
Always 10” times 10” equals 10"*". In particular 10° times 10? produces five tens: 


(10)(10)(10) times (10)(10) equals (10)(10)(10)(10)(10) = 10°. 


The law for b™ times b" extends to all exponents, as in 10*° times 10*. Furthermore 
the law applies to all bases (we restrict the base to b>0 and b #1). In every case 
multiplication of numbers is addition of exponents. 


6A b™ times b” equals b"*", so logarithms (exponents) add 
b™ divided by b" equals b”~", so logarithms (exponents) subtract 


and 


log,( yz) = log,y + log,z log,(y/z) = log,y — log,z. (1) 


Historical note In the days of slide rules, 1.2 and 1.3 were multiplied by sliding 
one edge across to 1.2 and reading the answer under 1.3. A slide rule made in 
Germany would give the third digit in 1.56. Its photograph shows the numbers on a 
log scale. The distance from | to 2 equals the distance from 2 to 4 and from 4 to 8. 
By sliding the edges, you add distances and multiply numbers. 

Division goes the other way. Notice how 1000/10 = 100 matches 3 — 1 = 2. To divide 
1.56 by 1.3, look back along line D for the answer 1.2. 


The second figure, though smaller, is the important one. When x increases by 1, 2* 
is multiplied by 2. Adding to x multiplies y. This rule easily gives y= 1, 2, 4, 8, but 
look ahead to calculus—which doesn’t stay with whole numbers. 

Calculus will add Ax. Then y is multiplied by 2°*. This number is near 1. If 
Ax = 7/5 then 24* = 1.07—the tenth root of 2. To find the slope, we have to consider 
(24* — 1)/Ax. The limit is near (1.07 — 1)/3'5 = .7, but the exact number will take time. 
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Fig. 6.4 An ancient relic (the slide rule). When exponents x add, powers 2* multiply. 
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Base Change Bases other than 10 and exponents other than 1, 2, 3, ... are needed 
for applications. The population of the world x years from now is predicted to grow 
by a factor close to 1.02*. Certainly x does not need to be a whole number of years. 
And certainly the base 1.02 should not be 10 (or we are in real trouble). This prediction 
will be refined as we study the differential equations for growth. It can be rewritten 
to base 10 if that is preferred (but look at the exponent): 


1.02* is the same as 1081-92)", 


When the base changes from 1.02 to 10, the exponent is multiplied —as we now see. 
For practice, start with base b and change to base a. The logarithm to base a will 
be written “log.” Everything comes from the rule that logarithm = exponent: 


base change for numbers: b= a'°8". 
Now raise both sides to the power x. You see the change in the exponent: 
base change for exponentials. b* = q''’8*™, 


Finally set » = b*. Its logarithm to base 8 is x. Its logarithm to base a is the exponent 
on the right hand side: log, y = (log, &)x. Now replace x by log, y: 


base change for logarithms: lov, y = (log, blog, ¥). 
We absolutely need this ability to change the base. An example with a = 2 is 
b=8=23 87 = (2°)? = 2° log, 64 = 3-2 = (log,8)(log,64). 


The rule behind base changes is (a"° = a"*. When the mth power is raiscd to the 
xth power, the exponents multiply. The square of the cube is the sixth power: 


(a)(a)(a) times (a){a){a) equals (a)(a)(a){a)(a)(a): (a*)’ = a®. 


Another base will soon be more important than 10-—-here are the rules for base 
changes: 


6B To change numbers, powers, and logarithms from base 5 to base a, use 


b = gio X= gllebex Jog. y = (log, b) (log, y} 


The first is the definition. The second is the xth power of the first. The third is the 
logarithm of the second (remember y is 6*). An important case is y = a: 


log,a = (log, 5)(log,a) = 1 so log, b = 1 log,a. (3) 


EXAMPLE & = 2° means 8/? = 2. Then (log,8)(logg2) = (3)(1/3) = 1. 


This completes the aigebra of logarithms. The addition rules 6A came from 
(b")(b") = b™*". The multiplication rule 6B came from (a")* = a"*. We still need to 
define b* and a® for all real numbers x. When x is a fraction, the definition is easy. 
The square root of a® is a* (m= 8 times x = 1/2}. When x is not a fraction, as in 2", 
the graph suggests one way to fill in the hole. 

We could define 2" as the limit of 2°, 271/19, 2314109 As the fractions r approach 
7m, the powers 2’ approach 2. This makes y = 2* into a continuous function, with the 
desired properties (2")(2") = 2"*" and (2")* = 2"*—whether m and n and x are inte- 
gers or not. But the ¢’s and d’s of continuity are not attractive, and we eventually 
choose (in Section 6.4) a smoother approach based on integrals. 


6.1 An Overview 
GRAPHS OF b* AND log,y 
It is time to draw graphs. In principle one graph should do the job for both functions, 


because y = b* means the same as x = log, y. These are inverse functions. What one 
function does, its inverse undoes. The logarithm of g(x) = b* is x: 


g” *(g(x)) = log,(b*) = x. (4) 
In the opposite direction, the exponential of the logarithm of y is y: 
a(g*(y)} = bem” = y, (5) 


This holds for every base b, and it is valuable to see b= 2 and b= 4 on the same 
graph. Figure 6.2a shows y = 2* and y = 4*, Their mirror images in the 45° line give 
the logarithms to base 2 and base 4, which are in the ght graph. 

When x is negative, y = b* is still positive. If the first graph is extended to the left, 
it stays above the x axis. Sketch it in with your pencil. Also extend the second graph 
down, to be the mirror image. Don’t cross the vertical axis. 


x 
l 2 3 4 12 4 8 16 


Fig. 6.2. Exponentials and mirror images (logarithms). Different scales for x and y. 


There are interesting relations within the left figure. All exponentials start at 1, 
because b° is always 1. At the height y= 16, one graph is above x = 2 (because 4* = 
16). The other graph is above x = 4 (because 2* = 16). Why does 4* in one graph equal 
2?* in the other? This is the base change for powers, since 4 = 2?. 

The figure on the right shows the mirror image—the logarithm. All logarithms 
start from zero at y= 1. The graphs go down to — oo at y=0. (Roughly speaking 
2” *” is zero.) Again x in one graph corresponds to 2x in the other (base change for 
logarithms). Both logarithms climb slowly, since the exponentials climb so fast. 


The number log, 10 is between 3 and 4, because 10 is between 2° and 2*. The slope 
of 2* is proportional to 2*—which never happened for x”. But there are two practical 
difficulties with those graphs: 


1. 2* and 4* increase too fast. The curves turn virtually straight up. 
2. The most important fact about Ab* is the value of h—and the base 
doesn’t stand out in the graph. 


There is also another point. In many problems we don’t know the function y = 
{(x). We are looking for it! All we have are measured values of y (with errors mixed 
in). When the values are plotted on a graph, we want to discover f(x). 


Fortunately there is a solution. Scale the y axis differently. On ordinary graphs, 
each unit upward adds a fixed amount to y. On a log scale each unit multiplies y by 
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a fixed amount. The step from y= 1 to y = 2 is the same length as the step from 3 to 
6 or 10 to 20. 

On a log scale, y= 11 is not halfway between 10 and 12. And y= 0 is not there at 
all. Each step down divides by a fixed amount—we never reach zero. This is com- 
pletely satisfactory for Ab*, which also never reaches zero. 

Figure 6.3 is on sem#log paper (also known as log-linear), with an ordinary x axis. 
The graph of y= Ab* is a straight line. To see why, take logarithms of that equation: 


log y=log A+ x log b, (6) 


The relation between x and log y is linear. It is really log y that is plotted, so the graph 
is straight. The markings on the y axis allow you to enter y without looking up its 
iogarithm—you get an ordinary graph of log » against x. 

Figure 6.3 shows two examples. One graph is an exact plot of y= 2+ 10°. It goes 
upward with slope 1, because a unit across has the same length as multiplication by 
10 going up. 10° Aas slope 1 and 10"°*°* (which is b*) will Aave slope log b. The 
crucial number log } can he measured directly as the slope. 


A 6 8 1,0 ] 2 3 4 5$ 678910 


Fig. 6.3 2-10" and 4-10~*? on semilog paper. Fig.6.4 Graphs of Ax* on log-log paper. 


The second graph in Figure 6.3 is more typical of actual practice, in which we start 
with measurements and look for f(x). Here are the data points: 


x=00 02 04 06 08 1.0 
y=40 3.2 2.4 2.0 16 1.3 


We don’t know in advance whether these values fit the model y = Ab*. The graph is 
strong evidence that they do. The points lie close to a line with negative slope— 
indicating log b< 0 and b <1. The slope down is haif of the earlier slope up, so the 
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model is consistent with 
y=A-10°*? or log y=log A— 4x. (7) 


When x reaches 2, » drops by a factor of 10, At x =0 we see A = 4, 

Another model—a power y = Ax* instead of an exponential—also stands out with 
logarithmic scaling. This time we use log-log paper, with both axes scaled. The 
logarithm of y = Ax* gives a linear relation between log y and log x: 


log y=log A+k log x. (8) 


The exponent k becomes the slope on log-log paper. The base b makes no difference. 
We just measure the slope, and a straight line is a lot more attractive than a power 
curve. 

The graphs in Figure 6.4 have slopes 3 and 3 and —1. They represent Ax* and 
Ax/x and A/x. To find the A’s, look at one point on the line. At x = 4 the height is 
8, so adjust the 4’s to make this happen: The functions are x*/8 and 4./x and 32/x. 
On semilog paper those graphs would not be straight! 

You can buy log paper or create it with computer graphics. 


THE DERIVATIVES OF y = 6” AND x= log,y 


This is a calculus book. We have to ask about slopes. The algebra of exponents is 
done, the rules are set, and on log paper the graphs are straight. Now come limits. 

The central question is the derivative. What is dy/dx when y= b*? What ts dx/dy 
when x is the logarithm log, y? Thpse questions are closely related, because 6* and 
log,y are inverse functions. If one slope can be found, the other is known from 
dx/dy = 1/(dy/dx). The problem is to find one of them, and the exponential comes 
first. 

You will now see that those questions have quick (and beautiful) answers, except 
Jor a mysterious constant. There is a multiplying factor ¢ which needs morc time. | 
think it is worth separating out the part that can be done immediately, leaving c in 
dy/dx and 1/c in dx/dy. Then Section 6.2 discovers ¢ by studying the special number 
called e {but ¢ #e). 


6C The derivative of 6% is a multiple ch*. The number c depends on the 
base 6. 


The product and power and chain rules do not yield this derivative. We are pushed 
all the way back to the original definition, the limit of Ay/Ax: 


+ ai xth_ px 
ay yp Met) 
h 0 h 


dx AO h 9) 


Key idea: Split b*~* into b* times b*. Then the crucial quantity 6* factors out. More 
than that, 6 comes outside the limit because it does not depend on h. The remaining 
limit, inside the brackets, is the number c that we don't yet know: 


xpER px Lo 
| = tim : = o*| fim” ; "| =o (10) 


This equation is central to the whole chapter: dy/dx equals ch* which equals cy. The 
rate of change of y is proportional to y. The slope increases in the same way that b* 
increases (except for the factor c). A typical example is money in a bank, where 
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Read-through questions On ordinary paper the graph of y=__! isa straight line. 
Its slope is__™ . On semilog paper the graph of y=__9 
In 10* = 10,000, the exponent 4 is the _@ _ of 10,000. The is a straight line. Its slope is _¢ . On log-log paper the 


base is b= . The logarithm of 10" ttmes 10° is _ ¢_. gtaph of y=__P ___ is a straight line. Its slope is 
The logarithm of 10"/10"1s__.d__. The logarithm of 10,000* een er eee : 
ie @ By ob theniten. f-, Here a caay number The slope of y= b* is dvjdx =__¢ _, where c depends on 


and y is always _ 9 


A base change gives b=a 
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interest is proportional to the principal. The rich get richer, and the poor get slightly 
richer. We will come back to compound interest, and identify b and c. 
The inverse function is x = log,y. Now the unknown factor is 1/e: 


6D The slope of log,y is 1/cy with the same c (depending on 8). 


Proof If dpjdx = cb* then dxjdy = I/cb* = Ley. (11) 
That proof was like a Russian toast, powerful but too quick! We go more carefully: 
f(bj)=x (logarithm of exponential) 
L(P\(cb*) = 1 (x derivative by chain rule) 
f(b") = L/eb* (divide by cb*) 
Ty = lf/ey (identify b* as y} 


The logarithm gives another way to find c. From its slope we can discover l/c. This 
is the way that finally works (next section). 


ys2" 


slope ¢2' 


Fig. 6.5 The slope of 2* is about .7-2*. The slope of log,y is about 1/.7y. 


Final remark It is extremely satisfying to meet an f(y} whose derivative is I/cy. 
At last the ““~1 power” has an antiderivative. Remember that [x"dx = x"*1/(n + 1) 
is a failure when n= — 1. The derivative of x° (a constant} does not produce x" !. 
We had no integral for x~', and the logarithm fills that gap. If y is replaced by x or t 
(all dummy variables) then 


d 


l d 1 
ae log,x = ae and + lOggt =; (12) 


dt ct 


The base & can be chosen so that c = 1. Then the derivative is |/x. This final touch 
comes from the magic choice b = e—the highlight of Section 6.2. 


6.4 EXERCISES 


and b* =a 


y= 2 it follows that log,8 times Jog,2 equals ___k__. log,x is _ 


b. The number c is the limit as h-+0 of __s 
log,y is the inverse, (dx/dy{dy/dx)=__1 
. Then dy{dx = cb* yields dx/dy = __4 __. Substituting 5* for y, the 
8° is 245. In other words log,y is __j _ times loggy. When slope of log,y is _¥__. With a change of letters, the slope of 
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Problems 1-10 use the rules for logarithms. 


I Find these logarithms (or exponents): 
{a)log,32 —_(b) log,(1/32) (c) logs(1/32) 
(d)logs,2 —_(e) logio(10./10) _(f) log (log, 16) 
2 Without a calculator find the values of 
(a) 3t825 (b) 32¥m25 
(c) logy 95 + log; 92 (d) (log35)(log,9) 
(e) 10°10-*10* (f) log.56 ~log,7 


3 Sketch y= 277 and y= 4(4") from —1 to | on the same 
graph. Put their mirror images x = —log,y and x =log,2y 
on a second graph. 


4 Following Figure 6.2 sketch the graphs of y = (4) and x = 
logs;2y. What are log, ,22 and !og,/.4? 


5 Compute without a computer: 


(a) log,3 + log,$ (b) log,{4}'° 
(c} log, ,100*° (d) (log, oe)(log, 10) 
(e) 27°42? (f) log{1/e) 


6 Solve the following equations for x: 
(a) log, o(10*} =7 
(c) log, 10 = 2 
(e} log x + log x = log 8 


{b} log 4x —log 4= log 3 
{d) log(1/x) = 2 

(f) log.(x”) =5 

7 The logarithm of y =x” is logy = 


*8 Prove that (log,a}(log,c) = (logya)(log,c). 


9 2'° is close to 10° (1024 versus 1000). If they were equal 
then log,10 would be . Also log,92 would be 
instead of 0.301. 


10 The number 2!°°° has approximately how many (decimal) 
digits? 


Questions 11-19 are about the graphs of y = b* and x = log, y. 


11 By hand draw the axes for semilog paper and the graphs 
of y= 1.1* and y= [Q(1.17. 


12 Display a set of axes on which the graph of y = log, 9x is 
a straight line. What other equations give straight lines on 
those axes? 


13 When noise is measured in decibels, amplifying by a factor 
A increases the decibel level by 10 log A. If a whisper is 20db 
and a shout is 70db then 10 log A = 50 and A= 


14 Draw semilog graphs of y = 10'~* and y = 4(./10¥. 


15 The Richter scale measures earthquakes by logyo(f/Zo) = 
R. What is R for the standard earthquake of intensity fo? If 
the 1989 San Francisco earthquake measured R = 7, how did 
its intensity J compare to I,? The 1906 San Francisco quake 
had R= 8.3. The record quake was four times as intense with 
R= 


16 The frequency of A ahove middle C is 440/second. The 
frequency of the next higher A is . Since 27/1? 3 1.5, 
the note with frequency 660/sec is 


17 Draw your own semilog paper and plot the data 
y=7, 11, 16, 28,44 at x=0, 1/2, 1, 3/22 

Estimate A and 6 in y = Ab’. 

18 Sketch log-log graphs of y = x? and y= /x. 


19 On log-log paper, printed or homemade, plot y =4, 11, 
21, 32, 45 at x=1, 2, 3, 4, 5. Estimate A and k in y= Ax. 


Questions 20-29 are about the derivative dy/dx = cb’. 
20 g{x)=5* has slope g’=cg. Apply the cham mile to 
afty)) = y to prove that dffdy = 1/cy. 


21 If the slope of log x is 1/ex, find the slopes of log (2x) and 
log (x7) and log (27). 


22 What is the equation {including c} for the tangent line to 
y= 10% at x = 0? Find also the equation at x = 1. 


23 What is the equation for the tangent line to x = log, oy at 
y= 1? Find also the equation at py = 10. 


24 With 6 = 10, the slope of 10“ is c10*. Use a calculator for 
small A to estimate c = lim {10" — 1)/A. 


25 The unknown constant in the slope of y=(1) is 
L=tim (.1"—1)/h. (a) Estimate L by choosing a amall A. 
(b) Change # to —h to show that L = —c from Problem 24. 
26 Find a base 6 for which (b* — 1)/h = 1. Use hk = 1/4 by hand 
or h= 1/10 and 1/100 by calculator. 


27 Find the second derivative of y = 6* and also of x = log, y. 


28 Show that C=lim (100"— 1)/h is twice as large as c= 
lim (10* — 1)/A. (Replace the last A's by 2h.) 


29 In 28, the limit for b= 100 is twice as large as for b= 
So ¢ probably involves the of 6. 
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Re §8§86.2 The Exponential ec —_xi 


The last section discussed b* and log,y. The base b was arbitrary—it could be 2 or 6 
or 9.3 of any positive number except 1. But in practice, only a few bases are used. 
I have never met a logarithm to base 6 or 9.3. Realistically there are two leading 
candidates for b, and 10 is one of them. This section is about the other one, which is 
an extremely remarkable number. Tbis number is not seen in arithmetic or algebra 
or geometry, where it looks totally clumsy and out of place. In calculus it comes into 
its own. 

The number is e. That symbol was chosen by Euler (initially in a fit of selfishness, 
but he was a wonderful mathematician). It is the base of the natural logarithm. 
It also controls the exponential ¢, which is much more important than In x. 
Euler also chose 1 to stand for perimeter—anyway, our first goal is to find e. 

Remember that the derivatives of 5* and log,y include a constant ¢ that depends 
on b. Equations (10) and (11) in the previous section were 


d 
ae oh and ey = (1) 
At x =0, the graph of b* starts from b°= 1. The slope is c. At y= 1, the graph of 
log, y starts from log,1 = 0. The logarithm has slope l/c. With the right choice of the 
base b those slopes will equal | (because c will equal 1). 
For y = 2* the slope c is near .7. We already tried Ax = .1 and found Ay = .07. The 
base has to be larger than 2, for a starting slope of c= 1. 
We begin with a direct computation of the slope of log, y at y = 1: 


| ead ; 
— slope at 1 = lim ;Llows(l + hy- logs! | = lim log,| (1 + ny |, (2) 


Always log,1 = 0. The fraction in the middle is log,(1 + 4) times the number 1/h. This 
number can go up into the exponent, and it did. 

The quantity (1+ 4)” is unusual, to put it mildly. As h + 0, the number 1 +A is 
approaching {. At the same time, 1/h is approaching infinity. im the limit we have 
1°. But that expression is meaningless (like 0/0). Everything depends on the 
balance between “nearly 1”’ and “‘nearly oo.” This balance produces the extraordinary 
number ¢: 


. 1v" 
DEFINITION The number e is equal to lim (1+ h)'*, Equivalently e = lim (: + :) : 
~ axa 


Before computing e, look again at the slope I/c. At the end of equation (2) is the 
logarithm of e: 


/e = log,e. (3) 
When the base is b = e, the slope is log,e = 1. That base e has c= 1 as desired: 
The derivative of c* is 1+ e* and the derivative of logy is I (4) 


This is why the base ¢ is all-important in calculus. It makes c = !. 

To compute the actual number e from (1 + A)’, choose hk = 1, 1/10, 1/100, .... Then 
the exponents 1/hk aren = 1, 10, 100, .... (All limits and derivatives will become official 
in Section 6.4.) The table shows (1 + A)’* approaching e as h-> 0 and n> cc: 
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ac They Menai a+ne=(1+ 2) 
n A 7 
1 10 2.0 2.0 
2 05 1.5 2.25 
10 O1 L.A 2.593742 
100 0.08 1.01 2.704814 
1000 0,001 t.001 2.716924 
10000 =. 0.0001 1.0001 2.718146 


The last column is converging to e (not quickly). There is an infinite series that 
converges much faster. We know 125,000 digits of e (and a billion digits of 2). There 
are no definite patterns, although you might think so from the first sixteen digits: 


e = 2.7 1828 1828 45 90 45--- (and t/e x .37). 
The powers of e produce y = e*. At x = 2.3 and 5, we are close to y= 10 and 150. 


The logarithm is the inverse function. The logarithms of 150 and 10, to the base e, 
are close to x = 5 and x = 2.3. There is a special name for this logarithm——the aatural 
logarithm. There is also a special] notation “In” to show that the base is e: 


In y means the same as log, y. The natural logarithm is the exponent in & = y. 


The notation In y (or In x—it is the function that matters, not the variable) is standard 
in calculus courses. After calculus, the base is generally assumed to be e. In most of 
science and engineering, the natura] logarithm is the automatic choice. The symbol 
“exp (x)” means e*, and the truth is that the symbol “log x” generally means In x. 
Base e¢ is understood even without the Ictters In. But in any case of doubt—on a 
calculator key for examplc—the symbol “In x” emphasizes that the base is e. 


THE DERIVATIVES OF @” AND In x 


Come back to derivatives and slopes. The derivative of b* is cb*, and the derivative 
of Jog,y is Licey. Uf h=e then c=1. For all bases, equation (3) is Ic =log,e. 
This gives ¢—the slope of b* at x = 0: 


6E The aumber c is 1\/log,e = log.b. Thus c equals |n b. (5) 


c=Inh ts the mysterious constant that was not available earlier. The slope of 2* is 
In 2 times 2*, The slope of ¢* is In ¢ times e* (but In e = 1). We have the derivatives 
on which this chapter depends: 


6F The derivatives of e* and In y are e* and 1/y. For other bases 


d 
— log,y 


d x x 
an &* = (In bb and rit 


1 
(in by’ 


To make clear that those derivatives come from the functions (and not at all from 
the dummy variables}, we rewrite them using ¢ and x: 
1 


d f —_ of Q d — 
ad =e and a In x= eo (7) 
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Remark on slopes at x = 0: It would be satisfying to see directly that the slope of 2* 
is below 1, and the slope of 4* is above 1. Quick proof: e is between 2 and 4. 
But the idea is to see the slopes graphically. This is a small puzzle, which is fun to 
solve but can be skipped. 

2* rises from | at x = 0 to 2 at x= 1. On that interval its average slope is 1. Its 
slope at the beginning is smaller than average, so it must be less than | —as desired. 
On the other hand 4° rises from 5 at x = —4 to 1 at x = 0. Again the average slope 
is 3/3 = 1. Since x =0 comes at the end of this new interval, the slope of 4* at that 
point exceeds 1. Somewhere between 27 and 4* is e*, which starts out with slope 1. 


This is the graphical approach to e. There is also the infinite series, and a fifth 
definition through integrals which is written here for the record: 


1, e is the number such that e* has slope | at x =0 
2. ¢ is the base for which In y= log, y has slope | at y= 1 


1¥ 
3. eis the mit of (1+ >) asn—> co 
43 


ee ee et | 14 
ee ee ee ee ee ee ee 
4emotntat gy eh Re 


5. the area {{ x~‘ dx equals 1. 


The connections between 1, 2, and 3 have been made. The slopes are | when e is the 
limit of ({ + 1/n)”. Multiplying this out wlll lead to 4, the infinite series in Section 6.6. 
The official definition of In x comes from { dx/x, and then § says that In e= 1. This 
approach to e (Section 6.4) seems less intuitive than the others. 

Figure 6.6b shows the graph of e *. It is the mirror image of e* across the vertical 
axis. Their product is e*e *=1. Where e* grows exponentially, e~* decays 
exponentially—or it grows as x approaches — 00. Their growth and decay are faster 
than any power of x. Exponential growth is more rapid than polynomial growth, so 
that e*/x" goes to infinity (Problem 59). It is the fact that e* has slope e* which keeps 
the function climbing so fast. 


Fig. 6.6 e* grows between 2* and 4*. Decay of e~*, faster decay of e777’. 


The other curve is y= e~*"?. This is the famous “bell-shaped curve” of probability 
theory. After dividing by fn, it gives the aormal distribution, which applies to so 
many averages and so many experiments. The Gallup Poll will be an example in 
Section 8.4, The curve is symmetric around its mean value x = 0, since changing x to 
— x has no effect on x?. 

About two thirds of the area under this curve is between x = — 1 andx=1. If you 
pick points at random below the graph, 2/3 of all samples are expected in that 
interval. The points x = — 2 and x = 2 are “two standard deviations” from the center, 
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enclosing 95% of the area. There is only a 5% chance of landing beyond. The decay 
is even faster than an ordinary exponential, because $x” has replaced x. 


THE DERIVATIVES OF e* AND 6%” 


The slope of e* is e*. This opens up a whole world of functions that calculus can deal 
with. The chain rule gives the slope of e** and e*"* and every e“: 


6G The derivative of e“” is e"™ times du/dx. 


Special case u = cx: The derivative of e™ is ce. 


EXAMPLE 41 The derivative of e** is 3e°* (here c= 3). The derivative of e*!"* is 
e"*cos x (here u=sin x). The derivative of f(u(x)) is df/du times du/dx. Here 
f=e" so df/du =e". The chain rule demands that second factor du/dx. 


EXAMPLE 2 e"?)* is the same as 2*. Its derivative is In 2 times 2*. The chain rule 
rediscovers our constant c = In 2. In the slope of b* it rediscovers the factor c = In b. 


Generally e“* is preferred to the original b*. The derivative just brings down the 
constant c. It is better to agree on e as the base, and put all complications (like c= 
In b) up in the exponent. The second derivative of e™ is c?e. 


EXAMPLE 3 The derivative of e~*’/? is —xe~*”/? (here u= — x?/2 so du/dx = — x). 
EXAMPLE 4 The second derivative of f= e~*/?, by the chain rule and product rule, 


1S 
faire "tea = pee, (10) 


Notice how the exponential survives. With every derivative it is multiplied by more 
factors, but it is still there to dominate growth or decay. The points of inflection, 
where the bell-shaped curve has f” = 0 in equation (10), are x= 1 and x= — 1. 
EXAMPLE5 (u=nIn x). Since e"'"* is x" in disguise, its slope must be nx"~!: 


d n 
= Inx =x" —-| = n—1 
slope = e” rr In x)=x (") ne. (11) 


This slope is correct for all n, integer or not. Chapter 2 produced 3x? and 4x* from 
the binomial theorem. Now nx"~' comes from In and exp and the chain rule. 


EXAMPLE 6 An extreme case is x* = (e'"*)*. Here u= x In x and we need du/dx: 


4 yx) = amin x+x- :) = x*(In x + 1). 
dx x 


INTEGRALS OF e™ AND e” du/dx 


The integral of e* is e*. The integral of e* is not e*. The derivative multiplies by c so 
the integral divides by c. The integral of e* is e/c (plus a constant). 


EXAMPLES |'e2*dx=102*+¢ ya ee 
2 In b 
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! 2 : 
fowrra = ger +C fe~ ? dx > failure 


The first one has c= 2. The second has c= 1n b—remember again that b* = e'?"*. 
The integral divides by In b. In the third one, e°**" is e** times the number e? and 
that number is carried along. Or more likely we see e?**" as e“. The missing du/dx = 
3 is fixed by dividing by 3. The last example fails because du/dx is not there. We 
cannot integrate without du/dx: 


6H The indefinite integral [erga equals e+ C, 


Here are three examples with du/dx and one without it: 


je cos x dx= e™*+C [re dx=e 74+ 


J x -| 
Bop £e Ere +C 
(I +e) 


/x 1+e 


The first is a pure e“du. So is the second. The third has u = /x and du/dx = 1/2,/x, 
so only the factor 2 had to be fixed. The fourth example does not belong with the 
others. It is the integral of du/u*, not the integral of e“du. I don’t know any way to 
tell you which substitution is best—except that the complicated part is 1 + e* and it 
is natural to substitute u. If it works, good. 

Without an extra e* for du/dx, the integral { dx/(1 + e*)? looks bad. But u = 1 + e* 
is still worth trying. It has du = e*dx = (u— 1)dx: 


dx du 1 [... od 
lr lee \(4-d-a) ve 


That last step is “partial fractions.” The integral splits into simpler pieces (explained 
in Section 7.4) and we integrate each piece. Here are three other integrals: 


[evan few + e*)}dx |e “*(4 + @*)dx 


The first can change to ~ f e“du/u?, which is not much better. (It is just as impossible.) 
The second is actually [udu, but I prefer a split: [4e* and fe?* are safer to do 
separately. The third is { (4e~* + 1)dx, which also separates. The exercises offer prac- 
tice in reaching e“du/dx — ready to be integrated. 


Warning about definite integrals When the lower limit is x = 0, there is a natural 
tendency to expect {(0) = 0O—in which case the lower limit contributes nothing. For 
a power f= x? that is true. For an exponential f= e** it is definitely not true, because 


flO) = 1: 
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6.2 EXERCISES 


Read-through questions 


The number eis approximately 3 __. It is the limit of (1 + A) 
to the power __b__. This gives 1.01'°° when h=__c_. An 
equivalent form ts e=lim(_4@__). 


When the base is b = e, the constant c in Section 6.1 is ¢ 
Therefore the derivative of y=e*isdy/dx=__t __. The deriv- 


ative of x =log.y is dx/dy=_9 . The slopes at x =0 and 


y=lare both __h__. The notation for log,yis__|__, which 
isthe J logarithm of y. 
The constant ¢ in the slope of b*isc=__k__. The function 


b* can be rewritten as __!__. Its derivative is om _. The 
derivative of eis __n__. The derivative of e™™* is __ © 


The derivative of e* brings down a factor __p 


The integral of e is_@_ . The integral of e* is ot. 
The integral of e“du/dx is 3 _. In general the integral of 
e“) by itself is__t to find. 


Find the derivatives of the functions in 1-18. 


1 7e™ 2 -—Te"™* 

3 (e*) 4(e*y® 

5 3° 6 tins 

7 (2/3 § 4** 

9 1A +e) 10 eG*s 
11 em* + xn? 12 xe!* 
13 xe -—< 14 xe" — 2xe* + 2e* 
15 —_— 16 e'*" + In(e*’) 


17 e** + sin & 18 x71" (which is e—) 


19 The difference between e and (1+ I/n)" is approximately 
Ce/n. Subtract the calculated values for n = 10, 100, 1000 from 
2.7183 to discover the number C. 


20 By algebra or a calculator find the limits of (1 + 1/n)*" and 
(1+ L/ny* 


21 The limit of (11/10)!'°, (101/100)'°, ... is ¢. So the limit of 
(10/11)!°, (100/101)! ... is _ So the limit of 
(0/11)!", 00/102}'%", ... is . The last sequence is 
(1 — 1 jny. 


22 Compare the number of correct decimals of ¢ for 
(1.001}!°°° and (1.0001)'0°°° and if possible (1.00001)!9°°°°, 
Which power n would give all the decimals in 2.71828? 


23 The function y=e€ solves dy/dx = y. Approximate this 
equation by A¥/Ax=Y, which is Yx+hA)— ¥(x})=AY(x). 
With 4 =; find Y(h) after one step starting from Y{0) = 1. 
What ts ¥(1} after ten steps? 


74 The function that solves dy/dx = — y starting from y= 1 
at x=0 ts . Approximate by Y(x+A)— Y(x)= 
—AY(x). If h=4 what is Y(A) after one step and what is Y(1) 
after four steps? 


25 Invent three functions f, g, h such that for x> 10 
(1 + Ux}® < fx) < & < gfx) < e?* < Ax) <x’. 


26 Graph e* and \/e* at x = —2, —1, 0, 1, 2. Another form 
of \/e* is ; 


Find antiderivatives for the functions in 27-36, 


27 e* +67" 28 (e**)(e”*) 

29 1*4+2%7+3* 36 2°* 

31 (2ey + 2e* 32 (1/e*) + (1/x9 

33 xe" + xe" 34 (sin xje* + (cos xje""* 
35 \/e* +(e) 36 xe* (trial and error) 


37 Compare e * with e~*’. Which one decreases faster near 
x =0? Where do the graphs meet again? When is the ratio of 
e-*" to e~* less than 1/1007 


38 Compare e* with x*: Where do the graphs meet? What 
are their slopes at that point? Divide x* by e* and show that 
the ratio approaches infinity. 


39 Find the tangent line to y =e at x = a. From which point 
on the graph does the tangent line pass through the origin? 


40 By comparing slopes, prove that if x > 0 then 
(aje">1+x (b)e"* > 1—x. 


41 Find the minimum value of y = x* for x > 0, Show from 
d?y/dx* that the curve is concave upward. 


42 Find the slope of y = x'/* and the point where dy/dx = 0, 
Check d?y/dx? to show that the maximum of x'/ is 


43 If dy/dx =y find the derivative of e™*y by the product . 
rule. Deduce that (x) = Ce* for some constant C. 


44 Prove that x* = e* has only one positive solution. 


Evaluate the integrals in 45—54. With infinite limits, 49-50 are 
“improper.” 


1 f.4 
45 | e* dx 46 | sin x e7 dx 
° 0 
1 i 
47 | 2* dx 48 | 277 dx 
=1 io | 
m no 
49 | e-* dx 50 | xe* dx 
0 


1 
$1 | e'** dx 
° 
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55 Integrate the integrals that can be integrated: 
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59 This exercise shows that F(x) = x"/e* +0 as x — 00. 


The maximum of x"/e*, at x =n, is n"/e". 


x 1 
ain x 10 
~ { apt hos eae 4 { ee? (a) Find dF/dx. Notice that F(x) decreases for x >n>0. 


(b} F(2x) = (2x)/e?* = 2"x"e* + e* < t/a" e*, 


e F du/dx 3 Deduce that F(2x)— 0 as x - oo. Thus F(x) + 0. 
{ dujdx *~ | e @ With n=6, graph F(x) = x°/e" on a calculator or com- 
5 puter. Estimate tts maximum. Estimate x when you reach 
Je (F) dx {er “ dx. F(x) = 1. Estimate x when you reach F(x) = 4, 
x 


56 Find a function that solves y‘(x) = 5y(x) with {0} = 2. 
§7 Find a function that solves p(x} = 1/y(x) with (0) = 2. 


58 With electronic help graph the function (1+ 1/x)". What 
are its asymptotes? Why? 


61 Stirling’s formula says that n! ~ /22n n"/e". Use it to esti- 


many decimal digits in 10!? 


6.3 Growth and Decay in Science and Economics 


The derivative of y= e* has taken time and effort. The result was y’ = ce, which 
means that y' = cy. That computation brought others with it, virtually for free—the 
derivatives of b* and x* and e*™), But I want to stay with y’ = cy—which is the most 
important differential equation in applied mathematics. 

Compare y’ = x with y' = y. The first only asks for an antiderivative of x. We quickly 
find y = 4x? + C. The second has dy/dx equal to y itself—which we rewrite as dy/y = 
dx. The integral is In y= x + C. Then y itself is e*e“. Notice that the first solution is 
4x? plus a constant, and the second solution is e* times a constant. 


’ 


There is a way to graph slope x versus slope y. Figure 6.7 shows ‘‘tangent arrows,’ 
which give the slope at each x and y. For parabolas, the arrows grow steeper as x 


fig. 6.7 The slopes are y'= x and y’ = y. The solution curves fit those slopes. 


mate 6°/e® to the nearest whole number. Is it correct? How 


62 x°/e > 0 is also proved by ’Hépital’s rule (at x = 00): 


lim x°/e* = lim 6x5/e? = fill this in = 0, 
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grows—because y’=slope=x. For exponentials, the arrows grow steeper as y 
grows—the equation is y’ = slope = y. Now the arrows are connected by y= Ae*. 
A differential equation gives a field of arrows (slopes). Its solution is a curve that stays 
tangent to the arrows — then the curve has the right slope. 

A field of arrows can show many solutions at once (this comes in a differential 
equations course). Usually a single yo is not sacred. To understand the equation we 
start from many yo—on the left the parabolas stay parallel, on the right the heights 
stay proportional. For y’ = — y all solution curves go to zero. 

From y’ = y it is a short step to y’ = cy. To make c appear in the derivative, put c 
into the exponent. The derivative of y = e™ is ce“, which is c times y. We have reached 
the key equation, which comes with an initial condition—a starting value yo: 


dy/dt = cy with y= yo at t= 0. (1) 


A small change: x has switched to t. In most applications time is the natural variable, 
rather than space. The factor c becomes the “growth rate” or “decay rate’—and e™ 
converts to e™. 

The last step is to match the initial condition. The problem requires y= yo at 
t=0. Our e starts from e“°=1. The constant of integration is needed now—the 
solutions are y = Ae“. By choosing A = yo, we match the initial condition and solve 
equation (1). The formula to remember is yoe". 


61 The exponential law y = yoe solves y' = cy starting from yo. 


The rate of growth or decay is c. May I call your attention to a basic fact? The 
formula yoe" contains three quantities yo,c,t. If two of them are given, plus one 
additional piece of information, the third is determined. Many applications have one 
of these three forms: find t, find c, find yo. 


1. Find the doubling time T if c= 1/10. At that time ype“? equals 2yo: 
: in2. .7 

e'=2 yields cT=In 2 so that T= 7 eS, 7 (2) 
The question asks for an exponent T. The answer involves logarithms. If a cell grows 
at a continuous rate of c= 10% per day, it takes about .7/.1 = 7 days to double in 
size. (Note that .7 is close to In 2.) Ifa savings account earns 10% continuous interest, 
it doubles in 7 years. 

In this problem we knew c. In the next problem we know T. 


2. Find the decay constant c for carbon-14 if y=4y9 in T= 5568 years. 
e? =4 yields cT=In 4 so that c x (In 4)/5568. (3) 


After the half-life T= 5568, the factor e*’ equals $. Now c is negative (In } = — In 2). 
Question 1 was about growth. Question 2 was about decay. Both answers found 
e' as the ratio y(T)/y(0). Then cT is its logarithm. Note how c sticks to T. 
T has the units of time, c has the units of *1/time.”’ 
Main point: The doubling time is (In 2)/c, because cT= In 2. The time to multiply 
by e is I/c. The time to multiply by 10 is (In 10)/c. The time to divide by e is —1/c, 
when a negative c brings decay. 


3. Find the initial value yo if c= 2 and y(1)=5: 
Wt) = yoe" yields yo = y(the™* = 5e~?. 
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2 ¥o y=yoe" (1.0513)20 


Se compound 


continuously 


$2 ys e 05t 2 
¥ y=Yo 
$] yr 1 + .05¢ 
4% y=yoe simple interest 


ap » f 
cT=In2 5 10 15 20 years 


Aig. 6.8 Growth (c > 0} and decay (¢ <0). Doubling time T = (In 2)/c. Future value at 5%. 


All we do is run the process backward. Start from 5 and go back to yo. With time 
reversed, & becomes e~“. The product of e”? and e ? is 1—growth forward and 
decay backward. 

Equally important is T+ t. Go forward to time T and go on to T + t: 


A T+ t) is yo? *9 which is (yoe7)e*. (4) 


Every step t, at the start or later, multiplies by the same ¢'. This uses the fundamental 
property of exponentials, that e7 *' = eT e. 


EXAMPLE 4 Population growth from birth rate b and death rate d (both constant): 
dy/dt=by—dy=cy (the net rate is c=b-d). 


The population in this model is yoe* = yoe”e “. It grows when b> d (which makes 
c > Q). One estimate of the growth rate is c = 0.02/year: 
: : In2 7 
The earth’s population doubles in about T= —— = 0 = 35 years. 

First comment: We predict the future based on c. We count the past population 
to find c. Changes in ¢ are a serious problem for this model. 

Second comment: ype“ is not a whole number. You may prefer to think of bacteria 
instead of people. (This section begins a major application of matkematics to economics 
and the life sciences.) Malthus based his theory of human population on this equation 
y’ = cy—and with large numbers a fraction of a person doesn’t matter so much. To 
use calculus we go from discrete to continuous. The theory must fail when t is very 
large, since populations cannot grow exponentially forever. Section 6.5 introduces the 
logistic equation y' = cy — by”, with a competition term — by? to slow the growth. 

Third comment: The dimensions of b, c, d are “1/time.” The dictionary gives birth 
rate = number of births per person in a unit of time. It is a relative rate—people 
divided by people and time. The product ct is dimensionless and ¢ makes sense {also 
dimensionless). Some texts replace c by 2 (lambda). Then 1/4 is the growth time or 
decay time or drug elimination time or diffusion time. 


EXAMPLE 2 Radioactive dating A gram of charcoal] from the cave paintings in 
France gives 0.97 disintegrations per minute. A gram of living wood gives 6.68 disin- 
tegrations per minute. Find the age of those Lascaux paintings. 


The charcoal stopped adding radiocarbon when it was burned (at t= 0). The 
amount has decayed to yoe". In living wood this amount is still yp, because cosmic 
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rays maintain the balance. Their ratio is e* = 0.97/6.68. Knowing the decay rate c 
from Question 2 above, we know the present time t: 


0.97 ; _ 5568, (0.97) _ 
ct=I)n (22) yields t= Sina = 14,400 years. 


Here is a related problem—the age of uranium. Right now there is 140 times as much 
U-238 as U-235, Nearly equal amounts were created, with half-lives of (4.5)10° and 
(0.7)10° years. Question. How long since uranium was created? Answer: Find t by 
substituting c = (In $)/(4.5)10° and C = (In $)/(0.7)10°: 


In 140 


e/e'=140 = ct—Ctr=In 140 > t= 
e-C 


= 6(10°) years. 


EXAMPLE 3 Calculus in Economics: price inflation and the value of money 


We begin with two inflation rates —- a continuous rate and an annual rate. For the 
price change Ay over a year, use the annual rate: 


Ay = (annual rate) times (y) times (Ap). (5) 
Calculus applies the continuous rate to each instant dt. The price change is dy: 
dy = {continuous rate) times (y) times (dt). (6) 
Dividing by dt, this is a differential equation for the price: 
dy/dt = (continuous rate) times (y) = -O5y. 
The solution is yoe°™. Set r= 1. Then e°* = 1.0513 and the annual rate is 5.13%. 


When you ask a bank what interest they pay, they give both rates: 8% and 8.33%. 
The higher one they call the “effective rate.” It comes from compounding (and depends 
how often they do it). If the compounding is continuous, every dt brings an increase 
of dy—and e°°® is near 1.0833. 

Section 6.6 returns to compound interest. The interval drops from a month to a 
day to a second. That leads to (1 + L/n}", and in the limit to e. Here we compute the 
effect of 5% continuous interest: 


Future value A dollar now has the same value as e'°*? dollars in T years. 
Present value A dollar in T years has the same value as e °°? dollars now. 
Doubling time Prices double (¢°*7 = 2) in T= In 2/.05 = 14 years. 


With no compounding, the doubling time is 20 years. Simple interest adds on 20 
times 5% = 100%. With continuous compounding the time is reduced by the factor 
In 2 = .7, regardless of the interest rate. 


EXAMPLE 4 In 1626 the Indians sold Manhattan for $24. Our calculations indicate 
that they knew what they were doing. Assuming 8% compound interest, the original 
$24 is multiplied by ¢°*'. After t= 365 years the multiplier is e?°? and the $24 has 
grown to [15 trillion dollars. With that much money they could buy back the land 
and pay off the national debt. 

This seems farfetched. Possibly there is a big flaw in the model. It is absolutely 
true that Ben Franklin left money to Boston and Philadelphia, to be invested for 200 
years. In 1990 it yielded millions (not trillions, that takes longer). Our next step is a 
new model. 
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Question How can you estimate e?9? with a $24 calculator (log but not In)? 
Answer Muitiply 29.2 by log,,e = .434 to get 12.7. This is the exponent to base 10. 
After that base change, we have 101?-? or more than a trillion. 


GROWTH OR DECAY WITH A SOURCE TERM 


The equation y’ = y will be given a new term. Up to now, all growth or decay has 
started from yg. No deposit or withdrawal was made later. The investment grew by 
itself—a pure exponential. The new term s allows you to add or subtract from the 
account, It is a “source”’—or a “sink” if s is negative. The source s=5 adds 5d¢, 
proportional to dt but not to y: 


Constant source: dy/dt = y + 5 starting from y= yo. 


Notice y on both sides! My first guess y = e'*> failed completely. Its derivative is e'* > 


again, which is not y + 5. The class suggested y = e' + St. But its derivative e' + 5 is 

still not y+ 5. We tried other ways to produce 5 in dy/dt. This idea is doomed to 

failure. Finally we thought of y= Ae‘ — 5. That has y = Ae’ = y + 5 as required. 
Important: A és aot yp. Set t = 0 to find yg = A — 5. The source contributes 5e‘ — 5: 


The solution is (yo + 5)e' — 5. That is the same as yge' + S{e' — 1). 


s= 5 multiplies the growth term e' — | that starts at zero. yge* grows as hefore. 


EXAMPLES dy/dt = — y+ 5 has y=(yo — Sje"' +5. This is ye~' + S(1 —e~). 


That final term from the soutce is still positive. The other term ype~' decays to zero. 
The limit as t > co is y,. = 5. A negative c leads to a steady state y... 


Based on these examples with c= 1 and c= -- 1, we can find y for any c and s. 
d 
EQUATION WITH SOURCE 7 =cy+s starts from y= yp at t= 0. (7) 


The source could be a deposit of s = $1000/year, after an initial investment of yo = 
$8000. Or we can withdraw funds at s = — $200/year. The units are “dollars per year” 
to match dy/dt. The equation feeds in $1000 or removes $200 continuous!y—not all 
at once. 

Note again that y = e“**" is not a solution. Its derivative is (c + s)y. The combina- 
tion y=e"+ s is also not a solution (but closer). The analysis of y =cy+ s will be 
our main achievement for differential equations (in this section). The equation is not 
restricted to finance—far from it—but that produces excellent examples. 

I propose to find y in four ways. You may feel that one way is enough.f The first 
way is the fastest—only three lines—but please give the others a chance. There is no 
point in preparing for real problems if we don’t solve them. 


Solution by Method 1 {fast way) Substitute the combination y = Ae‘ + B. The selu- 
tion has this forrm—-exponential plus constant, From two facts we find A and B: 


the equation y =cy +s gives cAe“ = c(Ae" + B)+s 
the initial value at = 0 gives A+ B= yp. 


tMy class says one way is more than enough. They just want the answer. Sometimes | cave 
in and wote down the formula: » is yoe™ plus s(e* — 1)/c from the source term. 


Ve 


0 Se -5 
fi 
ig Oe! -5 
10 5et+5 
5 5 = Yoo 
6 Set +5 
Fig. 6.9 
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The first line has cAe* on both sides. Subtraction leaves cB +s=0, of B= — s/c. 
Then the second line becomes A = yo — B= yo + (s/c): 


KEY FORMULA y= (>. + SY : or = -y=yoe™+ ~(e— 1). (8) 


With s=0 this is the old solution y,e* (no source). The example with c=1 and 
s=5 produced (yy + 5)e' — 5. Separating the source term gives ype’ + S(e' — 1). 


Solution by Method 2 (slow way) The input yo produces the output y,e*. After ¢ 
years any deposit is multiplied by e". That also applies to deposits made after the 
account is opened. If the deposit enters at time T, the growing time is only ¢— T 
Therefore the multiplying factor is only e“~™. This growth factor applies to the smail 
deposit (amount sd4T) made between time Tand T + dT. 

Now add up all outputs at time t. The output from yp is yoe". The smail deposit 
sdT near time T grows to ¢“~"s dT. The total is an integral: 


t 
y(t) = yoe" + &!—) sdT. (9) 
r=0 
This principle of Duhamel would still apply when the source s varies with time. 
Here s is constant, and the integral divides by c: 
t _ a s 8g 


ss} et Naps] = — 2426, (10) 
a coe 


That agrees with the source term from Method 1, at the end of equation (8). There 
we looked for “exponential plus constant,” here we added up outputs. 

Method 1 was easier. It succeeded because we knew the form Ae‘ + B—with 
“undetermined coefficients.’”’ Method 2 is more complete. The form for y is part of 
the output, not the input. The source s is a continuous supply of new deposits, all 
growing separately. Section 6.5 starts from scratch, by directly integrating y’ = cy + s. 


Remark Method 2 is often described in terms of an integrating factor. First write 
the equation as y’— cy = s. Then multiply by a magic factor that makes integration 
possible: 


{y’ — cyje“* = se“ multiply by the factor e~“ 
ye" |i =o ck integrate both sides 
ye t~ yo= — =e“ 1} substitute 0 and t 


pyre y+ “te —1) isolate y to reach formula (8) 


The integrating factor produced a perfect derivative in line 1. I prefer Duhamel’s idea, 
that all inputs yo and s grow the same way. Either method gives formula (8) for y. 


THE MATHEMATICS OF FINANCE (AT A CONTINUOUS RATE) 


The question from finance is this: What inputs give what outputs? The inputs can 
come at the start by yp, or continuously by s. The output can be paid at the end or 
continuously. There are six basic questions, two of which are already answered. 
The future value is yge" from a deposit of yp. To produce y in the future, deposit 
the present value ye~%. Questions 3-6 involve the source term 5. We fix the continuous 
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rate at 5% per year (c = .05), and start the account from yp = 0. The answers come 
fast from equation (8). 


Question 3 With deposits of s = $1000/year, how large is y after 20 years? 


ye fe ie we (e.0n29 ~ 1) = 20,000(e — 1) = $34,400. 


One big deposit yields 20,000e ~ $54,000. The same 20,000 via s yields $34,400. 

Notice a small by-product (for mathematicians). When the interest rate is c= 0, 
our formula s(e‘— 1)/c turns into 0/0. We are absolutely sure that depositing 
$1000/year with no interest produces $20,000 after 20 years. But this is not obvious 
from 0/0. By ’H6pital’s rule we take c-derivatives in the fraction: 


ct 
fine eo = = st, This is (1000)(20) = 20,000. (11) 


c70 ¢c cd 


Question 4 What continuous deposit of s per year yields $20,000 after 20 years? 
= 5 (96.0542) _ i _ 1000 
20,000 05 (e 1) requires s = oy 582. 


Deposits of $582 over 20 years total $11,640. A single deposit of yg = 20,000/e = 
$7,360 produces the same $20,000 at the end. Better to be rich at t= 0. 


Questions 1 and 2 had s=0 (no source). Questions 3 and 4 had y, = 0 (no initial 
deposit). Now we come to y=0. In 5, everything is paid out by an annuity. In 6, 
everything is paid up on a foan. 


Question5 What deposit yo provides $1000/year for 20 years? End with »=0. 
$ : —s§ 2 
y= yoe' + ae — 1) =0 requires yo = —O -e"), 


Substituting s= — 1000, c= .05, t = 20 gives yo ~ 12,640. If you win $20,000 in a 
lottery, and it is paid over 20 years, the lottery only has to put in $12,640. Even less 
if the interest rate is above 5%. 


Question é What payments s will clear a loan of yo = $20,000 in 20 years? 


Unfortunately, s exceeds $1000 per year. The bank gives up more than the $20,000 
to buy your car (and pay tuition). ft also gives up the interest on that money. You pay 
that back too, but you don’t have to stay even at every moment. instead you repay 
at a constant rate for 20 years. Your payments mostly cover interest at the start and 
principal at the end. After t = 20 years you are even and your debt is y = 0. 

This is like Question 5 (also y= 0), but now we know yp, and we want s: 


y= yor + °(e* — 1) =0 requires s = — cyoe"e"— 1), 


The loan is yo = $20,000, the rate is c = .05/year, the time is t = 20 years. Substituting 
in the formula for s, your payments are $1582 per year. 


Puzzle How is s = $1582 for loan payments related to s = $582 for deposits? 
0 — $582 per year > $20,000 and $20,000 > — $1582 per year > 0. 


6.3 Growth and Decay in Science and Economics 
That difference of exactly 1000 cannot be an accident. 1582 and 582 came from 
1000—"— and 1000 —— with difference 1000 £—! = 1000. 
e-1 4-1 e~ 1 


Why? Here is the real reason. Instead of repaying 1582 we can pay only 1000 (to 
keep even with the interest on 20,000). The other 582 goes into a separate account. 
After 20 years the continuous 582 has built up to 20,000 (including interest as in 
Question 4). From that account we pay back the loan. 

Section 6.6 deals with daily compounding—which differs from continuous com- 
pounding by only a few cents. Yearly compounding differs by a few dollars. 


s=-1000 20 


Fig. 6.40 Questions 3-4 deposit s. Questions 5-6 repay loan or annuity. Steady state —s/c. 


TRANSIENTS VS. STEADY STATE 


Suppose there is decay instead of growth. The constant c is negative and ye“ dies 
out. That is the “transient” term, which disappears as t > co. What is left is the 
“steady state.” We denote that limit by y,,. 

Without a source, y,, is zero (total decay). When s is present, y,, = — s/c: 


6Jj The solution y= (ve + =e = : approaches y, = — : when e+ 0. 


At this steady state, the source s exactly balances the decay cy. In other words 
cy +s=0. From the left side of the differential equation, this means dy/dt = 0. There 
is no change. That is why y,, is steady. 

Notice that y,, depends on the source and on c—but not on yo. 


EXAMPLE 6 Suppose Bermuda has a birth rate b = .02 and death rate d= .03. The 
net decay rate is c= — .01. There is also immigration from outside, of s = 1200/year. 
The initial population might be yo = 5 thousand or yp = 5 million, but that number 
has no effect on y,,. The steady state is independent of y,. 


In this case y,, = — s/c = 1200/.01 = 120,000. The population grows to 120,000 if 
Yo is smaller. It decays to 120,000 if yo is larger. 


EXAMPLE 7 Newton’s Law of Cooling: dy/dt = c(y— y.,). (12) 


This is back to physics. The temperature of a body is y. The temperature around it 
is y,.. Then y starts at yp and approaches y,,, following Newton’s rule: The rate is 
proportional to y — y,,. The bigger the difference, the faster heat flows. 

The equation has —cy,, where before we had s. That fits with y,, = — s/c. For the 
solution, replace s by —cy,, in formula (8). Or use this new method: 
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Solution by Method 3 The new idea is to look at the difference y — y,, . Its derivative 
is dy/dt, since y,, is constant. But dyp/dt is c(y — y.)—this is our equation. The differ- 
ence starts from jp ~ y,,., and grows or decays as a pure exponential: 


d ; 
FAC —Yal=AV— Yo) has the solution (¥— ¥o)= (Vo Yale" (13). 


This solves the law of cooling. We repeat Method 3 using the letters s and c: 


d 8 $ s s 
eats =-|= =- i, —j= + — |e"~” 
Al + *) (» + ‘) has the solution (» + *) (v6 *)e (14) 


Moving s/c to the right side recovers formula (8). There is a constant term and an 
exponential term. In a differential equations course, those are the “particular solution” 
and the “Aomogeneous solution.” In a calculus course, it’s time to stop. 


EXAMPLE 8 In a 70° room, Newton's corpse is found with a temperature of 90°. A 
day later the body registers 80°. When did he stop integrating (at 98.6°)? 


Solution Here y,, = 70 and yg = 90. Newton’s equation (13) is y= 20e7 + 70. Then 
y= 80 at t= 1 gives 20e = 10. The rate of cooling is c= In 4. Death occurred when 
20e* + 70 = 98.6 or ¢' = 1.43. The time was t =n 1.43/In 4 = half a day earlier. 


6.3 EXERCISES 


Read-through exercises 
If y =cy then xt)=_9__. If dy/dt=7y and yo =4 then 


yitj=_ >. This solution reaches 8atr=_ ¢ _. If the dov- 
bling time is Tthenc=_¢_. If y’ =3y and {1} =9 then yo 
was _¢ . When c is negative, the solution approaches 


1 ast—+o. 


The constant solution to dy/dt=y+6is y=_9 . The 
general solution is y = Ae’ — 6. If yo =4 then A=____. The 
solution of dy/dt =cy +s starting from yp is y= Ae? + B= 

i _. The output from the source 5 is __J__. An input at 
time T grows by the factor k __ at time ¢. 


At c=10%, the interest in time dt is dy=__!_. This 
equation yields y(t)=__™__. With a source term instead of 
Yor a Continuous deposit of s=4000/year yields y=__n 


after 10 years. The deposit required to produce 10,000 in 10 
years is s=__ © (exactly or approximately). An income of 
4000/year forever (!) comes from yp =__P__. The deposit to 


give 4000/year for 20 years is yp=__a__. The payment rate 
s to clear a loan of 10,000 in 10 years is__¢ 


The solution to y= — 3y +s approaches y,,=__$ 


Solve 1—4 starting from y, = 1 and from y, = — t. Draw both 
solutions on the same graph. 


dy dy dy 
1—= = 3 = 
2t 2 tt t 7 2y 


Solve 5-8 starting from y, = 10. At what time does y increase 
to 100 or drop to 1? 
dy dy dy dy 
5 = ‘== 7 ~ = ¢tt 8 ~=e-4 
at 4y t * a a 
9 Draw a field of “tangent arrows” for y’ = — y, with the 
solution curves y= e * and y= —e*. 


10 Draw a direction field of arrows for y’ = y— 1, with solu- 
tion curves y=e" +1] and p=1, 
Problems 11-27 involve yje*. They ask for c or t or yo. 


11 If a culture of bacteria doubles in two hours, how many 
hours to multiply by 10? First find c. 


12 If bacteria increase by factor of ten in ten hours, how 
many hours to increase by 100? What is c? 


13 How old is a skull that contains $ as much radiocarbon 
as a modern skull? 


14 Ifa relic contains 90% as much radiocarbon as new mate- 
rial, could it come from the time of Christ? 


15 The population of Cairo grew from 5 million to 10 million 
in 20 years. From y =cy find c. When was y =8 million? 


16 The populations of New York and Los Angeles are grow- 
ing at 1% and 1.4% a year. Starting from 8 million (NY) and 
6 million (LA), when will they be equal? 
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17 Suppose the value of $1 in Japanese yen decreases at 2% 
per year. Starting from $1 = ¥240, when will 1 dollar equal t 
yen? 


18 The effect of advertising decays exponentially. If 40% 
remember a new product after three days, find c. How long 
will 20% remember it? 


19 If y= 1000 at t=3 and y=3000 at :=4 (exponential 
growth), what was yp at t= 0? 


20 If y= 100 att =4 and y=10 at t= 8 (exponential decay} 
when will y= 1? What was yo? 


2% Atmospheric pressure decreases with height according to 
dp/dh = cp. The pressures at h =O (sea level} and A = 20 kan 
are 1013 and 450 millibars. Find c. Explain why p= 
»/ 1013-50 halfway up at A= 10. 


22 For exponential decay show that {t) is the square root of 
(0) times {2t). How could you find (31) from y(t) and ){21)? 


23 Most drugs in the bloodstream decay by y’ =cy (first- 
order kinetics). (a) The half-life of morphine is 3 hours. Find 
its decay constant c (with units}. (b} The half-tife of nicotine 
is 2 hours. After a six-hour flight what fraction remains? 


24 How often should a drug be taken if its dose is 3 mg, it is 
cleared at c =.01/hour, and i mg is required in the blood- 
stream at all times? (The doctor decides this level based on 
body size.) 
25 The antiseizure drug dilantin has constant clearance rate 
= —auntil y=y,. Then y = — ay/y,. Solve for ft) in two 

pieces from yo. When does y reach y,? 
26 The actual elimination of nicotine is multiexponential: y = 
Ae* + Be“, The first-order equation (d/dt — c)y = 0 changes 
to the second-order equation (d/dt — c)(d/dt — C)y = 0. Write 
out this equation starting with py”, and show that it is satisfied 
by the given y. 
27 True or false. If false, say what's true. 

(a) The time for y = e* to double is (in 2)/(n c). 

(b) If y’ = cy and 2’ = cz then (y + 2) = 2cly + 2). 

(c) If y’ =cy and z' =cz then (y/z) =0. 

(d} If y =cy and z’ = Cz then (yz)’ = (c + C)yz. 
28 A rocket has velocity ». Burnt fuel of mass Am leaves at 
velocity vy — 7. Total momentum is constant: 


my = (m — Am)(v + Av) + Amv — 7}. 


What differential equation connects m to v? Solve for x(n) not 
w(t}, starting from vy = 20 and my = 4. 


Problems 29-36 are about solutions of py’ =cy +s. 


29 Solve y'=3y+1 with yo =0 by assuming y= Ae" +B 
and determining A and B. 


W Solve y’ = 8— y starting from yo and y = Ae' +8. 


Solve 31-34 with yo =0 and graph the solution. 


dy dy 

31a =ytl 32 Tat ie 
dy dy 

33 Fee salir ia 


35 (a) What value y = constant solves dy/dt = — 2y + 12? 
(b) Find the solution with an arbitrary constant A. 
(c) What solutions start from yp = 0 and yp = 10? 
(d) What is the steady state y,.? 
following results starting from y, = 1. Draw graphs. 
(a) y increases to 00 (b) y increases to 2 
{c} y decreases to —2 ({d) y decreases to — oo 


36 Choose + signs in dy/dt=+3y+6 to achieve the 


37 What value y = constant solves dy/dt = 4 — y? Show that 
yt) = Ae! +4 is also a solution. Find {1} and y,, if yo = 3. 


38 Solve y=y+e' from yo=0 by Method 2, where the 
deposit ¢? at time Tis multiplied by ¢~7. The total output 
at time t is X(t) = [Ue e" 7 dT= . Substitute back to 
check y= y+et. 


39 Rewrite y'=y+e' as y'—y=e'. Multiplying by e~', the 
left side is the derivative of . Integrate both sides 
from yp = 0 to find x0). 


40 Solve y’ = —y+1 from yo =0 by rewriting as y’ + y=1, 
multiplying by ¢, and integrating both sides. 


41 Solve y'= y+ from yo = 0 by assuming y = Ae + Br+C. 


Problems 42-57 are about the matbematics of finance. 


42 Dollar bills decrease in value at c = — .04 per year because 
of inflation. If you hold $1000, what is the decrease in dt 
years? At what rate s should you print money to keep even? 


43 If a bank offers annual interest of 74% or continuous 
interest of 74%, which is better? 


44 What continuous interest rate is equivalent to an annual 
rate of 9%? Extra credit: Telephone a bank for both rates 
and check their calculation. 


45 At 100% interest (c = 1) how much is a continuous deposit 
of s per year worth after one year? What initial deposit yo 
would have produced the same output? 


46 To have $50,000 for college tuition in 20 years, what gift 
yg Should a grandparent make now? Assume c = 10%. What 
continuous deposit should a parent make during 20 years? If 
the parent saves s = $1000 per year, when does he or she reach 
$50,000 and retire? 
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47 Income per person grows 3%, the population grows 2%, 
the total income grows . Answer if these are (a) 
annual rates (b) continuous rates. 


48 When dy/dt =cy+4, how much ts the deposit of 4dT at 
time T worth at the later time ¢? What is the value at ¢ = 2 of 
deposits 4dT from T=0 to T= 1? 


49 Depositing s= $1000 per year leads to $34,400 after 20 
years (Question 3). To reach the same result, when should you 
deposit $20,000 al! at once? 


50 For how long can you withdraw s =$500/year after 
depositing }'p = $5000 at 8%, before you run dry? 


51 What continuous payment s clears a $1000 loan in 60 
days, if a loan shark charges 1% per day continuously? 


§2 You are the loan shark. What is $1 worth after a year of 
continuous compounding at 1% per day? 


§3 You can afford payments of s=$100 per month for 48 
months. If the dealer charges ¢ = 6%. how much can you 
borrow? 


84 Your income is {et per year. Your expenses are Eye" 


per year. (a) At what future time are they equal? {b) If you 
borrow the difference until then, how much money have you 
borrowed? 


55 Ifa student loan in your freshman year is repaid plus 20% 
four years later, what was the effective interest rate? 


56 Is a variable rate mortgage with ¢ =.09 + .00!r for 20 
years better or worse than a fixed rate of 10%? 


57 At 10% instead of 8%, the $24 paid for Manhattan is 
worth after 365 years. 


6.4 Logarithms 


Problems 58-65 approach a steady state y, as f—> x. 
58 If dyfdt = —y+7 what is ,? What is the derivative of 
yoy, ? Then y—y, equals yo —¥, times 
59 Graph yt) when )’ = 3y — 12 and yy Is 
(a) below 4 


(b} equal to 4 (c) above 4 


60 The solutions to dy/dt =e(y— 12) converge to }., = 
provided ¢ ts 


61 Suppose the time unit in dy/dt = cy changes from minutes 
to hours. How does the equation change? How does dy/dt = 
~ y+ 5 change? How does y, change? 
62 True or false, when }, and y, both satisfy y =cy +s. 
(a) The sum y= y, + \'2 also satisfies this equation. 
{b} The average y = 3(}, + }2) satisfies the same equution., 
{c) The derivative y= |, satisfies the sane equation. 


63 If Newton's coflee cools from 80° to 60° in 12 minutes 
{room temperature 20°}, ind c. When was the coffee at 100 ? 


64 If yo = 100 and Wt) =90 and 4{2) = 84, what is 5 ,? 
65 If yg = 100 and y(1) = 90 and 3{2) = 81, what is y,? 
66 To cool down collee, should you add milk now or later? 


The coflee is at 70°C, the milk is at 10°, the room is at 20°. 


(a) Adding 1 part milk to 5 parts coffee makes it 60°. With 
y, = 20°, the white coffee cools to v(t) = 


. The milk 
Mixing at time ¢ gives 


(b) The black coffee cools to rit} = 
warms to ui(fh— 
(5y.+7,,6= 


We have given first place to e* and a lower place to In x. In applications that is 
absolutely correct. But logarithms have one important theoretical advantage (plus 
many applications of their own). The advantage is that the derivative of In x is 1x. 
whereas the derivative of e* is e*. We can’t define e* as its own integral, without 
circular reasoning. But we can and do define In x (the natural logarithm) as the 


integral of the **— 1 power” which is L/x: 


ii y= [ae (1) 


Note the dummy variables. first x then u. Note also the live variables, first x then y. 
Especially note the lower limit of integration, which is | and not 0. The logarithm is 
the area measured from |. Therefore In 1 = 0 at that starting point—as required. 


6.4 Logarithms 


Earlier chapters integrated all powers except this ‘““—1 power.” The logarithm is 
that missing integral. The curve in Figure 6.11 has height y = 1/x—it is a hyperbola. 
At x=0 the height goes to infinity and the area becomes infinite: log 0= — co. 
The minus sign is because the integral goes backward from 1 to 0. The integral 
does not extend past zero to negative x. We are defining In x only for x > 0.t 


) 1/2 1 2 4 
Fig. 6.44 Logarithm as area. Neighbors In a + In b = In ab. Equal areas: —In} =In 2=41n 4. 


With this new approach, In x has a direct definition. Jt is an integral (or an area). 
Its two key properties must follow from this definition. That step is a beautiful 
application of the theory behind integrals. 


Property 1: \nab=\na+\Inb. The areas from 1 to a and from a to ab combine into 
a single area (1 to ab in the middle figure): 


2& 


a ab ab 
Neighboring areas: | : dx + | : dx = | d te. (2) 


The right side is In ab, from definition (1). The first term on the left is In a. The 
problem is to show that the second integral (a to ab) is In b: 


ab } ” by 
—dx = —du = Inb. (3) 
a x 1 U 


We need u= 1 when x = a (the lower limit) and u= b when x = ab (the upper limit). 
The choice u = x/a satisfies these requirements. Substituting x = au and dx =a du 
yields dx/x = du/u. Equation (3) gives In b, and equation (2) is Ina +1n b=In ab. 


Property 2: \n b"=nI\nb. These are the left and right sides of 


| ”) *y 
| —dx = n| — du. (4) 
4 1U 


This comes from the substitution x = u". The lower limit x = 1 corresponds to u= 1, 
and x = b" corresponds to u=b. The differential dx is nu"~'du. Dividing by x = u" 
leaves dx/x =n du/u. Then equation (4) becomes In b" = n In b. 


Everything comes logically from the definition as an area. Also definite integrals: 


3x 
EXAMPLE 4 Compute | 


x 


zt. Solution: In 3x — In x= in =In 3. 


1 
EXAMPLE 2 Compute | ~ dx Solution: In 1 —In .1 =1In 10. (Why?) 
ee 


+The logarithm of —1 is mi (an imaginary number). That is because e*! = —1. The logarithm 
of i is also imaginary—it is 47i. In general, logarithms are complex numbers. 
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e 1 
EXAMPLE 3 Compute | hoses Solution: In e? = 2. The area from 1 to e? is 2. 


1 


Remark While working on the theory this is a chance to straighten out old debts. 
The book has discussed and computed (and even differentiated) the functions e* and 
b* and x", without defining them properly. When the exponent is an irrational number 
like x, how do we multiply e by itself x times? One approach (not taken) is to come 
closer and closer to z by rational exponents like 22/7. Another approach (taken now) 
is to determine the number e* = 23.1... by its logarithm.+ Start with e itself: 


e is (by definition) the number whose logarithm is 1 
e” is (by definition) the number whose logarithm is z. 


When the area in Figure 6.12 reaches 1, the basepoint is e. When the area reaches 7, 
the basepoint is e*. We are constructing the inverse function (which is e*). But how 
do we know that the area reaches x or 1000 or —1000 at exactly one point? (The 
area is 1000 far out at e'°°°. The area is —1000 very near zero at e~ '°°°.) To define 
e we have to know that somewhere the area equals 1! 

For a proof in two steps, go back to Figure 6.11c. The area from 1 to 2 is more 
than 4 (because 1/x is more than 4 on that interval of length one). The combined area 
from 1 to 4 is more than 1. We come to area=1 before reaching 4. (Actually at 
e=2.718....) Since 1/x is positive, the area is increasing and never comes back to 1. 

To double the area we have to square the distance. The logarithm creeps upwards: 


Inx—co but 2* 0, (5) 


The logarithm grows slowly because e* grows so fast (and vice versa—they are 
inverses). Remember that e* goes past every power x”. Therefore In x is passed by 
every root x!/". Problems 60 and 61 give two proofs that (In x)/x'" approaches zero. 

We might compare In x with «J & At x = 10 they are close (2.3 versus 3.2). But out 
at x = e!° the comparison is 10 against e*, and In x loses to J x. 


e | e l e 


Fig. 6.42 Area is logarithm of basepoint. Fig. 6.43 In x grows more slowly 
than x. 


+Chapter 9 goes on to imaginary exponents, and proves the remarkable formula e™ = — 1. 
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APPROXIMATION OF LOGARITHMS 


The limiting cases In 0 = — o0 and In «0 = + oo are important. More important are 
logarithms near the starting point In 1 = 0. Our question is: What is In (1 + x) for x 
near zero? The exact answer is an area. The approximate answer is much simpler. 
If x (positive or negative) is small, then 


In (l+x)2x and el x, (6) 


= oa The calculator gives In 1.01 = .0099503. This is close to x = .01. Between 1 and 1 + x 


y=e% the area under the graph of 1/x is nearly a rectangle. Its base is x and its height is 1. 


Sa 2 
1am trea x'/2 So the curved area In (1 + x) is close to the rectangular area x. Figure 6.14 shows 


area x how a small triangle is chopped off at the top. 

ye The difference between .0099503 (actual) and .01 (linear approximation) is 
— .0000497. That is predicted almost exactly by the second derivative: 4 times (Ax)? 

Fig. 6.14 times (In x)” is $(.01)?(—1) = — .00005. This is the area of the small triangle! 


In(1 + x) = rectangular area minus triangular area = x — 4x’. 


The remaining mistake of .0000003 is close to 4x3 (Problem 65). 

May I switch to e*? Its slope starts at e° = 1, so its linear approximation is 1 + x. 
Then In (e*) = In (1 + x) + x. Two wrongs do make a right: In (e*) = x exactly. 

The calculator gives e°’ as 1.0100502 (actual) instead of 1.01 (approximation). The 
second-order correction is again a small triangle: x? = .00005. The complete series 
for In (1 + x) and e* are in Sections 10.1 and 6.6: 


In (1+ x)=x— x?/2+x3/3-... e=1+x+x7/2+x3/6+.... 
DERIVATIVES BASED ON LOGARITHMS 


Logarithms turn up as antiderivatives very often. To build up a collection of integrals, 
we now differentiate In u(x) by the chain rule. 


The slope of In x was hard work in Section 6.2. With its new definition (the integral 
of 1/x) the work is gone. By the Fundamental Theorem, the slope must be 1/x. 

For In u(x) the derivative comes from the chain rule. The inside function is u, the 
outside function is In. (Keep u > 0 to define In u.) The chain rule gives 


# ia cx = Diaby ae im Gita e 

dx cx x dx x 

4 in (x2 +1) = 2x/(x? +1) Lr ten % 
dx dx OS xX 

d d 1 

qx Be ele =I qx it (in x) = 


Those are worth another look, especially the first. Any reasonable person would 
expect the slope of In 3x to be 3/x. Not so. The 3 cancels, and In 3x has the same 
slope as In x. (The real reason is that In 3x = In 3 + In x.) The antiderivative of 3/x is 
not In 3x but 3 In x, which is In x*. 
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Before moving to integrals, here is a new method for derivatives: logarithmic differen- 
Gation or LD. It applies to products and powers. The product and power rules are 
always available, but sometimes there is an easier way. 

Main idea: The logarithm of a product p{x) is a sum of logarithms. Switching to 
In p, the sum rule just adds up the derivatives. But there is a catch at the end, as you 
see in the example. 


EXAMPLE 4 Find dp/dx if p(x) = x*\/x — 1. Here In p(x) =x In x +} In(x— 1). 


as ldp 1 t 
. p ns at _— + + Sa ee eee 
Take the derivative of In p nae x - In x Ax 1) 
Now multiply by p(x): dp =piltinxt ele 
ultiply by pfx): ie Ax- 1) 


The catch is that last step. Multiplying by p complicates the answer. This can’t be 
helped—logarithmic differentiation contains no magic. The derivative of p = fg is the 
same as from the product rule: In p= in f+ Ing gives 

+£ and p'= AF + £) =f'g + fe’. (7) 
g aor: 


For p= xe* sin x, with three factors, the sum has three terms: 


‘ F 1 cos x 
inp=Inx+x+insin xand pp=p)/-+1+— . 
x sin x 


We multiply p times p'p (the derivative of ln p). Do the same for powers: 


ie 1 l 1 in; 
EXAMPLES p=x'* => Inp=-Inx = zt 7 = => | 
x dx xP OS 


21n x 
EXAMPLE6 p=x!* > In p=(Inx)? = oP sal 


1 d 
EXAMPLE7 p=x''™* => In p= >in x=l=> =o (') 
< : 


INTEGRALS BASED ON LOGARITHMS 


Now comes an important step. Many integrals produce logarithms. The foremost 

example is 1/x, whose integral is In x. In a certain way that is the only example. but 

its range is enormously extended by the chain rule. The derivative of In u(x} is u’:u, 
so the integral gocs from u’iu back to Inu: 

Pduidx 

u(x) 


Try to choose u(x) so that the integral contains dujdx divided by u. 


dx =In u(x) or equivalently \? =Inu. 
u 


( dx dx ] 
——_ =Injx +7 =~ Inlex - 
EXAMPLES x47 njx | | ee Injcex + 7| 
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Final remark When u is negative, In u cannot be the integral of 1/u. The logarithm 
is not defined when u <0, But the integral can go forward by switching to —u: 


Thus In{(— 1) succeeds when in u fails.t The forbidden case is u = 0. The integrals In u 
and In(—w), on the plus and minus sides of zero, can be combined as Jn|uj. Every 
integral that gives a loganthm allows u <0 by changing to the absolute value |x|: 


“dy -1 * dx 4 
ie = = Lior |o: =In1—Ine i == Linx—5i]; =In 1—In 3, 


The areas are —] and —1n 3. The graphs of 1/x and 1/(x — 5} are below the x axis. 
We do not have logarithms of negative numbers, and we will not integrate 1/(x — 5) 
from 2 to 6. That crosses the forbidden point x = 5, with infinite area on both sides. 


The ratio du/u leads to important integrals. When u=cos x or u=sin x, we are 
integrating the tangent and cotangent. When there is a possibility that u< 0, write 
the integral as In |x]. 


[ton x d= [ 2% ax - —In |cos x| | 2 = ing? +7) 
cos x 


x7+7 2 
cos x . dx 
cot x dx= | — dx =ln |sin x| —— =In|lIn x| 
sin x xInx 


Now we report on the secant and cosecaat. The integrals of [/cos x and 1/sin x 
also surrender to an attack by logarithms — based on a crazy trick: 


sec x + tan x 

sec x dx = | sec x | ——————_ } dx =iIn |sec x + tan xl. (9) 
sec x + tan x 
esc x —cot x 

|x x ax | os x ( ) dx =In |csc x — cot xl. (10) 
csc x — cot x 


Here u = sec x + tan x is in the denominator; du/dx = sec x tan x + sec? x is above it. 
The integral is In |u|. Similarly (10) contains du/dx over u=csc x — cot x. 

In closing we integrate In x itself. The derivative of x In x is Inx +1. To remove 
the extra 1, subtract x from the integral: fin xdx=xinx—x. 

In contrast, the area under 1/(In x) has no elementary formula. Nevertheless it is 
the key to the greatest approximation in mathematics—the prime number theorem. 
The area §° dx/in x is approximately the number of primes between a and b. Near e!°°, 


about 1/1000 of the integers are prime. 


6.4 EXERCISES 
Read-through questions eo. As x, Inx approaches __{ . But the ratio 
Th . sfx : a (in xf approaches _§ _. The domain and range of In x 
¢ natural logarithm of x is {] __9 _. This definition leads . 
tolnxy=_b_ andlax"=_¢_.. Then ¢ is the number a — 
whose logarithm (area under t/x curve) is _ 4 _. Similarly 
e” is now defined as the number whose natural logarithm is The derivative of Inxis_ |! . The derivative of In(1 + x} 


+The integral of 1/x (odd function) is In |x| (even function). Stay clear of x = 0. 
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is_ | 
k__. The quadratic approximationis __1! 
approximation to e*is_ om __ 


. The tangent approximation to In{1! + x) at x =0 is 
. The quadratic 


. Thus 
(In cos x) =__9__. An antiderivative of tan x is_P__. The 
product p = x e** hasIn p=__a_. The derivative of this equ- 
ation is__* . Multiplying by p gives p)=__* _, which is 
LD or logarithmic differentiation, 


The integral of u’(x)/u{x) is t_. The integral of 
2x/(x? +4)is__u . The integra] of I/cx is__¥ __. The integ- 


ral of [/(ct + s)is__w__. The integral of 1/cos x, after a trick, 


The derivative of In w{x) by the chain rule is __1 


. We should write in |x| for the antiderivative of 1/x, 
since this allows _y _. Similarly {du/u should be written 
z 


Find the derivative dy/dx in 1-10. 


1 y=In(2x) 2 y=In(2x + 1) 
3 y=(n x)? 4 y=(In x}/x 
5 p=xIlnx—x 6 y=logyox 
7 y=In(sin x) 8 y=Indn x) 


9 y=7 In 4x 10 y =In((4x}’) 


Find the indefinite (or definite) integral in 11-24. 


dt dx 
11 j-— — 
{5 i {* 
* dx 1 dt 
3 = 
i 3+x i 3+ 2t 
2 x dx 2 x? dx 
5 peat 
: 3 xt) is | x? +1 


& dx 
i xin x)? 


n/t 
20 { tan x dx 
GQ 


* dx 
‘ [ x(In x} 


cos x dx 
sin x 


21 [un 3x dx 


2 {° xf dx 
x 


22 [ot 3x dx 


ax 
% (we xj(in In x) 


25 Graph y =In(I + x) 26 Graph y = In(sin x} 


Compute ¢y/dx by differentiating In y. This is LD: 


27 y=./x7+t 2 y=./x*+1./x7?-1 


29 p=e"* 30 y=x** 


My=e 32 y=x" 
gee M4 y= (2) 9) 
35 y=x tiles % put 
Evaluate 37-42 by any method. 
10 dt 10x dt Ri =i 

cad tb -| ee 38 ax 4 dx 

5 t Sx t 1 x Lge it 

d [iat af? de 
al ar wrt FT 

d 2 
41 — In(sec x + tan x} gp: RSs ex an 
ae sec x + tan x 


Verify the derivatives 43—46, which give useful antiderivatives: 


a 1 
43 — In(x + ./x?+1) = —= 
dx fl+x? 


d x-—a 2a 
“ zh(2=4) ~ (x? — a) 


Estimate 47-50 to linear accuracy, then quadratic accuracy, 
by e* = 1 +x +4x7. Then use a calculator. 


47 In(1.1) 48 ¢} 49 In(.99) 50 e? 


In(I + x) e*—1 
x 


§1 Compute lim §2 Compute lim ——— 
x70 z-~0 x 


53 Compute lim lop 4%) 4 Compute lim at 

x70 x x0 x 
§5 Find the area of the “hyperbolic quarter-circle” enclosed 
by x =2 and y = 2 above y= I/x. 


56 Estimate the area under y = 1/x from 4 to 8 by four upper 
rectangles and four lower rectangles. Then average the 
answers {trapezoidal rule). What ts the exact area? 


1 1 1 
57 Why is 5 _ 3 fot 5 near In a? Is it above or below? 
58 Prove that Inx< /x — 1) for x > 1, Compare the integ- 
rats of t/t and 1 h/t, from | to x. 


59 Dividing by x in Problem 58 gives (In x}/x < a/x — 1)/x. 
Deduce that (In x)/x +0 as x— a. Where is the maximum 
of (In x)/x? 


60 Prove that (In x}/x!" also approaches zero. (Start with 
dn x'*}/x'/" + 0.) Where is its maximum? 
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61 For any power n, Problem 6.2.59 proved e* > x" for large 
x. Then by logarithms, x > nln x. Since (In x)/x goes below 
I/n and stays below, tt converges to 


62 Prove that yln y approaches zero as y -+ 0, by changing 
y to 1/x. Find the limit of y* (take its logarithm as y — 0). 
What is .1°? on your calculator? 


63 Find the limit of In x/log, x as x + 0. 


64 We know the integral fi 1 dt = [rtihyy = (x* — 1h. 
Its limit as h > O is 


65 Find linear approximations near x = 0 for e * and 2*. 


66 The x’ correction to In{1 + x) yields x ~ $x? + 4x*. Check 
that In 1.0] = .0099503 and find In 1.02. 


67 Anant crawls at 1 foot/seeond along a rubber band whose 
original length is 2 feet. The band is being stretched at 1 
foot/second by pulling the other end. At what time T, if ever, 
does the ant reach the other end? 

One approach: The band’s length at time ¢ is ¢ + 2. Let p(t) 
be the fraction of that length which the ant has covered, and 
explain 

{a) p= Ut+ 2) (bp) y =Infe + 2) — 


68 If the rubber band ts stretched at & feet/second, when if 
ever does the same ant reach the other end? 


In2 (c) T=2e—2. 


69 A weaker ant slows down to 2/(t + 2) feet/second, so y" = 
2i(t + 2)*. Show that the other end is never reached. 
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70 The slope of p = x* comes two ways from In p = x In x: 
1 Logarithmic differentiation (LD): Compute (In p)’ and 
multiply by p. 
2 Exponential differentiation (ED); Write x* as ¢'"*, 
take its derivative, and put back x*. 


71 If p= 2* then Inp= . LD gives p’ =(p}(In py = 
. ED gives p = e______. and then p’ = 


72 Compute In 2 by the trapezoidal rule and/or Simpson’s 
rule, to get five correct decimals. 


73 Compute In 10 by either rule with Ax = 1, and compare 
with the value on your calculator. 


74 Estimate 1/ln 90,000, the fraction of numbers near 90,000 
that are prime. (879 of the next 10,000 numbers are actually 
prime.) 


75 Find a patr of posttive integers for which x” = y*. Show 
how to change this equation to (In x)/x =(In y)/y. So look for 
two points at the same height in Figure 6.13. Prove that you 
have discovered all the integer solutions, 


*76 Show that (In x)/x =(In y)/y is satisfied by 


( + ‘) (: + ye 
x={—— ] and y={—— 
t t 


with t #0. Graph those points to show the curve x’ = y*. It 
crosses the line y= x at x= , Where t + ow. 


This section begins with the integrals that solve two basic differential equations: 


= =cy and 


dy 
. 
at =cyts (1) 


We already know the solutions. What we don’t know is how to discover those solu- 
tions, when a suggestion “try e“”’ has not been made. Many important equations, 
including these, separate into a y-integral and a ¢-integral. The answer comes directly 
from the two separate integrations. When a differential equation is reduced that far— 
to integrals that we know or can look up—it is solved. 

One particular equation will be emphasized. The fogistic equation describes the 
speedup and slowdown of growth. Its solution is an S-curve, which starts slowly, 


rises quickly, and ijevels off. (The 1990’s are near the middle of the 


S, if the 


prediction is correct for the world population.) S-curves are solutions to aonlinear 
cquations, and we will be solving our first nonlinear model. It is highly important 


in biology and ail life sciences. 
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SEPARABLE EQUATIONS 


The equations dy/dt = cy and dy/dt = cy + s {with constant source s} can be solved 
by a direct method. The idea is to separate y from t: 


dy dy 
—=cdt and ——_ = c dt. 2 
y y + (s/c) 2) 


All y’s are on the left side. All t’s are on the right side (and ¢ can be on either side). 
This separation would not be possible for dy/dt = y + ¢. 

Equation (2) contains differentials. They suggest integrals. The t-integrals give ct 
and the y-integrals give logarithms: 


s 
In y= ct + constant and n(» + :) = ct + constant. (3) 


The constant is determined by the initial condition. At t = 0 we require y = yo, and the 
right constant will make that happen: 


In y=ct+In yo and nly + *) =ct+ n(v + “) (4) 


Then the final step isolates y. The goal is a formula for y itself, not its loganthm, so 
take the exponential of both sides (e'*” is y): 


$ $ 
y= poe and yt - = (v6 + Se (5) 


It is wise to substitute y back into the differential equation, as a check. 

This is our fourth method for y’ = cy +s. Method 1 assumed from the start that 
y = Ae“ + B. Method 2 multiplied all inputs by their growth factors e“'~ 7 and added 
up outputs. Method 3 solved for y — y,,. Method 4 is separation of variables (and all 
methods give the same answer). This separation method is so useful that we repeat 
its main idea, and then explain it by using it. 


To solve dy/dt = ul y)v(t), separate dy/u(y) from v(t)dt and integrate both sides: 


| dy/uly) = | utjdt + C. {6) 
Then substitute the initial condition to determine C, and solve for Xt). 


EXAMPLE 4 dy/dt = y? separates into dy/y” = dt. Integrate to reach —I/y=¢t+C. 
Substitute ¢= 0 and y= yo to find C= — 1/y9. Now solve for y: 
— u =t- _ and y= ed eae 
y Yo 1 typ 
This solution blows up (Figure 6.15a) when ¢ reaches 1/y,. If the bank pays interest 
on your deposit squared (y = y”), you soon have all the money in the world. 


EXAMPLE 2 dy/dt = ty separates into dy/y = t dt. Then by integration In y = #47 + C. 
Substitute t=0 and y= yo to find C=In yo. The exponential of 427+ In yo gives 
y= yoe"’?, When the interest rate is c = t, the exponent is t?/2. 


EXAMPLE3 dy/dt=y+t is not separable. Method 1 survives by assuming y= 


6.5 Separable Equations Including the Logistic Equation 


J y 


l 
i 
t 
f 
| 
t 
t 
' 
U 


blowup umes '¢ =— 
i} 


Aig. 6.45 The solutions to separable equations ‘ =y? and —=n- of —=n— 


Ae' + B + Dt—with an extra coefficient D in Problem 23. Method 2 also succeeds— 
but not the separation method. 


EXAMPLE 4 Separate dy/dt = ny/t into dy/y = n dt/t. By integration In y=nlnt+C. 
Substituting t= 0 produces In 0 and disaster. This equation cannot start from time 
zero (it divides by t). However y can start from y, at t= 1, which gives C=In y,. The 
solution is a power function y= y,t". 

This was the first differential equation in the book (Section 2.2). The ratio of dy/y 
to dt/t is the “elasticity” in economics. These relative changes have units like 
dollars/dollars—they are dimensionless, and y = 7 has constant elasticity n. 

On log-log paper the graph of In p=nlnt+C is a straight line with slope n. 


THE LOGISTIC EQUATION 


The simplest model of population growth is dy/dt = cy. The growth rate c is the birth 
tate minus the death rate. If c is constant the growth goes on forever—beyond the 
point where the model is reasonable. A population can’t grow ail the way to infinity! 
Eventually there is competition for food and space, and y= e* must slow down. 

The true rate c depends on the population size y. It is a function e{y) not a constant. 
The choice of the model is at least half the problem: 


Problem in biology or ecology: Discover c(y). 
Problem in mathematics: Solve dy/dt = c(y}y. 


Every model looks linear over a smali range of y’s—but not forever, When the rate 
drops off, two models are of the greatest importance. The Michaelis-Menten equation 
has e{y) = c/{y + K). The logistic equation has e{y) = c — by. It comes first. 

The nonlinear effect is from “interaction.” For two populations of size y and z, the 
number of interactions is proportional to y times z. The Law of Mass Action produces 
a quadratic term byz. It is the basic model for interactions and competition. Here we 
have one population competing within itself, so z is the same as y. This competition 
slows down the growth, because — by” goes into the equation. 


The basic model of growth versus competition is known as the fogistic equation: 
dy/dt = cy — by*. (7) 


Nommally 6 is very small compared to c. The growth begins as usual (close to e“). 
The competition term by” is much smaller than cy, until y itself gets large. Then by” 
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(with its minus sign) slows the growth down. The solution follows an S-curve that 
we can compute exactly. 

What are the numbers 6 and c for human population? Ecologists estimate the 
natural growth rate as c = .029/year. That is not the actual rate, because of b. About 
1930, the world population was 3 billion. The cy term predicts a yearly increase of 
(.029)(3 billion) = 87 million. The actual growth was more like dy/dt = 60 million/year. 
That difference of 27 million/year was by: 


27 million/year = 6(3 billion)? leads to b = 3° 107+? /year. 


Certainly 5 is a small number (three trillionths) but its effect is not small. It reduces 
87 to 60. What is fascinating is to calculate the steady state, when the new term by” 
equals the old term cy. When these terms cancel each other, dy/dt = cy — by? is zero. 
The loss from competition balances the gain from new growth: cy = by? and y= c/b. 
The growth stops at this equilibrium point—the top of the S-curve: 


i= 7 = = 10’? = 10 billion people. 


According to Verhulst’s logistic equation, the world population is converging to 10 
billion. That is from the model. From present indications we are growing much faster. 
We will very probably go beyond 10 billion. The United Nations report in Section 3.3 
predicts 11 billion to 14 billion. 


Notice a special point halfway te y,, = c/b. (In the model this point is at 5 billion.) 
It is the inflection point where the S-curve begins to bend down. The second derivative 
d*y/dt? is zero. The slope dy/dt is a maximum. It is easier to find this point from the 
differential equation (which gives dy/dt) than from y. Take one more derivative: 


y" = (cy — by’) = cy’ — 2byy’ = (c — 2by)y’. (8) 
The factor c — 2by is zero at the inflection point y= c/2b, halfway up the S-curve. 


THE $-CURVE 


The logistic equation is solved by separating variables y and t: 
dy/dt = cy — by? becomes [artes — by’)= [a (9) 


The first question is whether we recognize this y-integral. No. The second question 
is whether it is listed in the cover of the book. No. The nearest is | dx/({a? — x”), which 
can be reached with considerable manipulation (Problem 21). The third question is 
whether a general method is available. Yes. “Partial fractions” is perfectly suited to 
Icy — by’), and Section 7.4 gives the following integral of equation (9): 

Yo 


Y ott and then In =C€, (10) 
— by c— bY 


In 
c 


That constant C makes the solution correct at t= 0. The logistic equation is integ- 
rated, but the solution can be improved. Take exponentials of both sides to remove 
the logarithms: 


y et Yo 


c~ by c— byg 


(11) 


This contains the same growth factor e‘ as in linear equations. But the logistic 
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equation is not linear—it is not y that increases so fast. According to (11), it is 
y/(c — by) that grows to infinity. This happens when ¢ — by approaches zero. 

The growth stops at y= c/b. That is the final population of the world (10 billion?). 

We stili need a formula for y. The perfect S-curve is the graph of y= 1/{1+e~"). It 
equals 1 when r= , it equals } when r= 0, it equals 0 when t= — oo. It satisfies 
y =y—y’, with c=b=1. The general formula cannot be so beautiful, because it 
allows any c, b, and yo. To find the S-curve, multiply equation (11) by c — by and 
solve for y: 


€ c 
oe wee or = —__. 
y y b+de“ 


When t approaches infinity, e “* approaches zero. The complicated part of the for- 
mula disappears. Then y approaches its steady state c/b, the asymptote in Figure 6.16. 
The S-shape comes from the inflection point halfway up. 


(12) 


10 billion — 
b 
nn c 
5 billion 5 
X 
wa 2 St +> ____—_—_————> | 
1234 1988 


Fig. 6.46 The standard S-curve y = 1/(t +e’). The population S-curve (with prediction). 


Surprising observation: z = 1/y satisfies a linear equation. By calculus z' = — y'/y?. So 
—cy + by? : 
ya ga cet h, (13) 
y y 
This equation z'= —c¢z+hb is solved by an exponential e~“ plus a constant: 
Year US Model eo upe. ; 
Population | z=Ae t+ ae (+ _ “ee +=. (14) 
17990 39 = 8.9 | ee Aan : 
1800 5.3 5.3 Turned upside down, y= 1/z is the S-curve (12). As z approaches b/c, the S-curve 
1810 7.2 7.2 | approaches c/b. Notice that z starts at 1/yo. 
1820 9.6 98 
1830, 12.9 13.4 EXAMPLE 4 (United States population) The table shows the actual population and 
oe ug Wes the model. Pearl and Reed used census figures for 1790, 1850, and 1910 to compute 
1850 =. 23.20 = 23.2 ; ; 
1860 34.4 30.4 cand 6. In between, the fit is good but not fantastic. One reason is war—another is 
: 4870 38.4 30.4 | depression. Probably more important is immigration.t In fact the Pearl-Reed steady 
4880 «50.2 50.2 ' state ¢/b is below 200 million, which the US has already passed. Certainly their model 
| 1890 62.9 62.8 can be and has been improved. The 1990 census predicted a stop before 300 million. 
4900 76.0 769 For constant immigration s we could still solve y' = cy — by? + s by partial fractions— 
1940 920 = 920 . but in practice the computer has taken over. The table comes from Braun's book 
1920 105.7 107.6 Differential Equations (Springer 1975). 
1930 122.8 123.1 | 
1940 1381.7 # 136.7 
Immigration does not enter for the world population model (at least not yet}. 


~ 1950 150.7 149.1 
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Remark For good scicnce the y* term should be explained and justified. It gave a 
nonlinear model that could be completely solved, but simplicity is not necessarily 
truth. The basic justification is this: In a population of size y, the number of encounters 
is proportional to y*. If thosc encounters are fights, the term is —hy?. If those 
encounters increase the population, as some like to think, the sign is changed. There 
is a cooperation term + by’, and the population increases very fast. 


EXAMPLES y’=cy+5y?:  y goes to infinity in a finite time. 


EXAMPLE 6 y'=—dy+ by: y dies to zero if yy < a/b. 


In Example 6 death wins. A smail population dies out before the cooperation by* 
can save it. A population below a/b is an endangered species. 
The logistic equation can't predict oscillations—those go beyond dy/di = f(y). 


The y fine Here is a way to understand every nonlinear equation y' =/{y). Draw a 
“y line.” Add arrows to show the sign of f(y). When y’ = f(y) is positive, y is increasing 
(it follows the arrow to the right). When fis negative, y goes to the left. When fis zero, 
the equation is y’ =0 and y is stationary: 


f>0 feo f>0 
x= eth ¥,.20 ved 
y =cy — by? (this is f(y) y = — dy + by? (this is f(y) 


The arrows take you left or right, to the steady state or to infinity. Arrows go toward 
stable steady states. The arrows go away, when the stationary point is unstable. The 
y line shows which way y moves and where it stops. 

The terminal velocity of a falling body is v., = Jz in Problem 6.7.54. For f(y} = 
sin y therc are several steady states: 


op : ‘ 
-Ve vi=Ve 0 Tm 2x3 


falling body: dvjdt = g — v? dy{dt = sin y 


EXAMPLE 7 Kinetics of a chemical reaction mA + nB — pC. 


The reaction combines m molecules of A with n molecules of B to produce p 
molecules of C. The numbers m,n, p are 1, , 2 for hydrogen chloride: H, + Cl, = 
2 HCI. The Law of Mass Action says that the reaction rate is proportional to the 
product of the concentrations [A] and [B]. Then [A] decays as [C] grows: 


a[A}jdt = — r[A][B] and a(C]/dt = + k[A}[B). (15) 


Chemistry measures r and k. Mathematics solves for [A] and [C]. Write y for the 
concentration [C], the number of molecules in a unit volume. Forming those y 
molecules drops the concentration [A] from ap to ag — (m/p)y. Similarly [B] drops 
from bg to by — (n/p)y. The mass action law (15) contains y: 


dy _ _m ia 
-(u-B)(o-3) c 
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This fits our nonlinear model (Problem 33-34). We now find this same mass action 
in biology. You recognize it whenever there is a product of two concentrations. 


THE MM EQUATION ay/dt= ~ ey/ty+ 


Biochemical reactions are the keys to life. They take place continually in every living 
organism. Their mathematical description is not easy! Engineering and physics go 
far with linear models, while biology is quickly nonlinear. It is true that y’ = cy is 
extremely effective in first-order kinetics (Section 6.3), but nature builds in a nonlinear 
regulator. 

It is enzymes that speed up a reaction. Without them, your life would be in slow 
motion. Blood would take years to clot. Steaks would take decades to digest. Calculus 
would take centuries to learn. The whole system is awesomely beautiful—DNA tells 
amino acids how to combine into useful proteins, and we get enzymes and elephants 
and Isaac Newton. 

Briefly, the enzyme enters the reaction and comes out again. It is the catafyst. Its 
combination with the substrate is an unstable intermediate, which breaks up into a 
new product and the enzyme (which is ready to start over). 

Here are examples of catalysts, some good and some bad. 


1. The platinum in a catalytic converter reacts with pollutants from the car engine. 
(But platinum also reacts with lead—ten gallons of leaded gasoline and you 
can forget the platinum.) 

2, Spray propellants (CFC’s) catalyze the change from ozone (O;) into ordinary 
oxygen (O,). This wipes out the ozone layer—our shield in the atmosphere. 

3. Milk becomes yoghurt and grape juice becomes wine. 

4, Blood clotting needs a whole cascade of enzymes, amplifying the reaction at 
every step. In hemophilia—the “Czar’s disease” —the enzyme called Factor VITI 
is missing. A small accident is disaster; the bleeding won’t stop. 

5. Adoiph’s Meat Tenderizer is a protein from papayas. It predigests the steak. 
Tbe same enzyme (chymopapain) is injected to soften herniated disks. 

6. Yeast makes bread rise. Enzymes put the sour in sourdough. 


Of course, it takes enzymes to make enzymes. The maternal egg contains the material 
for a cell, and also half of the DNA. The fertilized egg contains the full instructions. 


We now look at the Michaelis-Menten (MM) equation, to describe these reactions. 
It is based on the Law of Mass Action. An enzyme in concentration z converts a 
substrate in concentration y by dy/dt = — byz. The rate constant is b, and you see 
the product of “enzyme times substrate.” A similar law governs the other reactions 
(some go backwards). The equations are nonlinear, with no exact solution. It is 
typical of applied mathematics (and nature) that a pattern can still be found. 

What happens is that the enzyme concentration z(t) quickly drops to z9K/(y + K). 
The Michaelis constant K depends on the rates (like b) in the mass action laws. 
Later the enzyme reappears (z,, = Zg). But by then the first reaction is over. Its law 
of mass action is effectively 

en eee ee 
qo ae a 
with c =bz)K. This is the Michaelis-Menten equation—basic to biochemistry. 

The rate dy/dt is all-important in biology. Look at the function cy/(y + K): 


when y is large, dy/dt ~ —c when y is small, dy/dt ~ — cy/K. 
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The start and the finish operate at diflerent rates, depending whether y dominates K 
or K dominates y. The fastest rate is c. 
A biochemist solves the MM equation by separating variables: 


+ K : 
[? : ay=— ed gives y+Kiny=—ect+C. (18) 


Set 1 = 0 as usual. Then C = yo + K In yg. The exponentials of the two sides are 
ey =e tereyk (19) 


We don't have a simple formula for ». We are tucky to get this close. A computer 
can quickly graph y(t)—and we see the dynamics of enzymes. 

Problems 27-32 follow up the Michaelis-Menten theory. In science, concentrations 
and rate constants come with units. In mathematics, variables can be made dimen- 
sionless and constants become |. We solve d¥/dT = ¥/(¥+ 1) and then switch back 
to y, t,c, K. This idea applies to other equations too. 


Essential point: Most applications of calculus come through differential equations, 
That is the language of mathematics with populations and chemicals and epidemics 
obeying the samc equation. Running parallel to dy/dt = cy are the diflerence equations 
that come next. 
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Read-through questions 


The equations dyidt=cy and dy/dt=cy+s and dyidt= 
u(y)e{t) are calied _ a because we can separate y from t. 
Integration of [dyjy=fedt gives _b Integration of 
{dyi(y+sic)=[edt gives _¢ . The equation dyidx = 
—xfyleadsto_d__. Then y?+x?=__e and the solution 
stays on a circle, 


The logistic equation is dyjdf =__ | _. The new term —)y? 
represents g — when cy represents growth. Separation gives 
Jdyitey — by?) =f dt, and the y-integral is 1/¢ times In __h 
Substituting yg at ¢=Q and taking exponentials produces 
yi(e —by)=e(__1__). Ast 2c, y approaches _ J. That 


is the steady state where cy — by? = __k__. The graph of y 


looks like an __!'__. because it has an inflection point at 


y= 


In biology and chemistry, concentrations y and : react at 
a rate proportional to y times __a__. This is the Law of 

o _. Ina model equation dyidt = c( yy, the rate c depends 
on __p _. The MM equation is dyidt=__q _. Separating 
variables yields f_of ody=_s = -ct+C. 


Separate, integrate, and solve equations 1~8. 
1 dy/dt=y+5, yor? 
2 dyidt=liy, vo=l 
3 dyidx = xi", yy =1 


4 dyjdx=y? +1, yo =O 

§ dyfdx=(y t+ Di(x41), po =0 

6 dy/dx =tan ycosx, yo=l 

7 dyjdt=y sin t, yo =1 

8 dyidt=e", yo=e 

9 Suppose the rate of growth is proportional to ,/y instead 
of y. Solve dyfdi = ef y starting from yo. 
10 The equation dy/dx = nyjx for constant elasticity is the 
same as dln y)/d(In x) = . The solution is in y= 


11 When ¢ = 0 in the logistic equation, the only term is y' = 
—hby?. What is the steady state y,,? How long until + drops 
from yo {0 $¥o? 

12 Reversing signs in Problem t1, suppose y’ = + by?. At 
what time does the population explode to y= a, starting 
from vy = 2 (Adam + Eve)? 


Problems 13-26 deal with logistic equations y’ = cy — by’. 


13 Show that y= 1;(1 + e7‘) solves the equation y’ = y — y?. 

Draw the graph of } from starting values } and 4. 

14 (a) What logistic equation is solved by y = 2:(1 ~e7‘)? 
{b) Find ¢ and A in the equation solved by y = 1/(1 + e734), 


1§ Solve 2’ = —2+1 with zg =2. Turned upside down as in 
(13). what is ¥ = 1/2? 
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16 By algebra find the S-curve (12) from y = t/z in (14). 


17 How many years to grow from y, = 4c/b to » = 9c/b? Use 
equation (10) for the time ¢ since the inflection point in 1988. 
When does y reach 9 billion = .9c/b? 


18 Show by differentiating u = y/(c — by) that if y' = cy — by’ 
then uw’ =cu. This explains the logistic solution (11} — it is 
“= ue". 


19 Suppose Pittsburgh grows from yo =1 million people in 
1900 to y=3 million in the year 2000. If the growth rate is 
y = 12,000/year in 1900 and yp’ = 30,000/year in 2000, substi- 
tute in the logistic equation to find c and 5. What is the steady 
state? Extra credit: When does y = y,,/2 = c/2b? 


20 Suppose c = 1 but b = — 1, giving cooperation y = y + y?. 
Solve for p(t) if yp = 1. When does y become infinite? 

Zi Draw an S-curve through (0, 0) with horizontal asymp- 
totes y= — | and y= 1. Show that y=(¢ —e (4 +e "has 
those three properties. The graph of y? is shaped like 


22 To solve y =cy — by* change to w= 1/y*. Substitute for 
y’ in u’ = — 2y'/y* to find a linear equation for u. Solve it as 
in (14) but with wo = 1/y2. Then y= 1/\/u. 
23 With y=r¥ and t=sT, the equation dy/dt =cy— by? 
changes to d¥/dT = Y— Y?. Find r and s. 


24 Inachange to y=rYand t =sT, how are the initial values 
Yo and yp related to Y, and Yo? 


25 A rumor spreads according to y = (N—y). Hf y people 


know, then N — y don’t know. The product (NW — y) measures 
the number of meetings (to pass on the rumor). 
(a) Solve dy/dt = p(N — y) starting from yy = 1. 
(b) At what time T have N/2 people heard the rumor? 
(c) This model is terrible because T goes to as 
N— oo. A better model is y’ = byX{N — y). 


26 Suppose 6 and c are beth multiplied by 10. Does the 
middle of the S-curve get steeper or flatter? 


Problems 27-34 deal with mass action and the MM equation 
y= —cy/(y + K). 


27 Most drugs are eliminated acording to y' =—cy but 
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aspirin follows the MM equation, With c= K = yp) = 1, does 
aspirin decay faster? 


28 If you take aspirin at a constant rate d (the maintenance 
dose), find the steady state level where d= cy/(y + K). Then 
y=0. 

29 Sbow that the rate R =cy/(y+K) in the MM equation 
increases as y increases, and find the maximum as ) ~> 00. 


30 Graph the rate R as a function of y for K=1 and K= 
10, (Take c = 1.) As the Michaelis constant increases, the rate 
. At what value of y is R= 4c? 


31 With y=KY and ct= KT, find the ‘“nondimensional” 
MM equation for dY/dT From the solution e*Y= 
e~T e%°Y, recover the y, t solution (19). 


32 Graph p(¢) in (19) for different c and K (by computer). 


33 The Law of Mass Action for A+B34C its y'= 
Kao — y)(bo — y). Suppose yp = 0, ag = by = 3, K = t. Solve for 
y and find the time when y = 2, 


34 In addition to the equation for d[C]/dt, the mass action 
law gives d[A]/dt = : 


35 Solve y' = y+¢from yp = 0 by assuming y = Ae + B+ Dt. 
Find A, B, D. ; 


36 Rewrite cy —by? as a? — x?, with x =./by—c/2,/b and 


a= . Substitute for a and x in the integral taken 
from tables, to obtain the y-integral in the text: 
dx 1, at+x dy 1 y 
= J =7-! =I 
lee Qa a—x |= ee ce by 


37 (Important) Draw the pines (with arrows as in the text) 
for y' =y/(1~—y) and y =y—- y®. Which steady states are 
approached from which initial values yp? 

38 Explain in your own words how the p-line works. 


39 (a)Solve y=tan yp starting from yo=2/6 to find 
sin y=te'. 
(b) Explain why ¢ = 1 is never reached. 
(c) Draw arrows on the y-line to show that y approaches 
n/2 — when does it get there? 


40 Write the logistic equation as y’=cy{l—y/K). As y’ 
approaches zero, y approaches . Find y, y’, »* at the 
inflection point. 


You may remember our first look at e. It is the special base for which ¢* has slope 1 
at x=0. That led to the great equation of exponential growth: The derivative of 
e* equals e*. But our look at the actual number e = 2.71828... was very short. 
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It appeared as the limit of (1+ t/n)". This seems an unnatural way to write down 
such an important number. 

I want to show how (1 + 1/n)" and (1 + x/n)” arise naturally. They give discrete 
growth in finite steps—with applications to compound interest. Loans and life insur- 
ance and money market funds use the discrete form of y’ = cy + s. (We include extra 
information about bank rates, hoping this may be useful some day.) The applications 
in science and engineering are equally important. Scientific computing, like account- 
ing, has difference equations in parallel with differential equations. 


Knowing that this section will be full of formulas, I would like to jump ahead and 
tell you the best one. Jt is an infinite series for e*. What makes the series beautiful is 
that its derivative is itself. 

Start with y= 1+ x. This has y=1 and y' =1 at x=0, But y’ is zero, not one. 
Such a simple function doesn’t stand a chance! No polynomial can be its own deriva- 
tive, because the highest power x" drops down to nx*~?. The only way is to have no 
highest power. We are forced to consider infinitely many terms—a power serfes—to 
achieve ‘derivative equals function.” 

To produce the derivative 1 + x, we need | + x + 4x7. Then 4x? is the derivative 
of 4x*, which is the derivative of 7;x*. The best way is to write the whole series at 
once: 


Infinite series C =1+x+4x? +4hx3 + pxtter. (1) 
This must be the greatest power series ever discovered. Its derivative is itself: 
de*fdx=OF1Ltxtaxt teeter. (2) 


The derivative of each term is the term before it. The integral of each term is the one 
after it (so { e*dx = e* + C). The approximation e* ~ 1+ x appears in the first two 
terms. Other properties like (e*)(e") = e?* are not so obvious. (Multiplying series is 
hard but interesting.) it is not even clear why the sum is 2.718... when x= 1. 
Somehow 1+ {1 +44+4+ -- equals e. That is where (1 + 1/n)" will come in. 

Notice that x” is divided by the product 1+2+3+---+m. This is “n factorial.” Thus 
x‘ is divided by 1+2-3+4=4!= 24, and x9 is divided by 5! = 120. The derivative of 
x°/120 is x*/24, because 5 from the derivative cancels 5 from the factorial. In general 
x"jn! has derivative x”~'/(n — 1)! Surprisingly 0! is 1. 


Chapter 10 emphasizes that x"/n! becomes extremely small as 4 increases. The 
infinite series adds up to a finite number—which is e*. We turn now to discrete 
growth, which produces the same series in the limit. 

This headiine was on page one of the New York Times for May 27, 1990. 


213 Years After Loan, Uncle Sam is Dunned 


San Antonio, May 26—More than 200 years ago, a wealthy Pennsyivania 
merchant named Jacob DeHaven lent $450,000 to the Continental Congress to 
rescue the troops at Valley Forge. That loan was apparently never repaid. 

So Mr. DeHaven’s descendants are taking the United States Government to 
court to collect what they believe they are owed. The total: $141 billion if the 
interest is compounded daily at 6 percent, the going rate at the time. If com- 
pounded yearly, the bill is only $98 billion. 

The thousands of family members scattered around the country say they are 
not being greedy. “‘It’s not the money—it’s the principle of the thing,” said 
Carolyn Cokerham, a DeHaven on her father’s side who lives in San Antonio. 
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“You have to wonder whether there would even be a United States if this man 
had not made the sacrifice that he did. He gave everything he had.” 


The descendants say that they are willing to be flexible about the amount of 
settlement. But they also note that interest is accumulating at $190 a second. 

“None of these people have any intention of bankrupting the Government,” 
said Jo Beth Kloecker, a lawyer from Stafford, Texas. Fresh out of law school, 
Ms. Kloecker accepted the case for less than the customary 30 percent 
contingency. 

It is unclear how many descendants there are. Ms. Kloecker estimates that 
based on 10 generations with four children in each generation, there could be as 
many as half a million. 

The initial suit was dismissed on the ground that the statute of limitations is 
six years for a suit against the Federal Government. The family’s appeal asserts 
that this violates Article 6 of the Constitution, which declares as valid all debts 
owed by the Government before the Constitution was adopted. 

Mr. DeHaven died penniless in 1812. He had no children. 


COMPOUND INTEREST 


The idea of compound interest can be applied right away. Suppose you invest $1000 
at a rate of 100% (hard to do). If this is the annual rate, the interest after a year is 
another $1000. You receive $2000 in all. But if the interest is compounded you receive 
more: 


after six months: Interest of $500 is reinvested to give $1500 
end of year: New interest of $750 (50% of 1500} gives 32250 total. 


The bank multiplied twice by 1.5 (1000 to 1500 to 2250). Compounding quarterly 
multiplies four times by 1.25 (1 for principal, .25 for interest): 


after one quarter the total is 1000 + (.25){1000) = 1250 

after two quarters the total is 1250 + (.25)(1250) = 1562.50 

after nine months the total is 1562.50 + (.25)(1562.50) = 1953.12 

after a full year the total is 1953.12 + (.25)(1953.12) = 2441.41 

Each step multiplies hy 1 + (1/n), to add one nth ofa year’s interest—still at 100%: 

quarterly conversion: (1 + 1/4)* x 1000 = 2441.41 
monthly conversion: (1 + 1/12)!? x 1000 = 2613.04 
daily conversion: (1 + 1/365)?°> x 1000 = 2714.57. 


Many banks use 360 days in a year, although computers have made that obsolete. 
Very few banks use minutes (525,600 per year). Nobody compounds every second 
(n = 31,536,000). But some banks offer continuous compounding. This is the limiting 
case (n > 00) that produces e: 


(: + *) x 1000 approaches e x 1000 = 2718.28. 
n 


1 
1. Quick method for (1+ 1/n)": Take its logarithm. Use \n(1 + x)= x with x= 7 
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n( + Je n n( 1 + =) md (=) =1, (3) 
n n n 


As 1/n gets smaller, this approximation gets better. The limit is 1. Conclusion: 
(1 + 1/n}" approaches the number whose logarithm is 1. Sections 6.2 and 6.4 define 
the same number {which is e). 


2. Slow method for (1 + 1/n)": Multiply out all the terms. Then let n+ o. 


This is a brutal use of the binomial theorem. It involves nothing smart like logarithms, 
but the result is a fantastic new formula for e. 
1 3 
(3): 


: 1\ Ly... 3°277¥ 
=3: >) =1+3[-]+—[<] + 
Practice for n= 3 (: + 3) t (5) " (5) 
Binomial theorem for any positive integer n: 
ly 1 an-DALY  aa-1mn-Qly 1\' 
-|=1+n{- + =J 4-4 (=). 
(1+2) n(2) + LZ (7) ke2s3 n n 4) 
Each term in equation (4) approaches a limit as m + oo. Typical terms are 


n(n — = G9] 1 and n(n — 1){n — 2) (3) = 1 


3-2-1 
1-2-3 


1-2 \n} 71-2 1-2-3 \nJ 1-253 
Next comes 1/1°2°3+4, The sum of all those limits in (4) is our new formula for e: 
im(1+ Vaiss + ot gtene (5) 
In summation notation this is £77, 1/k! =e. The factorials give fast convergence: 
1+1+.5+ .16667 + .04167 + .00833 + .00139 + .00020 + .00002 = 2.71828. 


Those nine terms give an accuracy that was not reached by n = 365 compoundings. 
A limit is still involved (to add up the whole series). You mever see e without a limit! 
It can be defined by derivatives or integrals or powers (1 + 1/n)” or by an infinite 
series. Something goes to zero or infinity, and care is required. 

All terms in equation (4) are below (or equal to) the corresponding terms in (5). 
The power (1 + 1/n)" approaches e from below. There is a steady increase with n. Faster 
compounding yields more interest. Continuous compounding at 100% yields e, as 
each term in (4) moves up to its limit in (5). 


Remark Change (1+ 1/n)" to (1+ x/n)". Now the binomial theorem produces e*: 


x¥ x\ atn—-1) fx? x? 
— = = = tee + ——_ vee 
¢ + *) 1+ n(*) + 12 (*) +++ approaches | + x 12 +r, (6) 


Please recognize e* on the nght side! It is the infinite power series in equation (1). 
The next term is x7/6 (x can be positive or negative). This is a final formula for e”: 


The logarithm of that power is 1 In(1 + x/n) = n(x/n) = x. The power approaches e*. 

To summarize: The quick method proves (1 + I/n)" > e by logarithms. The slow 
method (multiplying out every term) led to the infinite series. Together they show the 
agreement of all our definitions of e. 


6.6 Powers Instead of Exponentials 
DIFFERENCE EQUATIONS VS. DIFFERENTIAL EQUATIONS 


We have the chance to see an important part of applied mathematics. This is not a 
course on differential equations, and it cannot become a course on difference equ- 
ations. But it is a course with a purpose—we aim to use what we know. Our main 
application of e was to solve y = cy and y’ = cy + s. Now we solve the corresponding 
difference equations. 

Above all, the goal is to see the connections. The purpose of mathematics is to 
understand and explain patterns. The path from ‘“‘discrete to continuous” is beautifully 
illustrated by these equations. Not every class will pursue them to the end, but I 
cannot fail to show the pattern in a difference equation: 


yt + 1) = aye). (7) 


Each step multiplies by the same number a. The starting value yy is followed by ayo, 
a*yp, and a°y9. The solution at discrete times t= 0, 1, 2, ... is W(t)=a'yp. 
This formula a'y, replaces the continuous solution ey, of the differential equation. 


decaying 
oscillation oo” 
decay ga growth 


0a390 | a’ oo 


Fig. 6.17 Growth for |a| > 1, decay for [a] < 1. Growth factor 2 compares to e. 


A source or sink (birth or death, deposit or withdrawal) is like y’ = cy + s: 
yet Y=ayt)+s. (8) 
Each step multiplies by a and adds s. The first outputs are 
W1)=ayot+s, y2)=a*yotasts, yW3)=a%yota*st+asts. 


We saw this pattern for differential equations—every input s becomes a new starting 
point, It is multiplied by powers of a. Since s enters later than yp, the powers stop at 
t~ 1. Algebra turns the sum into a clean formula by adding the geometric series: 


KL) =a yo +s| a‘? +-t+e tat 1]= a'y, + s(a' — 1)/{a — 1). (9) 


EXAMPLE 1 Interest at 8% from annual FRA deposits of s = $2000 (here yy = 0). 


The first deposit is at year f= 1. In a year it is multiphed by a = 1.08, because 8% is 
added. At the same time a new s= 2000 goes in. At t=3 the first deposit has been 
multiplied by (1.08)*, the second by 1.08, and there is another s = 2000. After year t, 


y(t) = 2000(1.08" — 1)/(1.08 - 1). (10) 


With t= 1 this is 2000. With t= 2 it is 2000 (1.08 + 1)—two deposits. Notice how 
a— 1 (the interest rate .08) appears in the denominator. 


EXAMPLE 2 Approach to steady state when |a| < 1. Compare with c <0. 


With a> 1, everything has been increasing. That corresponds to c>0O in the 
differential equation (which is growth). But things die, and money is spent, so a can 
be smailer than one. In that case a'yp approaches zero—the starting balance disap- 
pears. What happens if there is also a source? Every year half of the balance y(t) is 
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spent and a new $2000 is deposited. Now a=}: 


wt + 1) =3y{t)+ 2000 yields y{2}= G)tyo + 2000 (Gy — Wie- 1) |. 
The limit as ¢ + oo is an equilibrium point. As (4) goes to zero, y(t) stabilizes to 
Von = 20000 — 1)/( — 1) = 4000 = steady state. (14) 


Why is 4000 steady? Because haif is lost and the new 2000 makes it up again. The 
iteration is y,+, =4y, + 2000. Its fixed point is where y.. = Zy.. + 2000. 

In general the steady equation is y,,.~ay,,+s. Solving for y,, gives s/(1 — a). 
Compare with the steady differential equation y’ = cy + s = 0: 


Va=— = (differential equation) vs. y,= a iifference equation). 12) 
Fe _ 


EXAMPLE 3 Demand equals supply when the price is nght. 


Difference equations are basic to economics. Decisions are made every year (by a 
farmer) or every day (by a bank) or every minute (by the stock market). There are 
three assumptions: 


1. Supply next time depends on price this time: S(t + 1) =cP(#). 
2. Demand next time depends on price next time: D(t + 1)= — dP(t+ 1) +b. 
3. Demand next time equals supply next time: D(t + 1) = S(t + 1). 


Comment on 3: the price sets itself to make demand = supply. The demand slope — d 
is negative. The supply slope c is positive. Those lines intersect at the competitive 
price, where supply equals demand. To find the difference equation, substitute I and 
2 into 3: 

Difference equation. —dP(t+ 1I)+b=cP(t) 

Steady state price: —dP,,+b=cP,,. Thus P,, = 5j/{c + d). 


If the price starts above P,,, the difference equation brings it down. If below, the 
price goes up. When the price is P.,, it stays there. This is not news—economic 
theory depends on approach to a steady state. But convergence only occurs if c < d. 
If supply is less sensitive than demand, the economy is stable. 

Biow-up example: ¢ = 2,b = d = 1. The difference equation is — P(t + 1)+ 1 = 2P(t). 
From P(0)= 1 the price oscillates as it grows: P= — 1,3, —5, 11,.... 

Stable example: c = 1/2, b= d= 1. The price moves from P(0)= 1 to P(co) = 2/3: 


i 135 2 
—P(tt+1)+1== ields P=1,-,-,3, ... ing =. 
P(t+1)+1 5 Plt) yields 1, ag approaching 3 
Increasing d gives greater stability. That is the effect of price supports. For d=0 
(fixed demand regardless of price) the economy is out of control. 


THE MATHEMAJICS OF FINANCE 


It would be a pleasure to make this supply-demand model more realistic—with 
curves, not straight lines. Stability depends on the slope—calculus enters. But we 
also have to be realistic about class time. I believe the most practical application is 
to solve the fundamental problems of finance. Section 6.3 answered six questions about 
continuous interest. We now answer the same six questions when the annual rate is 
x= .05 = 5% and interest is compounded n times a year. 
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First we compute effective rates, higher than .05 because of compounding: 


4 
compounded quarterly ( + *) = 1.0509 [ effective rate .0509 = 5.09% | 


compounded continuously e°°? = 1.0513 [ effective rate 5.13% | 


Now come the six questions. Next to the new answer (discrete) we write the old 
answer (continuous). One is algebra, the other is calculus. The time period is 20 years, 
so simple interest on yp would produce {.05)(20){ yo). That equals yy) — money doubles 
in 20 years at 5% simple interest. 

Questions 1 and 2 ask for the future value y and present value y. with compound 
interest nm times a year: 


05 20a 
1. y growing from yo: y= (1 + =) Yo y = eb 051205 
. 05\7 20 
2. deposit yp to reach y: Yo= (1 + =) y Vo = ef O5N205, 


Each step multiplies by a= (1+ .05/n). There are 20n steps in 20 years. Time goes 
backward in Question 2. We divide by the growth factor instead of multiplying. The 
future value is greater than the present value (unless the interest rate is negative!). As 
n— oo the discrete y on the left approaches the continuous y on the right. 

Questions 3 and 4 connect y to s (with yo = 0 at the start). As soon as each s is 
deposited, it starts growing. Then y=s+as+a’s+-. 


200 _ (.05)(20) _ 
3. y growing from deposits s: y= {oo y= {| 


(1+ .05/n)2"— 1 


Questions 5 and 6 connect yo to s. This time y is zero—there is nothing left at the 
end. Everything is paid. The deposit y, is just enough to allow payments of s. This 
is an annuity, where the bank earns interest on your yp while it pays you s (n times 
a year for 20 years). So your deposit in Question 5 is less than 20ns. 

Question 6 is the opposite—a foan. At the start you borrow yg (instead of giving 
the bank yo). You can earn interest on it as you pay it back. Therefore your payments 
have to total more than yo. This is the calculation for car loans and mortgages. 


5 05 .05 
4. deposits s to reach y: s= nescesd s= | zoster | 


5. Annuity: Deposit yg to receive 20n payments of s: 


ae 1 -{(1 + 05/n)~ 20" ee 1- e 7 (05220) 
a 05/n Yo 05 


6. Loam:. Repay yp with 20n payments of s: 


: 05/n 7 05 
5= Yo 1 — (1+ .05/n)~ 2" S= ¥o 1 — 27 &95)20) 


Questions 2, 4, 6 are the inverses of 1,3, 5. Notice the pattern: There are three num- 
bers y, yo, and s. One of them is zero each time. If all three are present, go back to 
equation (9). 


The algebra for these lines is in the exercises. [t is not calculus because At is not at. 
All factors in brackets [ ] are listed in tables, and the banks keep copies. It might 
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also be helpful to know their symbols. If a bank has interest rate i per period over 
N periods, then in our notation a= 1+i=1+ .05/n and t= N = 20n: 


future value of yp = $1 (line 1): KN) =(1+ i)" 
present value of y= $1 (line 2): yy=(1+i)7% 
future value of s = $1 (line 3): KN) = sai = [a +i¥- 1 |i 


present value of s = $1 (line 5): yo = au = E —C+ a |i 


To teil the truth, I never knew the last two formulas until writing this book. The 
mortgage on my home has N =(1i2)(25) monthly payments with interest rate i= 
07/12, In 1972 tbe present value was $42,000 = amount borrowed. J] am now going 
to see if the bank is honest.t 


Remark Jn many loans, the bank computes interest on the amount paid back 
instead of the amount received. This is called discounting. A loan of $1000 at 5% 
for one year costs $50 interest. Normally you receive $1000 and pay back $1050. 
With discounting you receive $950 (called the proceeds) and you pay back $1000. 
The true interest rate is higher than 5%—because the 550 interest is paid on the 
smaller amount $950. In this case the ‘discount rate”’ is 50/950 = 5.26%. 


SCIENTIARC COMPUTING: DIFFERENTIAL EQUATIONS BY DIFFERENCE EQUATIONS 


In biology and business, most events are discrete. In engineering and physics, time 
and space are continuous. Maybe at some quantum level it’s all the same, but the 
equations of physics (starting with Newton’s law F = ma) are differential equations. 
The great contnbution of caiculus is to model the rates of change we see in nature. 
But to solve that model with a computer, it needs to be made digital and discrete. 

These paragraphs work with dy/dt = cy. It is the test equation that all analysts use, 
as soon as a new computing method is proposed. Its solution is y = e”, starting from 
Yo = 1, Here we test Euler’s method (nearly ancient, and not well thought of). He 
replaced dy/dt by Ay/At: 


Euler’s Method 


yt + Ath— yt) 
Se age cyt). (13) 


The left side is dy/dt, in the limit At + 0. We stop earlier, when At > 0. 
The problem is to solve (13). Multiplying by At, the equation is 


yet AjN=(1+cAnyp(t) (with »(0)= 1). 
Each step multiplies by a= 1 + cAt, so n steps multiply by a*: 
y=a"=(1 + cAty at time nAt. (14) 


This is growth or decay, depending on a. The correct ¢* is growth or decay, depending 
on c. The question is whether a" and e stay clase. Can one of them grow while the 
other decays? We expect the difference equation to copy y’=cy, but we might be 
wrong. 

A good example is y’= — y. Then c= — 1 and y=e~‘—the true solution decays. 


fIt’s not. s is too big. I knew it. 


6.6 Powers instead of Exponentials 


The calculator gives the following answers a” for n = 2, 10, 20: 


At a=t+cAt @& gi? qa? 

3 =2 4 1024 1048576 

1 0 0 0 0 
10 #390) (SCD 


1/20 95 90 .60 36 


The big step Ar =3 shows total instability (top row). The numbers blow up when 
they should decay. The row with At = 1 is equally useless (ail zeros). In practice the 
magnitude of cAf must come down to .[0 or .05. For accurate calculations it would 
have to be even smaller, unless we change to a better difference equation. That is the 
right thing to do. 

Notice the two reasonable numbers. They are .35 and .36, approaching e~ ' = .37. 
They come from n= 10 (with At = 1/10) and n= 20 (with At = 1/20). Those have the 
same clock time nAt= 1: 


1 10 1 20 1" ; 
(1-76) = 35 (1-55) = 36 (1-3) +e = 37. 


The main diagonal of the table is executing (1 + x/n)" — e* in the case x = — 1. 


Final question: How quickly are .35 and .36 converging to e~' = .37? With At = .10 
the error is .02. With At = .05 the error is .01. Cutting the time step in half cuts the 
error in half. We are not keeping enough digits to be sure, but the error seems close 
to At. To test that, apply the “quick method” and estimate a* = (1 — At)’ from its 
logarithm: 


in(1 — At)" =n In(t — At) wn[ — At — 440)? ]= - 1-4Ae. (15) 
The clock time is nAt = 1. Now take exponentials of the far left and night: 
a"=(1— At)'ze te 4? we (1 — 4A). (16) 


The difference between a" and e~? is the last term 4Ate~’. Everything comes down 
to one question: Is that error the same as Af? The answer is yes, because e~'/2 is 
1/5. If we keep only one digit, the prediction is perfect! 


That took an hour to work out, and I hope it takes longer than At to read. I wanted 
you to see in use the properties of In x and e*. The exact property In a" = nin a came 
first. In the middle of (15) was the key approximation In(1 + x) = x —4x?, with x = 
— At. That x? term uses the second derivative (Section 6.4). At the very end came 
eailtex. 

A linear approximation shows convergence: (1 + x/n)" > e*. A quadratic shows the 
error: proportional to At = 1/n. It is like using rectangles for areas, with error propor- 
tional to Ax. This minimal accuracy was enough to define the integral, and here it is 
enough to define e. it is completely unacceptable for scientific computing. 

The trapezoidal rule, for integrals or for y’ = cy, has errors of order (Ax)? and (Az)’. 
All good software goes further than that. Euler’s first-order method could not predict 
the weather before it happens. 


Euler’s Method for Yo Fly, t): we + At) i) 


dt : A = F(t), 0). 
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6.6 EXERCISES 


Read-throagh questions 
The infinite series for eis __a__. Its derivativeis__b . The 
denominator nl is called“ ¢ "anditequals d_.Atx= 


1 the series fore is @ 


To match the original definition of e, multiply out 
(1+ t/n’=__t (first three terms). As n— oo those terms 
approach g __ in agreement with e. The first three terms of 
(1+ x/nf are bh. As n-+o they approach _! in 
agreement with e*. Thus {1+ x/n/ approaches __j . A 
quicker method computes Infl+x/n)"=__k (first term 
only) and takes the exponential. 


Compound interest (n times in one year at annual rate x) 
multiplies by (__!__)". As m > 00, continuous compounding 
multiplies by_om__. At x = 10% with continuous compound- 
ing, $1 growsto__ _ ina year. 


The difference equation y(t + t) = ay{r) yields y(t) =__o 
times )o. The equation y(t+ 1)=ay{t)+ is solved by y= 
a'yg+s{l+a+°+a‘~']. The sum im brackets is _ p 
When a= 1,08 and yp =0, annual deposits of s = 1 produce 
y=_4__ after ¢ years. If a=4 and y, =0, annual deposits 
ofs=6 leave __' after t years, approaching y, =__$ 


The steady equation y,. = ay,, +s gives py, =_1t 


When i = interest rate per period, the value of yp = $1 after 
N periods is (XN) =__u__. The deposit to produce y(N) = | 
iS Wo =__¥__. The value of s = $1 deposited after each pertod 
grows to (WN)=__w__. The deposit to reach WN) = 1 is s= 

x 


Euler’s method replaces y’=cy by Ay =cyAt. Each step 
multiplies y by _y _. Therefore y at t= 1 is (1 + cAd)!"yo, 
which convergesto__2 as At > 0. The error is proportional 
to A__, whichis too 8 ___ for scientific computing, 


1 Write down a power series y = 1 — x + --- whose derivative 
is —y. 

2 Write down a power series y = 1 + 2x +-:- whose deriva- 
tive is 2y. 

3 Find two series that are equal to their second derivatives. 


4 By comparing e=14+1+4+%+2%+- with a larger 
series (whose sum is easier} show that e < 3, 


5 At 5% interest compute the output from $1000 in a year 
with 6-month and 3-month and weekly compounding, 


6 With the quick method In(1 + x} = x, estimate In{1 — 1/n)" 
and In(i + 2/n}*. Then take exponentials to find the two limits. 


7 With the slow method multiply out the three terms of 
{1 — 4) and the five terms of (1 — })*. What are the first three 
terms of (1 — 1/n}*, and what are their limits as n — 00? 


8 The slow method leads to 1 —1+4 1/2!-—1/3!+-- for the 


limit of (1 — 1/n)". What is the sum of this infinite series — 
the exact sur and the sum after five terms? 


9 Knowing that (1 + 1/n)" > e, explain (1 + 1/n)?" > e? and 
(1 +2/N)*  e2, 


10 What are the limits of (1+1/n?)" and (141/n)"? 
OK to use a calculator to guess these limits. 


11 (a) The power (1 + I/n)” (decreases) (increases) with n, as 
we compound more often. (b) The derivative of f(x)= 
x In(1 + 1/x), which is , Should be {(<0)}(>0). This is 
confirmed by Problem 12. 


12 Show that In(1 + 1/x) > 1x + 1) by drawing the graph of 
i/t. The area fromt=1 to 1+ I/xis . The rectangle 
inside it has area 


13 Take three steps of t+ 1) = 2y(t) from yy = 1. 
14 Take three steps of yr + 1)= 2y(t) + 1 from yp =O. 


Solve the difference equations 15-22. 

15 pie + 1)= 30), yo =4 16 Wet 1) =$H(0, yo = 1 

17 ¥t+ D=HH+1, ¥o=0 18 e+ N=p)—-1, yo=O 
19 yt+1)=3pt) +1, yo=O WO WE+ N=3yt) +5, oH 1 
21 ptt D=ayt)t+s, yo =0 22 Wet l)=ay(t)+s, po =S 


In 23-26, which initial value produces y, = yo (steady state)? 
23 e+ 1} = 2y(t)-—6 24 y(t t+ 1} =F) -6 
25 we+ = — pi)+6 26 tt LI = —4p} +6 


27 In Problems 23 and 24, start from yg = 2 and take three 
steps to reach y3. Is this approaching a steady state? 


28 For which numbers a does (1 — a‘)/(! — a} approach a limit 
as tf oc and what is the limit? 


29 The price P is determined by supply=demand or 
—dP(t + 1)+b=cP(t). Which price P is not changed from 
one year to the next? 


30 Find P(t) from the supply-demand equation with c = 1, 
d=2, b=8, P(0)=0. What is the steady state as t > «0? 


Assume 10% interest (so a = 1 +i = 1.1) in Problems 31-38. 


31 At 10% interest compounded quarterly, what is the effec- 
tive rate? 


32 At 10% interest compounded daily, what is the effective 
rate? 


33 Find the future value in 20 years of $100 deposited now. 


34 Find the present value of $1000 promised in twenty years. 


6.7. Hyperbolic Functions Zi 


35 For a mortgage of $100,000 over 20 years, what is the 
monthly payment? 


36 For acar loan of $10,000 over 6 years, what is the monthly 
payment? 

37 With annual compounding of deposits s = $1000, what is 
the balance in 20 years? 


38 If you repay s = $1000 annually on a loan of $8000, when 
are you paid up? (Remember interest.) 


39 Every year two thirds of the available houses are sold, and 
1000 new houses are built. What is the steady state of the 
housing market — how many are available? 


40 Ifa loan shark charges 5% interest a month on the $1000 
you need for blackmail, and you pay $60 a month, how much 


6.7 Hyperbolic Functions 


do you still owe after one month (and after a year)? 


41 Euler charges c = 100% interest on his $1 fee for discover- 
ing e. What do you owe (including the $1) after a year with 
(a) no compounding; (b) compounding every week; (c) con- 
tinuous compounding? 

42 Approximate (1 + 1/n)" as in (15) and (16) to show that 
you owe Euler about e — e/2n. Compare Problem 6.2.5. 


43 My Visa statement says monthly rate = 1.42% and yearly 
rate = 17%. What is the true yearly rate, since Visa com- 
pounds the interest? Give a formula or a number. 


44 You borrow yo = $80,000 at 9% to buy a house. 
(a) What are your monthly payments s over 30 years? 
(b) How much do you pay altogether? 


This section combines e* with e *. Up to now those functions have gone separate 
ways—one increasing, the other decreasing. But two particular combinations have 
earned names of their own (cosh x and sinh x): 


e + e* 


hyperbolic cosine cosh x = hyperbolic sine sinh x = : 


2 2 


The first name rhymes with “gosh”. The second is usually pronounced “‘cinch”’. 

The graphs in Figure 6.18 show that cosh x > sinh x. For large x both hyperbolic 
functions come extremely close to $e*. When x is large and negative, it is e~* that 
dominates. Cosh x still goes up to + oo while sinh x goes down to — co (because 
sinh x has a minus sign in front of e~*). 


Fig. 6.18 Cosh x and sinh x. The hyperbolic 
functions combine $e* and $e~*. 


Louis Visitors Commission. 


The following facts come directly from 4(e* + e~*) and 4(e* — e~*): 
cosh(— x)= cosh x and cosh 0=1 (cosh is even like the cosine) 


sinh(— x) = — sinh x and sinh 0=0 (sinh is odd like the sine) 


Fig. 6.49 Gateway Arch courtesy of the St. 
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The graph of cosh x corresponds to a hanging cable (hanging under its weight). 
Turned upside down, it has the shape of the Gateway Arch in St. Louis. That must 
be the largest upside-down cosh function ever built. A cable is easier to construct 
than an arch, because gravity does the work. With the right axes in Problem 55, the 
height of the cable is a stretched-out cosh function called a catenary: 


y =a cosh (x/a) (cable tension/cable density = a). 
Busch Stadium in St. Louis has 96 catenary curves, to match the Arch. 


The properties of the hyperbolic functions come directly from the definitions. There 
are too many properties to memorize—and no reason to do it! One rule is the most 
important. Every fact about sines and cosines is reflected in a corresponding fact about 
sinh x and cosh x. Often the only difference is a minus sign. Here are four properties: 


1. (cosh x)? — (sinh x)? = 1 [ instead of (cos x)? + (sin x)? = 1| 


=1 


+e* 2 wigs 12 2x4.94 ~2x — 2x49 — —2x 
Ctinck E e [-7 e [-! e e e 


2 Z 4 


2. a, (cosh x) = sinh x instead of a (cos x)= — sin x| 
dx dx 


3. . (sinh x)=cosh x | like ha sin x =cos x 
dx dx 


4. {sins x dx =cosh x +C and [cost x dx =sinh x + C 


(cos f, sin 1) 
area 1/2 


Fig. 6.20 The unit circle cos*t + sin?s = 1 and the unit hyperbola cosh? — sinh?1 = 1. 


Property | is the connection to hyperbolas. It is responsible for the “h” in cosh and 
sinh. Remember that (cos x)? + (sin x)? = 1 puts the point (cos x, sin x) onto a unit 
circle. As x varies, the point goes around the circle. The ordinary sine and cosine are 
“circular functions.” Now look at (cosh x, sinh x). Property 1 is (cosh x)? — (sinh x)? = 
1, so this point travels on the unit hyperbola in Figure 6.20. 

You will guess the definitions of the other four hyperbolic functions: 


sinh. x'__¢—e¢"* _cosh:x ete * 


tanh x= = - coth x - 
coshx e*+e™* sinhx e*—-e~* 
2 1 2 
sech x = = cesch x= -— = 
coshx e*+e * sinhx e*—e* 


I think “tanh” is pronounceable, and “sech” is easy. The others are harder. Their 
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properties come directly from cosh?x — sinh*x = 1, Divide by cosh?x and sinh?x: 
1—tanh ?x=sech?x and coth*x — 1=csch?x 


(tanh x =sech?x and (sech x)’ = —sech x tanh x 


[cant x dx= (ae dx = In(cosh x) + C. 
cosh x 


INVERSE HYPERBOLIC FUNCTIONS 


You remember the angles sin~'x and tan~’x and sec”‘x. In Section4.4 we 
differentiated those inverse functions by the chain rule. The main application was to 
integrals. If we happen to meet jdx/{i +x”), it is tan~'x+C. The situation for 
sinh” ‘x and tanh” 'x and sech” ‘x is the same except for sign changes — which are 
expected for hyperholic functions. We write down the three mew derivatives: 


: 2 : dy 1 
= h7! = h we ee 
y= sinh™ ‘x (meaning x =sinh y} has ax ay (1) 
=tanh~'x (meaning x = tanh y) has By st (2) 
is Bx J dx 1-x? 
-1 . dy -1 
y=sech ‘x (meaning x =sech y) has — = (3) 


dx x /|—x? 


Problems 44-46 compute dy/dx from 1/(dx/dy). The alternative is to use logarithms. 
Since In x is the inverse of e*, we can express sinh~ 1x and tanh” ‘x and sech™'x as 
logarithms. Here is y = tanh ‘x: 


“1, fitx dy 11 11 4 
jin] = as stope 2 = 5 1 2i-x 1-x?" 4) 


The last step is an ordinary derivative of 41n(1 + x)—4In{{ — x). Nothing is new 

except the answer. But where did the logarithms come from? In the middle of the 

following identity, multiply ahove and below by cosh y: 
I1+x Ilttanhy coshyt+sinhy 2 


1-x 1-tanhy coshy~-sinhy e’ 


Then 2y is the logarithm of the left side. This is the first equation in (4), and it is the 
third formula in the following list: 


sinh7+x = In| x + f/x? + 1] eoahee tye Inf x rs Jet] 
tanh~*x = ; esd sech~!x = a oe) 


I-x 


Remark 1 Those are listed only for reference. If possible do not memorize them. The 
derivatives in equations (1), (2), (3) offer a choice of antiderivatives — either inverse 
functions or logarithms {most tables prefer logarithms). The inside cover of the book 
has 
dx | l+x : -~4 
Fe => in Fed +C (in place of tanh” 'x + C). 


1 


Remark 2 Logarithms were not seen for sin” 'x and tan” 'x and sec” +x. You might 
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6 Exponentials and Logarithms 


wonder why. How does it happen that tanh” ‘x is expressed by logarithms, when the 
parallel formula for tan™'x was missing? Answer: There must be a parallel formula. 
To display it I have to reveal a secret that has been hidden throughout this section. 
The secret is one of the great equations of mathematics. What formulas for cos x 
and sin x correspond to He*+e~*) and Ye*~-e™*)? With so many analogies 
(circular vs. hyperbolic) you would expect to find something. The formulas do exist, 
but they involve imaginary numbers, Fortunately they are very simple and there is 
no reason to withhold the truth any longer: 
cos x= ste +e ™) and sinx= = le" ~e*), (5) 
It is the imaginary exponents that kept those identities hidden. Multiplying sin x by 
i and adding to cos x gives Euler’s unbelievably beautiful equation 


cos x+isin x= e*, (6) 


That is parallel to the non-beautiful hyperbolic equation cosh x + sinh x = e*. 

I have to say that (6) is infinitely more important than anything hyperbolic will 
ever be. The sine and cosine are far more useful than the sinh and cosh. So we end 
our record of the main properties, with exercises to bring out their applications. 


6.7__EXERCISES 


Read-through questions 


Cosh x=__@ and sinhx=__b__ and cosh?x —sinh*x = 
c__. Their derivatives are__d sande = and_ ft. 

The point (x, y)=(cosh t, sinh) travels on the hyperbola 
g__. A cable hangs in the shape of a catenary y=__h 


The inverse functions sinh ‘x and tanh” ‘x are equal to 


In[x + ./x? + 1] and $In____. Their derivatives are 3 
and__k__. So we have two ways to write the anti__!_. The 


parallel to cosh x + sinh x =e* is Euler’s formula _m 
The formula cos x =4(e +¢7") involves _m _ exponents, 
The parailel formula for sin x is __© 


1 Find cosh x + sinh x, cosh x — sinh x, and cosh x sinh x. 


2 From the definitions of cosh x and sinh x, find their derivy- 
alives. 


3 Show that both functions satisfy ” = y. 

4 By the quotient rule, verify (tanh x)‘ = sech?x. 

5 Derive cosh?x + sinh*x = cosh 2x, from the definitions. 
6 From the derivative of Problem § find sinh 2x. 


7 The parallel to {cos x +isin x!"=cosnx+isinazn is a 
hyperbolic formula (cosh x + stnh x)" = cosh nx + 


8 Prove = sinh(x + ¥)=sinhxcoshy+coshxsinhy by 
changing to cxponentials. Then the x-derivative gives 
cosh(x + y) = 


Find the derivatives of the functions 9-18: 
9 cosh{3x + 1} 10 sinh x? 
11 l/cosh x 12 sinh(In x) 


14 cosh?x — sinh?x 


16 (1 +tanh x)/(1 —tanh x) 


13 cosh?x + sinh?x 
15 tanh Vx 
17 sinh®x 18 In(sech x + tanh x) 

19 Find the minimum value of cosh(In x) for x > 0. 

20 From tanh x = 2 find sech x, cosh x, sinh x, coth x, csch x. 
21 Do the same if tanh x = — 12:13. 

22 Find the other five values if sinh x = 2. 

23 Find the other five values if cosh x = 1. 


24 Compute sinh(!n 5) and tanh(2 In 4), 


Find antiderivatives for the functions in 25-32: 
25 cosh(2« + 1) 


27 cosh*x sinh x 


26 x cosh(x?} 
28 tanh*x sech*x 


sinh x e+e% 
a 30 coth x= 
1+cosh x ev —e7* 


31 sinh x + cosh x 32 (stnh x +cosh x)" 
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33 The triangle in Figure 6.20 has area 4 cosh ¢ sinh ¢. 
(a) Tntegrate to find the shaded area below the hyperbola 
(b} For the area A in red verify that dA/dt =4 
(c) Conclude that A = 44+ C and show C =0, 


Sketch praphs of the functions in 34-40. 

34 y=tanh x (with inflection point) 

35 y =coth x (in the limit as x — oo) 

3%6 y=sech x 

37 y=sinh"'x 

38 y=cosh”'x for x21 

39 y=sech ‘x forO<x<l 

1 t 

40 y=tanh"'x= 2 in(;*4) for |x| <1 

4% (a) Multiplying x=sinhy=d(e’—e7%) by 2c gives 
(2)? — 2x{e") — 1 =0. Solve as a quadratic equation for e”. 
(b} Take logarithms to find y = sinh” ‘x and compare with 
the text. 

42 (a) Multiplying x=cosh y=4(e+e7%) by 2e” gives 
(e?)? — 2x(e) + 1 = 0. Solve for e”. 
(b) Take logarithms to find y = cosh™ 'x and compare with 
the text. 


43 Turn (4} upside down to prove y= — 1/(1—~x?), if y= 
coth~'x, 

44 Compute dy/dx = 1/,/x? + 1 by differentiating x = sinh y 
and using cosh?y —sinh”y = 1. 

45 Compute dy/dx = 1/1 — x?) if y=tanh~'x by differen- 
tiating x = tanh y and using sech”y + tanh?y = 1. 


46 Compute dy/dx=—1/x,/1—x? for y=sech™'x, by 
differentiating x = sech y. 


From formnolas (1), (2), (3) or otherwise, find antiderivatives in 
47-§2: 


47 | dx/(4— x?) 48 | dxj(a? — x”) 


49 asia 50 fe ax//x* +1 
51 [és Ji=# 52 [en Ji=2 


1/2 1 
53 Compute | Je and | cl 


o 1—x? 0 t— x?" 


54 A falling body with friction equal to velocity squared 

obeys do/dt = g —v’. 
(a) Show that x(t} = Js tanh Jat satisfies the equation. 
(b) Derive this v yourself, by integrating dv/{g — v7) = dt. 
{c) Integrate oft) to find the distance f(t). 

55 A cable hanging under its own weight has slope § = dy/dx 


that satisfies dS/dx =c./1 +S. The constant c is the ratio of 
cable density to tension. 


{a} Show that S =sinh cx satisfies the equation. 

(b) Integrate dy/dx = sinh ex to find the cable height yx), 
if (0) = Ic. 

(c) Sketch the cable hanging between x = —L and x =L 
and find how far it sags down at x = 0. 


56 The simplest nonlinear wave equation (Burgers’ equation} 
yields a waveform W(x) that satisfies W” = WW’ — W’. One 
integration gives W’ = 4W? — W. 
(a) Separate variables and integrate: 
dx =dW/QW? — W)= —dW/Q2— W)- dw/W. 
(b} Check W =4W?— Ww. 
57 A solitary water wave has a shape satisfying the KdV 
equation y” = y — 6py’. 
(a) Integrate once to find y’. Multiply the answer by y’. 
(b) Integrate again to find y (all constants of integration 
are Zero). 
(c) Show that p=4sech?(x/2) gives the shape of the 
“soliton.” 


58 Derive cos ix=coshx from equation (5). What is the 
cosine of the imaginary angle i=./ —1? 


59 Derive sin ix =i sinh x from (5). What is sin i? 


@ The derivative of e'* =cos x + isin x is 
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CHAPTER 7 


Techniques of Integration 


Chapter 5 introduced the integral as a limit of sums. The calculation of areas was 
started— by hand or computer. Chapter 6 opened a different door. Its new functions 
e* and In x led to differential equations. You might say that all along we have been 
solving the special differential equation df/dx = (x). The solution is f= { o(x)dx. But 
the step to dy/dx = cy was a breakthrough, 

The truth is that we are able to do remarkable things. Mathematics has a language, 
and you are learning to speak it, A short time ago the symbols dy/dx and { »(x)dx 
were a mystery. (My own class was not too sure about x(x) itself—the symbol for a 
function.) It is easy to forget how far we have come, in looking ahead to what is next. 


I do want to look ahead. For integrals there are two steps to take—more functions 
and more applications. By using mathematics we make it live. The applications are 
most complete when we know the integral. This short chapter will widen (very much) 
the range of functions we can integrate. A computer with symbolic algebra widens it 
more. 

Up to now, integration depended on recognizing derivatives. If (x)= sec*x then 
f(x) = tan x. To integrate tan x we use a substitution: 


sin x du 
dx = — |— = —Inu= —I1n cos x. 
COS x u 


What we need now are techniques for other integrals, to change them around until 
we can attack them. Two examples are { x cos x dx and J _/1 — x? dx, which are not 
immediately recognizable. With integration by parts, and a new substitution, they 
become simple, 

Those examples indicate where this chapter starts and stops. With reasonable effort 
(and the help of tables, which is fair) you can integrate important functions. With 
intense effort you could integrate even more functions. In older books that extra 
exertion was made—it tended to dominate the course. They had integrals like - 
f(x + Bdx/./2x? — 6x+4, which we could work on if we had to. Our time is too 
valuable for that! Like long division, the ideas are for us and their intricate elaboration 
is for the computer. 

Integration by parts comes first. Then we do new substitutions. Partial fractions 
is a useful idea (already applied to the logistic equation y’ = cy — by’). In the last 
section x goes to infinity or y(x) goes to infinity—but the area stays finite. These 
improper integrals are quite common. Chapter 8 brings the applications. 
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ee =—7.1 Integration by Parts 


There are two major ways to manipulate integrals (with the hope of making them 
easier). Substitutions are based on the chain rule, and more are ahead. Here we 
present the other method, based on the product rule. The reverse of the product rule, 
to find integrals not derivatives, is integration by parts. 

We have mentioned { cos*x dx and | In x dx. Now is the right time to compute 
them (plus more examples). You will see how f In x dx is exchanged for [ 1 dx—a 
definite improvement. Also { xe* dx is exchanged for { e* dx. The difference between 
the harder integral and the easier integral is a known term—that is the point. 

One note before starting: Integration by parts is not just a trick with no meaning. 
On the contrary, it expresses basic physical laws of equilibrium and force balance. 
It is a foundation for the theory of differential equations (and even delta functions). 
The final paragraphs, which are completely optional, illustrate those points too. 


We begin with the product rule for the derivative of u(x) times v(x): 
dv du ad 
) — + v(x) — = — (u(x)v(x)). 
ulx) a + Ux) a = Ge (ued) (1) 


Integrate both sides. On the right, integration brings back u(x)v(x). On the left are 
two integrals, and one of them moves to the other side (with a minus sign): 


{us @ dx = u(x)v(x) — {0 a dx. (2) 
dx dx 


That is the key to this section—not too impressive at first, but very powerful. It is 
integration by parts (u and v are the parts). In practice we write it without x’s: 


7A The integration by parts formula is fu dv = uv — fv du. (3) 


The problem of integrating udv/dx is changed into the problem of integrating 
v du/dx. There is a minus sign to remember, and there is the “integrated term” u(x)v(x). 
In the definite integral, that product u(x)v(x) is evaluated at the endpoints a and b: 


b d . b 

u ss dx = u(b)u(b) — u(a)v(a) — | ov it dx. (4) 
~ ax 5 ax 

The key is in choosing u and v. The goal of that choice is to make ) v du easier than 

fu dv. This is best seen by examples. 


EXAMPLE 41 For | In x dx choose u = In x and dv = dx (so v= x): 


um x dx = uv — fe du=xiInx- x za 


I used the basic formula (3). Instead of working with In x (searching for an antideriva- 
tive), we now work with the right hand side. There x times 1/x is 1. The integral of 
1 is x. Including the minus sign and the integrated term uv = x In x and the constant 
C, the answer is 

fin x dx=xInx-—x+C. (5) 


For safety, take the derivative. The product rule gives In x + x(1/x) — 1, which is In x. 
The area under y= In x from 2 to 3 is 3In3 -—3—21n2+2. 
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To repeat: We exchanged the integral of In x for the integral of 1. 


EXAMPLE 2 For { x cos x dx choose u = x and dv = cos x dx (so v(x) = sin x): 
fx cos x dx=uv — fv du=x sin x— fsin x dx. (6) 
Again the night side has a simple integral, which completes the solution: 
fxcos x dx=x sin x + cos x+C. (7) 


Note The new integral is not always simpler. We could have chosen u = cos x and 
dv = x dx. Then v = 4x”. Integration using those parts give the true but useless result 


Jx cos x dx =uv—fv du= 4x? cos x + f}x? sin x dx. 
The last integral is harder instead of easier (x* is worse than x). [n the forward 
direction this is no help. But in the opposite direction it simplifies {4x7 sin x dx. The 
idea in choosing uw and pv is this: Try to give u a nice derivative and dv a nice integral. 
EXAMPLE 3 For { (cos x)? dx choose » = cos x and dv = cos x dx (so v= sin x): 
J(cos x/'dx = uw — fv du=cos x sin x + f (sin x)? dx. 


The integral of (sin x)? is no better and no worse than the integral of (cos x)*. But we 
never see (sin x)? without thinking of 1 — (cos x). So substitute for (sin x)’: 


f(cos x)? dx =cos x sin x + § 1 dx — f(cos x)’ dx. 
The last integral on the right joins its twin on the left, and [1 dx = x: 
29 (cos x)? dx = cos x sin x + x. 
Dividing by 2 gives the answer, which is definitely not }(cos x)>. Add any C: 
S(cos x)? dx =4(cos x sin x + x)+C. (8) 


Question Integrate (cos x)* from 0 to 2x, Why should the area be z? 

Answer The definite integral is #{cos x sin x + x)]". This does give x. That area can 
also be found by common sense, starting from {cos x)? + (sin x)? = I. The area under 
1 is 2x. The areas under (cos x)” and (sin x)* are the same. So each one is x. 


EXAMPLE 4 Evaluate ftan~'x dx by choosing u = tan ‘x and v= x: 
d 
(tants dx= wo [1 du=x tants | i (9) 


x —. 
1+x? 


The last integral has w = 1 + x? below and almost has dw = 2x dx above: 


dx 1fd 
le <= [ex fin w= Fin + 4, 


l+x? 2]w 2 


Substituting back into (9) gives {tan~ 'x dx as x tan™'x — $\n(1 + x?). All the familiar 
inverse functions can be integrated by parts (take vp = x, and add “+ C” at the end). 

Our final example shows how two integrations by parts may be needed, when the 
first one only simplifies the problem half way. 


EXAMPLE 5 For § x7e*dx choose y= x? and dv=e*dx (so p=e’): 
J x?e*dx = uv — fv du = x7e* — f e*(2x dx). (10) 


7.4 Integration by Parts 


The last integral involves xe*. This is better than xe*, but it still needs work: 
J xe*dx = uv — fv du=xe*—fe*dx (now u=x). (11) 


Finally e* is alone. After two integrations by parts, we reach { e*dx. In equation (11), 
the integral of xe* is xe“ — e*. Substituting back into (10), 


fx*e*dx = xe" — 2[xe*— 7] +C. (12) 
These five examples are in the list of prime candidates for integration by parts: 
ve, x sin x: xoosx, x<ind% e'sinx, ecoss) sin’ °x,, tan “x; .... 


This concludes the presentation of the method—brief and straightforward. 
Figure 7.1a shows how the areas {u dv and {udu fill out the difference between the 
big area u(b)v(b) and the smaller area u(a)v(a). 


v(x) B(x) fy = V(0) 5(x) 


red area = large box spike #4 v(x) 


— small box — gray area area 0 


th] 
= Dy — Uy Uy — Jodu v(0) 


Fig. 7.4 The geometry of integration by parts. Delta function (area 1) multiplies v(x) at x = 0. 


In the movie Stand and Deliver, the Los Angeles teacher Jaime Escalante computed 
J x?sin x dx with two integrations by parts. His success was through exercises— plus 
insight in choosing u and v. (Notice the difference from {x sin x? dx. That falls the 
other way—to a substitution.) The class did extremely well on the Advanced Place- 
ment Exam. If you saw the movie, you remember that the examiner didn’t believe 
it was possible. I spoke to him long after, and he confirms that practice was the key. 


THE DELTA FUNCTION 


From the most familiar functions we move to the least familiar. The delta function is 
the derivative of a step function. The step function U(x) jumps from 0 to | at x =0. 
We write 6(x) = dU/dx, recognizing as we do it that there is no genuine derivative at 
the jump. The delta function is the limit of higher and higher spikes—from the 
“burst of speed” in Section 1.2. They approach an infinite spike concentrated at a 
single point (where U jumps). This “‘non-function” may be unconventional—it is 
certainly optional—but it is important enough to come back to. 

The slope dU/dx is zero except at x =0, where the step function jumps. Thus 
6(x) = 0 except at that one point, where the delta function has a “spike.”” We cannot 
give a value for 6 at x = 0, but we know its integral across the jump. On every interval 
from —A to A, the integral of dU /dx brings back U: 


A - ” A dU a a 
[3 ax= |" Fave ve] .=1 (13) 


“The area under the infinitely tall and infinitely thin spike 6(x) equals 1.” 

So far so good. The integral of d(x) is U(x). We now integrate by parts for a crucial 
purpose—to find the area under v(x)0(x). This is an ordinary function times the delta 
function. In some sense v(x) times d(x) equals v(0) times d(x)—because away from 
x = 0 the product is always zero. Thus e*d(x) equals 6(x), and sin x 6(x) = 0. 
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The area under v(x)d(x) is v(0)—which integration by parts will prove: 


7B The integral of v(x) times 5(x) is [* , v(x)d(x)dx = v(0). 


The area is v(0) because the spike is multiplied by v(0)—the value of the smooth 
function v(x) at the spike. But multiplying infinity is dangerous, to say the least. (Two 
times infinity is infinity). We cannot deal directly with the delta function. Jt is only 
known by its integrals! As long as the applications produce integrals (as they do), we 
can avoid the fact that 6 is not a true function. 

The integral of v(x)d(x) = v(x)dU/dx is computed “by parts:” 


A A 
| v(x)5(x) dx = (x)U(x) }* , — | u(x) ax. (14) 
A -A dx 


Remember that U = 0 or U = 1. The right side of (14) is our area v(0): 
A dv 
wA)-1-} 1 = dx = (A) — (v(A) — v(0)) = v(0). (15) 
0 


When v(x) = 1, this answer matches f ddx = 1. We give three examples: 
[2., cos x d(x) dx=1 f°, (U(x) + d(x))dx=7 fh, (6(X))?dx = co. 


A nightmare question occurs to me. What is the derivative of the delta function? 
INTEGRATION BY PARTS IN ENGINEERING 


Physics and engineering and economics frequently involve products. Work is force 
times distance. Power is voltage times current. Income is price times quantity. When 
there are several forces or currents or sales, we add the products. When there are 
infinitely many, we integrate (probably by parts). 

I start with differential equations for the displacement u at point x in a bar: 


du 


dv ; _, au 
— 5. = Six) with o(x) = k=. (16) 


This describes a hanging bar pulled down by a force f(x). Each point x moves through 
a distance u(x). The top of the bar is fixed, so u(0)= 0. The stretching in the bar is 
du/dx. The internal force created by stretching is v = k du/dx. (This is Hooke’s law.) 
Equation (16) is a balance of forces on the small piece of the bar in Figure 7.2. 


Fig. 7.2 Difference in internal force balances external force 


—Av=fAx or —dv/dx =f(x) 


v= W at x =1 balances hanging weight 


7.4 Integratton by Parts 


EXAMPLE 6 Suppose f{x)= F, a constant force per unit length. We can solve (16): 
uxJ=—Fx+C and = kux)=—$Fx?+Cx+D. (17) 


The constants C and D are settled at the endpoints (as usual for integrals}. At x =0 
we are given u=Qso D=Q, At x= 1 we are given v= W so C= W+ F. Then v(x} 
and u(x) give force and displacement in the bar. 


To see integration by parts, multiply — dv/dx ~ f(x} by u(x) and integrate: 


1 1d 1 f' 44 
{, fladudx) dx = - i Fe M(x) dx = — u4x)0(x) Jo + {, ox) dx. (18) 


The left side is force times displacement, or external work. The last term is internal 
force times stretching—or internal work. The integrated term has u{0) = 0—the fixed 
support does no work. It also has — u{1)W, the work by the hanging weight. The 
balance of forces has been replaced by a balance of work. 

This is a touch of engineering mathematics, and here is the main point. Integration 
by parts makes physical sense! When —du/dx =f is multiplied by other functions— 
called test functions or virtual displacements—then equation (18) becomes the 
principle of virtual work. It is absolutely basic to mechanics. 


7.4 EXERCISES 
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Read-through questions 


Integration by paris is the reverse of the _o _ rule. It 
changes Judy into _b minus _¢__. In case w= x and 


dv =e**dx, it changes [xe*dx to_d minus __@ __. The 


definite integral [> xe”*dx becomes _¢ minus_g_. 


In choosing u and dv, the hoof u and the __/ of 
dv/dx should be as simple as possible. Normally In x goes into 
| and e*goesinto__k __. Prime candidates are u = x or 


x* and v=sinx or__1__ or __m_, When w=x? we need 


n__ integrations by parts. For j sin™ ‘x dx, the choice dy = 
dx leads to__o minus _p_. 


If U is the unit step function, dU//dx = 6 is the unit __a 
function. The integral from —A to A is U(A)— U{—A)= 
s__. The integral of u(x}i{x} equals __s _. The integral 


Rey cos x 6(x)dx equals__+ _. In engineering, the balance of 
forces —dv/dx =f is multiplied by a displacement u(x) and 


integrated to give a balance of __u 


Integrate 1-16, usually by parts (sometimes twice). 


I fxsin x dx 2 [xe dx 

3 [xe *dx 4 { xcos 3x dx 
5 | x*cos x dx (use Problem 1} 

6 [xinxdx 7 f In(2x + 1)dx 


8 { x? e** dx (use Problem 2} 


9 f etsin x dx 10 f e*cos x dx 
[9 and 10 need two integrations. I think e* can be uw or v.] 
Hi fe sin bx dx 12 f xe~*"dx 

13 f sin(tn x) dx 14 § cos(in x) dx 
15 f (In x)’dx 16 § x7In x dx 
17 f sin” ‘x dx 18 f cos” (2x) dx 
19 { xtan™’x dx 
20 { x?sin x dx (from the movie) 
21 f x?cos x dx 22: f x?sin x dx 
23 | x°e*dx 24 [x sec™'x dx 
25 { x sec?x dx 26 § x cosh x dx 


Compute the definite integrals 27-34. 

28 [i ev* dx (let w= ./x) 
30 Jf In{(x?)adx 

32 [2 xsin x dx 


34 5? x? sin x dx. 


27 fin x dx 

29 [x e"7*dx 

31 [5 xcos x dx 
33 fo In(x? + 1)dx 
In 35—40 derive “reduction formulas” from higher to lower 
powers. . 

35 [ x*edx =x" —n J x" e*dx 
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6 f x"edx = 

37 | x*cos x dx =x"sin x—n f x*~'sin x dx 

38 [ x"sin x dx = 

39 f (in x)'dx = x{In x)!"—n f (ln xp~ dx 

40 f x(In x)*dx = 

41 How would you compute { x sin x e“dx using Problem 9? 
Not necessary to do it. 

42 How would you compute f x e tan” 'x dx? Don’t do it. 


43 (a) Integrate [xsinx?dx by substitution and parts. 
(b) The integral f x’sin x?dx is possible if n is 


44-54 are about optional topics at the end of the section. 


44 For the delta function 6(x) find these integrals: 
(a) f°, e7*3(xjdx  (b) J? , x(x)dx (0) S$ cos x d(x)dx. 


45 Solve dy/dx = 3d(x) and dy/dx = 35{x) + p{x). 


46 Strange fact: 6(2x) is different from d(x). Integrate them 
both from —1 to 1. 


47 The integral of 5(x) is the unit step U(x). Graph the next 
integrals R(x)=f U(x)dx and Q(x}=[ Rixjdx. The ramp R 
and quadratic spline Q are zero at x =0. 

48 In 5(x —4), the spike shifts to x = $. It is the derivative of 


the shifted step U(x —4). The integrat of v(x}S(x — }} equals 
the value of » at x = 4. Compute 


(a) {5 dtx— dx; (b) 5 e*5(x — 4)dx; 
(C) f°, S(x)i(x — Hdx. ; 
49 The derivative of 6(x} is extremely singular. It is a “dipole” 
known by its integrals. Integrate by parts in (b} and (c): 
' dé 1 465 : dé 

—d — "| —_— = — p{0). 
@] re x {b) i x ix dx (c} [., v(x) ay dx v{0) 
50 Why is J), U(x)d(x)dx equal to 4? (By parts.) 


51 Choose limits of integration in w(x) =f f(x)dx so that 
dvidx = —f(x) and »=Oatx=t. 


The next section will put old integrals into new forms. For example [ x? 
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52 Draw the graph of a(x) if (1) =0 and —do/dx = fix): 
faf=x, (bY f=U(x—¥); (f= H{x— 4), 


53 What integral u(x} solves & du/dx = v(x) with end con- 
dition «(0}=0? Find u{x) for the three »’s (not f’s) in 
Problem 52, and graph the three u’s, 


54 Draw the graph of AU/Ax =[U(x + Ax) — U(x)V/Ax. 
What is the area under this graph? 
Problems 55-62 need more than one integration. 
55 Two integrations by parts lead to V = integral of v: 
J uv'dx = uv — Vu +f Vu"dx, 

Test this rule on J x’sin x dx. 
56 After n integrations by parts, j u{dvidxjdx becomes 

wy — Uo tua) — or HEI fw 0G yx. 


u™ is the ath derivative of u, and v,,, is the nth integral of v. 
Integrate the last term by parts to stretch this formula to 
n+ | integrations. 


57 Use Problem 56 to find f x?e*dx. 


58 From f(x) —/(0)=f5/'(t)di, integrate by parts (notice ds 
not dx) to reach fix) =f(0) +/'(O)x + Jo s"(otx — odt. Con- 
tinuing as in Problem 56 produces Taylor's formula: 


1 x : —~f a 
fix} =f(0} + f'(O)x + af "Ox? ++ | hoe ay 
~ it] . 
59 What is the difference between fh uw"dx and iis u’w dx? 


60 Compute the areas A = ff Inx dx and B=f) e” dy. Mark 
them on the rectangle with corners (0, 0), (e, 0), (e, 1), (0, 1). 


61 Find the mistake. 1 don’t believe e“ cosh x = e7 sinh x: 
J e*sinh x dx =e* cosh x—f e*cosh x dx 
=e*cosh x—e*sinh x+ f e* sinh x dx. 


62 Choose C and D to make the derivative of 
C e*cos bx + D e“*sin bx equal to e*cos bx. Is this easier 
than tntegrating e**cos bx twice by parts? 


1 — x? dx 


will become f sin?@ cos*@ d@. That looks simpler because the square root is gone. But 
still sin? cos?@ has to be integrated. This brief section integrates any product of sines 


and cosines and secants and tangents. 


There are two methods to choose from. One uses integration by parts, the other 
is based on trigonometric identities. Both methods try to make the integral easy (but 
that may take time). We follow convention by changing the letter @ back to x. 


7.2 Tdgonometric Integrals 


Notice that sin+x cos x dx is easy to integrate. It is u*du. This is the goal in 
Example 1—to separate out cos x dx. It becomes du, and sin x is u. 
EXAMPLE 4 [{ sin*x cos*x dx (the exponent 3 is odd) 
Solution Keep cos x dx as du. Convert the other cos?x to 1 — sin*x: 
sin?x  sin>x 


3 5 


[sins cos*x dx = { sre ~—sin?x)cos x dx = 


EXAMPLE 2 fsin*xdx (the exponent 5 is odd) 


Solution Keep sin x dx and convert everything else to cosines. The conversion is 
always based on sin?x + cos?x = 1: 


Jl — cos*x)sin x dx = [ (1 ~ 2 cos?x + cos*x) sin x dx. 
Now cos x is uand —sin x dx is du. We have {(—1 + 2u? — w4)du. 
General method for { sin"x cos"x dx, when m or n is odd 


If n is odd, separate out a single cos x dx. That leaves an even numher of cosines. 
Convert them to sines. Then cos x dx is du and the sines are w’s. 

If m is odd, separate out a single sin x dx as du. Convert the rest to cosines. 

If m and 7 are both odd, use either method. 

If m and n are both even, a new method is needed. Here are two examples. 


EXAMPLE 3 [cos*x dx (m=0, n= 2, both even) 


There are two good ways to integrate cos?x, but substitution is not one of them. If 
uw equals cos x, then du is not here. The successful methods are integration by parts 
and double-angle formulas. Both answers are in equation (2) below—I don’t see 
either one as the obvious winner. 

Integrating cos?x by parts was Example 3 of Section 7.1. The other approach, by 
double angles, is based on these formulas from trigonometry: 


cos*x =4(1+cos2x) — sin?x = 3(1 — cos 2x) (1) 


The integral of cos 2x is $ sin 2x. So these formulas can be integrated directly. They 
give the only integrals you should memorize—either the integration by parts form, 
or the result from these double angles: 


Scos*x dx equals }x+sinxcosx} or  4x+4%sin2x (plusC). (2) 


fsin?x dx equals }(x—sinxcosx) or  4x-4sin2x ({plusC). (3) 


EXAMPLE 4 {cos*xdx (m=0,n=4, both are even) 


Changing cos*x to ! — sin?x gets us nowhere. All exponents stay even. Substituting 
u=sin x won't simplify sin*x dx, without du. Integrate by parts or switch to 2x. ° 


First solution Integrate by parts. Take u=cos*x and dv =cos x dx: 
{ (cos?x}{cos x dx) = uv ~ fv du=cos*x sin x — f (sin x)(— 3 cos?x sin x dx). 


The jast integral has even powers sin*x and cos*x. This looks like no progress. 
Replacing sin?x by 1 — cos?x produces cos*x on the right-hand side also: 


J cos*x dx =cos*x sin x + 3 f cos?x(1 — cos?x)dx. 
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Always even powers in the integrals. But now move 3 [ cos*x dx to the left side: 
Reduction 4 | cos*x dx =cos?x sin x + 3 J cos?x dx. (4) 


Partial success—the problem is reduced from cos*x to cos?x. Still an even power, 
but a lower power. The integral of cos?x is already known. Use it in equation (4): 


J cos*x dx = 4 cos*x sin x + 3+4(x + sin x cos x} + C. (5) 
Second solution Substitute the double-angle formula cos?x = 4 + 4 cos 2x: 
J cos*x dx = f ($+ 4 cos 2x)’dx =4 J (1+ 2 cos 2x + cos? 2x}dx. 
Certainly J dx = x. Also 2 J cos 2x dx = sin 2x. That leaves the cosine squared: 
f cos?2x = [ $(1 + cos 4x)dx =4x+ § sin 4x + C. 
The integral of cos*x using double angles is 
A[x + sin 2x + $x + gsin 4x] + C. (6) 


That solution looks different from equation (5), but it can’t be. There all angles were 
x, here we have 2x and 4x. We went from cos*x to cos?2x to cos 4x, which was 
integrated immediately. The powers were cut in half as the angle was doubled. 


Double-angle method for | sin™x cos"x dx, when m and n are even. 


Replace sin?x by $(1 ~ cos 2x) and cos*x by 3{1 + cos 2x). The exponents drop to 
m/2 and n/2. If those are even, repeat the idea (2x goes to 4x). If m/2 or n/2 is odd, 
switch to the ‘general method” using substitution. With an odd power, we have du. 


EXAMPLES (Double angle) f sin?x cos?x dx = [ 4(1 — cos 2x)(1 + cos 2x)dx. 


This leaves 1 — cos* 2x in the last integral. That is familiar but not necessarily easy. 
We can look it up (safest) or remember it (quickest) or use double angles again: 


I 2 1 | I 1 x sin 4x 
= = =_ Boe 4x |dx=—-— +C. 
zo cos* 2x)dx 4 (: 73 cos x) x= 3 33 Cc 


Conclusion Every sin”x cos"x can be integrated. This includes negative m and n— 
see tangents and secants below. Symbolic codes like MACSYMA or Mathematica 
give the answer directly. Do they use double angles or integration by parts? 

You may prefer the answer from integration by parts (I usually do). It avoids 2x 
and 4x, But it makes no sense to go through every step every time. Either a computer 
does the algebra, or we use a “reduction formula” from n to n— 2: 


Reduction n [ cos*x dx = cos"~'x sin x + (n— 1) J cos"” *x dx. (7) 


For n= 2 this is f cos?x dx—the integral to learn. For n= 4 the reduction produces 
cos?x. The integral of cos*x goes to cos*x. There are similar reduction formulas for 
sin™x and also for sin™x cos"x. I don’t see a good reason to memorize them. 


INTEGRALS WITH ANGLES px AND qx 
Instead of sin*x times cos°x, suppose you have sin 8x times cos 6x. How do you 


integrate? Separately a sine and cosine are easy. The new question is the integral of 
the product: 


7.2 Trigonometric Integrals 


EXAMPLE 6 = Find [2* sin 8x cos 6x dx. More generally find (2 sin px cos qx dx. 


This is not for the sake of making up new problems. I believe these are the most 
important definite integrals in this chapter {p and q are 0, 1, 2, ...). They may be the 
most important in all of mathematics, especially because the question has such a 
beautiful answer. The integrals are zero. On that fact rests the success of Fourier 
series, and the whole industry of signal processing. 

One approach (the slow way) is to replace sin 8x and cos 6x by powers of cosines. 
That involves cos'*x. The integration is not fun. A better approach, which applies to 
all angles px and qx, is to use the identity 


sin px cos gx =} sin(p + q)x + 4 sin(p — q)x. (8) 


Thus sin 8x cos 6x =}sin 14x +4sin 2x. Separated like that, sines are easy to 


integrate: 
a 1cos 14x 1 cos 2x |?* 
| sin 8x cos 6x dx=| — ZOE 3 20823) <n 


© 0 


Since cos 14x is periodic, it has the same value at 0 and 2z. Subtraction gives zero. 
The same is true for cos 2x, The integral of sine times cosine is always zero over a 
complete period {like 0 to 2). 


What about sin px sin gx and cos px cos qx? Their integrals are also zero, provided 
p is different from q. When p = q we have a perfect square. There is no negative area 
to cancel the positive area. The integral of cos*px or sin*px is x. 
EXAMPLE 7 J" sin 8x sin 7x dx=0 and 3" sin? 8x dx =z. 


With two sines or two cosines (instead of sine times cosine), we go back to the 
addition formulas of Section 1.5. Problem 24 derives these formulas: 


sin px sin gx = — 4 cos(p + q)x + 4 cox(p — g)x (9) 


cos px cos gx= 4$cos(p+q)x +4 cos(p— q)x. (10) 


With p= 8 and q=7, we get cos 15x and cos x. Their definite integrals are zero. With 
p=8 and g=8, we get cos 16x and cos 0x (which is 1). Formulas (9} and (10) also 
give a factor 4. The integral of } is 1: 


JQ” sin 8x sin 7x dx = —4{2" cos 15x dx +42" cos x dx=0+0 
fo" sin 8x sin 8x dx = ~ 45" cos 16x dx + $5" cos Ox dx=O+n7 


The answer zero is memorable. The answer x appears constantly in Fourier series. 
No ordinary numbers are seen in these integrals. The case p=q=1 brings back 
J cos?x dx = 4+ 4 sin 2x. 


SECANTS AND TANGENTS 


When we allow negative powers m and n, the main fact remains true. All integrals 
J sin™x cos"x dx can be expressed by known functions. The novelty for negative pow- 
ers is that logarithms appear. That happens right at the start, for sin x/cos x and for 
ljcos x (tangent and secant): 


f tan x dx = — f du/u= —In|cos x| (here u = cos x} 


fsecxdx= [duju= In|secx+tanx| (here u=sec x + tan x). 
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For higher powers there is one key identity: 1 + tan*x = sec*x. That is the old 
identity cos?x + sin?x = 1 in disguise (just divide by cos?x). We switch tangents to 
secants just as we switched sines to cosines. Since (tan x)’ =sec”x and (sec x)’ = 
sec x tan x, nothing else comes in. 


EXAMPLE 8 { tan?x dx = [(sec?x — I}dx = tan x—x+C. 


EXAMPLE? { tan*x dx =f tan x(sec*x — I)dx. 


The first integral on the right is [ udu=4u?, with u=tanx. The last integral is 
—f tan x dx. The complete answer is (tan x)’ + In|cos x | + C. By taking absolute 
values, a negative cosine is also allowed. Avoid cos x = 0. 


(tan xy"? 


EXAMPLE 10 Reduction [con x)"dx = =] 


— [can xy" ~ 2dx 

Same idea—separate off (tan x)? as sec?x —1. Then integrate (tan x)" "sec?x dx, 
which is u™~?d@u. This leaves the integral on the right, with the exponent lowered by 
2. Every power (tan x)” is eventually reduced to Example 8 or 9. 


EXAMPLE 44 { sec?x dx = uv ~ [ v du=sec x tan x — { tan?x sec x dx 


This was integration by parts, with w= sec x and v=tan x. In the integral on the 
right, replace tan?x by sec?x — 1 (this identity is basic): 


J sec*x dx = sec x tan x — f sec*x dx + [ sec x dx. 


Bring { sec*x dx to the left side. That reduces the problem from sec*x to sec x. 


I believe those examples make the point— trigonometric integrals are computable. 
Every product tan™x sec"x can be reduced to one of these examples. If n is even we 
substitute u=tan x. If m is odd we set u=sec x. If m is even and n is odd, use a 
reduction formula (and always use tan?x = sec?x — 1). 

I mention very briefly a completely different substitution » = tan 3x. This seems to 
all students and instructors (quite correctly) to come out of the blue: 

2u 1-4? 2du 


sin x = —— and cos x = ——; and dx = ——. 
1+ 1+u? 


1+ 2? (11) 


The x-integral can involve sums as well as products—not only sin™x cos"x but also 
1/(5 + sin x — tan x). (No square roots.) The x-integral is a ratio of ordinary polynomi- 
als. It is done by partial fractions. 


Application of { sec x dx to distance on a map (Mercator projection) 


The strange integral In(sec x + tan x) has an everyday application. It measures the 
distance from the equator to latitude x, on a Mercator map of the world. 

All mapmakers face the impossibility of putting part of a sphere onto a flat page. 
You can’t preserve distances, when an orange peel is flattened. But angles can be 
preserved, and Mercator found a way to do it. His map came before Newton and 
Leibniz. Amazingly, and accidentally, somebody matched distances on the map with 
a table of logarithms—and discovered { sec x dx before calculus. You would not be 
surprised to meet sin x, but who would recognize in(sec x + tan x)? 

The map starts with strips at all latitudes x. The heights are dx, the lengths are 
proportional to cos x. We stretch the strips by 1/cos x—then Figure 7.3c lines up 
evenly on the page. When dx is also divided by cos x, angles are preserved—a small 
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Rat COU Es 


5 map width 


0 
Fig. 7.3 Strips at latitude x are scaled by sec x, making Greenland too large. 


square becomes a bigger square. The distance north adds up the strip heights 


dx/cos x. This gives | sec x dx. 


The distance to the North Pole is infinite! Close to the Pole, maps are stretched 
totally out of shape. When sailors wanted to go from A to B at a constant angle with 
the North Star, they looked on Mercator’s map to find the angle. 


7.2 EXERCISES 


Read-through questions 


To integrate sin*xcos>x, replace cos*x by _@ . Then 
(sin*x — sin®x)cosx dx is __b du. In terms of u=sin x the 
integral is _¢ . This idea works for sin™x cos’x if etther m 
ornis 4d 


If both m and n are __@__, one method is integration by 
t__. For J sin*x dx, split off dv = sin x dx. Then —f » du is 
g_. Replacing cos?x by__h __ creates a new sin*x dx that 

combines with the original one. The result is a reduction to 
J sin?x dx, which is known to equal __t 


The second method uses the double-angle formula sin?x = 

i. Then sin*x involves cos? __& _. Another doubling 

comes from cos?2x =__!_. The integral contains the sine of 
m 


To integrate sin 6x cos 4x, rewrite it as }sin 10x+_ on 
The indefinite integral is _ ©. The definite tntegral from 
0 to 2x is__ ® _. The product cos px cos gx is written as 


tcos(p+q)x+_ 4. Its integral is also zero, except if 
s__ when the answer is __§ 


With w= tan x, the integral of tan®x sec*x is__t_. Simi- 


larly { sec®x (sec x tan x dx)=__4__. For the combination 
tan™x sec"x we apply the identity tan?x =__¥_. After reduc- 
tion we may need ftanxdx=__w_and[secxdx=_ x 


Compute 1-8 by the “general method,” when m or n is odd. 
2 § cos*x dx 


4 J cos®x dx 


1 f sin?x dx 

3 j sin x cos x dx 
§ J sin°x cos’x dx 6 { sin’x cos*x dx 

7 [/sin x cos x dx 8 { \/sin x cos*x dx 


9 Repeat Problem 6 starting with sin x cos x = $sin 2x. 


10 Find { sin?ax cos ax dx and | sin ax cos ax dx. 


In 11-16 use the double-angle formulas (m, n even). 
11 f% sin?x dx 12 fF sin*x dx 

13 { cos?3x dx 14 f sin?x cos?x dx 
15 § sin?x dx+{cos?x dx 16 j sin*x cos?2x dx 
17 Use the reduction formula (7) to integrate cos®x. 


18 For n> 1 use formula (7) to prove 


mi2 n—1 ai2 
cos"x dx = —— cos"~ x dx. 
o nm 0 


19 For n= 2, 4, 6, ... deduce from Problem 18 that 
Ie ‘ zm (1}3}---(n— 1) 


5 ee OM 


20 For n= 3, 5, 7, ... deduce from Problem 18 that 


maya) 
[ SOR ease) 


21 (a) Separate dv = sin x dx from u=sin®~'x and integrate 
f sin®x dx by parts. 
(b) Substitute 1—sin?x for cos?x to find a reduction 
formula like equation (7). 

22 For which n does symmetry give {5 cos"x dx = 0? 


23 Are the integrals (a)-(f) positive, negative, or zero? 
(a) [5 cos 3x sin 3x dx — (b) f§ cos x sin 2x dx 
(c) [°,, cos x sinx dx — (d) f§ (cos?x — sin? x} dx 
(e) [2* cos px sin qx dx (f) J? cos*x dx 
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24 Write down equation (9) for p= q = 1, and (10) for p= 2, 
q= 1, Derive (9) from the addition formulas for cos(s + ¢) and 
cos(s — f) in Section 1.5. 


In 25—32 compute the indefinite integrals first, then the definite 
integrals. 

25 [5 cos x sin 2x dx 

27 Jf cos 99x sin 101x dx 

29 [5 cos 99x cos 101x dx 

31 §3" cos x/2 sin x/2 dx 


33 Suppose a Fourier sine series Asinx+ Bsin2x+ 
C sin 3x + -: adds up to x on the interval from 0 to x. Find 
A by multiplying all those functions (including x) by sin x 
and integrating from 0 to 2. (B and C will disappear.} 


26 f§ sin 3x sin 5x dx 
28 J%, cos*3x dx 
3 fg" sin x sin 2x sin 3x dx 


32 fj x cos x dx (by parts} 


34 Suppose a Fourier sine series Asinx+Bsin 2x + 
C sin 3x + «-- adds up to 1 on the interval from 0 to x. Find 
C by multiplying all functions (including 1} by sin 3x 
and integrating from 0 to x. {A and B will disappear.) 

35 In 33, the series also equals x from —2 to 0, because all 
functions are odd, Sketch the “sawtooth function,” which 
equals x from —x to x and then has period 2x. What is the 
sum of the sine sertes at x = 2? 


36 In 34, the series equals —!f from —x to 0, because sines 
are odd functions. Sketch the ‘square wave,” which is 
alternately —1 and +1, and find A and B. 


37 The area under y=sin x from 0 to x is positive. Which 
frequencies p have fj sin px dx = 0? 


38 Which frequencies g have [5 cos qx dx = 0? 
39 For which p,q is Jj sin px cos gx dx =0? 


40 Show that f§ sin px sin qx dx is always zero. 


Compute the indefinite integrals 41-52. 
41 § sec x tan x dx 42 f tan 5x dx 


43 { tan’x sec*x dx 44 { tan*x sec x dx 
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45 { tan x sec*x dx 46 { sec*x dx 
47 f tan*x dx 48 f tanéx dx 
49 { cot x dx 50 f csc x dx 
. ine 
51 iE dx 52 j= dx 
cos*x cos*x 


53 Choose A so that cos x sin x =A cos(x + 2/4). Then 
integrate 1/(cos x — sin x). 


54 Choose A so that cos x — J/3 sin x = A cos(x + 1/3). Then 
integrate l/{cos x — Ja sin x)?. 
55 Evaluate iB" |cos x — sin x| dx. 


56 Show that acosx+dsinx=,/a? +? cos(x—a) and 
find the correct phase angle 2. 


$7 If a square Mercator map shows 1000 miles at latitude 
30°, how many miles does it show at latitude 60°? 


58 When lengths are scaled by sec x, area is scaled by 
. Why is the area from the equator to latitude x 
proportional to tan x? 


59 Use substitution (12) to find f dx/{1 + cos x). 


60 Explain from areas why {} sin?x dx =f} cos?x dx. These 
integrals add to [5 1 dx, so they both equal 


61 What product sin px sin gx is graphed below? Check 
that (pcos px sin gx — q sin px cos gx}{q? — p*) has _ this 
derivative. 


62 Finish j sec>x dx in Example 11. This is needed for the 
length of a parabola and a spiral (Problem 7.3.8 and 
Sections 8.2 and 9.3). 


The most powerful tool we have, for integrating with pencil and paper and brain, is 
the method of substitution. To make it work, we have to think of good substitutions— 
which make the integral simpler. This section concentrates on the single most valu- 
able collection of substitutions. They are the only ones you should memorize, and 


two examples are given immediately. 
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3 ‘ eee 
To integrate ./1 — x’, substitute x = sin 6. Do not set u=1— x? (= is Missing 


_ f dx os @ dé 
[vi — x? ax - | (cos A)(cos @ dé) { ge tes (= 


vias cos 0 


The expression \/1— x? is awkward as a function of x. It becomes graceful as a 
function of @. We are practically invited to use the equation 1 — (sin @)? = (cos 6)’. 
Then the square root is simply cos 8— provided this cosine is positive. 

Notice the change in dx. When x is sin @, dx is cos @ d@. Figure 7.4a shows the 
original area with new letters. Figure 7.4b shows an equal area, after rewriting 
[ (cos 8)(cos @ d6) as | (cos*@) dé. Changing from x to 6 gives a new height and a new 
base. There is no change in area—that is the point of substitution. 

To put it bluntly: If we go from ,/1 — x? to cos @, and forget the difference between 
dx and d@, and just compute { cos 6 d6, the answer is totally wrong. 


tl cos @ 
we | TN 
d@ r d8 R 
2 2 


Fig. 7.4 Same area for ./1 — x? dx and cos?6 d@. Third area is wrong: dx #d6 


We still need the integral of cos*@. This was Example 3 of integration by parts, and 
also equation 7.2.6, It is worth memorizing. The example shows this @ integral, and 
Teturns to x: 


EXAMPLE 4 j cos?@ d@ =} sin 6 cos 9+ 36 is after substitution 
J /1—x? dx =4x,/1— x? +} sin” 'x is the original problem. 


We changed sin 8 back to x and cos @ to \/ 1 — x”. Notice that @ is sin~'x, The answer 
is trickier than you might expect for the area under a circular arc. Figure 7.5 shows 
how the two pieces of the integral are the areas of a pie-shaped wedge and a triangle. 


dx cos @ dé 
EXAMPLE 2 me | an 8+ C=sin 'x+C 


Remember: We already know sin ‘x. Its derivative t/,/1— x? was computed in 
Section 4.4. That solves the example. But instead of matching this special problem 


tg dgiciy 
wedpe area 5 8 7 sin x 
yavinx? 
triangle | area svi x7 
x 0 ] 


Fig. 7.5 | ./1— x? dx is a sum of simpler areas. Infinite graph but finite area. 
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with a memory from Chapter 4, the substitution x = sin @ makes the solution auto- 
matic. From { d= @ we go back to sin” ‘x. 

The rest of this section is about other substitutions. They are more complicated 
than x = sin @ (but closely related). A table will display the three main choices—sin 6, 
tan @, sec @—and their uses. 


TRIGONOMETRIC SUBSTITUTIONS 


After working with ,/1 — x?, the next step is ./4 — x”. The change x = sin @ simplified 
the first, but it does nothing for the second: 4 — sin?6 is not familiar. Nevertheless a 
factor of 2 makes everything work. Instead of x = sin 0, the idea is to substitute x = 


2 sin 4: 
/4—x?=./4-— 4 sin?6=2 cos @ and dx =2 cos @ dé. 


Notice both 2’s. The integral is 4 { cos?6 d6 = 2 sin 8 cos 6 + 26. But watch closely. 
This is not 4 times the previous { cos?@ dO! Since x is 2 sin @, @ is now sin~ (x/2). 


EXAMPLE3 [ ./4— x? dx = 4 { cos? dO = x./1 — (x/2)? + 2 sin” '{x/2). 
Based on ./1 — x? and ./4— x’, here is the general rule for ./a? — x*. Substitute 


x=asin @. Then the a’s separate out: 


Ja —x*=/a*—a*sin’?@=acos@ and dx =a cos @ dé. 


That is the automatic substitution to try, whenever the square root appears. 


a4 dx -(" 4cos@ dd sf"? ae=" 
x=0./16—x? Jo=0./47—4%(sin 6)? Je=o 2° 

Here a? = 16. Then a= 4 and x= 4 sin @. The integral has 4 cos @ above and below, 
so it is [ d0. The antiderivative is just 6. For the definite integral notice that x= 4 
means sin 6 = 1, and this means @= n/2. 


A table of integrals would hide that substitution. The table only gives sin~ 1(x/4). 
There is no mention of [ d@= 6. But what if 16 — x? changes to x? — 16? 


EXAMPLE 4 


8 
EXAMPLE 5 | =e 254 

x=4a ./x*— 16 
Notice the two changes—the sign in the square root and the limits on x. Example 4 
stayed inside the interval |x| <4, where 16 — x? has a square root. Example 5 stays 
outside, where x? — 16 has a square root. The new problem cannot use x = 4 sin 0 
because we don’t want the square root of —cos7@, 


The new substitution is x = 4 sec @. This turns the square root into 4 tan 0: 
x=4sec@ gives dx=4sec@tan@d@ and x?-16=16sec?@— 16= 16 tan7é. 


This substitution solves the example, when the limits are changed to @: 


“34sec Otand@ [* m 
i dtane {. sec 0 d@ = In(sec 9 + tan a) | = In(2 + /3). 


I want to emphasize the three steps. First came the substitution x =4 sec @. An 
unrecognizable integral became [ sec 6 dO. Second came the new limits (@=0 when 
x = 4, = 7/3 when x = 8). Then I integrated sec 6. 


7.3 Trigonometric Substitutions 


Example 6 has the same x? — 16. So the substitution is again x = 4 sec 0: 


= 16 dx \- Se = | ae 


EXAMPLE 6 [. (x? = 16)3/ = _ (4 tan 63 wis ana 


Step one substitutes x = 4 sec 0. Step two changes the limits to 6. The upper limit 
x = o0 becomes @ = 2/2, where the secant is infinite. The limit x = 8 again means 6 = 
m/3. To get a grip on the integral, I also changed to sines and cosines. 

The integral of cos @/sin*@ needs another substitution! (Or else recognize 
cot @ csc 0.) With u = sin @ we have { du/u? = — 1/u= — 1/sin 0: 


n/2 = n/2 2 
Solution | Ss =| <e.. 
x3 -sin*@ sin 6 |,/3 


Warning With lower limit 6 = 0 (or x = 4) this integral would be a disaster. It divides 
by sin 0, which is zero. This area is infinite. 


(Warning)? Example 5 also blew up at x = 4, but the area was not infinite. To make 
the point directly, compare x '/? to x */?. Both blow up at x = 0, but the first one 


has finite area: 
1 1 1 \, 1 = 
— dx =2,/x|,=2 —; dx = —=] =. 
\ x veh 9 x fs 0 


Section 7.5 separates finite areas (slow growth of 1/,/x) from infinite areas (fast 
growth of x */?). 


Last substitution Together with 16 — x? and x? — 16 comes the possibility 16 + x?. 
(You might ask about —16— x’, but for obvious reasons we don’t take its square 
root.) This third form 16+ x? requires a third substitution x=4tan @. Then 
16 + x? = 16 + 16 tan7@ = 16 sec”@. Here is an example: 


2S ge "2 Asec?9d0 1, /*? ox 
ee le 16+ x? \.. 16sec*9 4 | 8 


Table of substitutions for a* — x?, a? + x?, x? — a? 


replaces a?—x? by a’cos?@ and dx by acos 6 d0 


replaces a*+x? by a’sec?@ and dx by asec?@ dé 


replaces x?—a* by a*tan?@ and dx by asec @ tan 0 dé 


Note There is a subtle difference between changing x to sin @ and changing sin @ to u: 
in Example 1, dx was replaced by cos @ d@ (new method) 
in Example 6, cos @ d@ was already there and became du (old method). 


The combination cos @ d@ was put into the first and pulled out of the second. 

My point is that Chapter 5 needed du/dx inside the integral. Then (du/dx)dx 
became du. Now it is not necessary to see so far ahead. We can try any substitution. 
If it works, we win. In this section, x = sin @ or sec # or tan @ is bound to succeed. 


dx . o xdx _ [du . 
NEW \; a on |e by trying x = tan 0 OLD F re: \¢ by seeing du 
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We mention the Ayperbolic substitutions tanh @, sinh @, and cosh @. The table below 
shows their use. They give new forms for the same integrals. If you are familiar with 
hyperbolic functions the new form might look simpler—as it does in Example 8. 


x=atanhé@ replaces a?-—x? by «a? sech?9 and dx by a sech’6 dé 


x=asinh@ replaces a*+x* by a’? cosh?@ and dx by acosh d dé 


x=acosh@ replaces x*-«a? by a?sinh?@ and dx by a sinh @ dé 


dx ___ [sinh @ dé 
fx? — } sinh @ 


f dé is simple. The bad part is cosh~'x at the end. Compare with x = sec @: 


EXAMPLE 8 


=@~-C=cosh"'x+C. 


dx _ [sec@tané dé _ _ a 
(=: ad =In(sec @ + tan 4) + C= inte + /x*— 1+. 


This way looks harder, but most tables prefer that final logarithm. It is clearer than 


cosh ~'x, even if it takes more space. All answers agree if Problem 35 is correct. 


COMPLETING THE SQUARE 


We have not said what to do for \/x? — 2x + 2 or ./ — x? + 2x. Those square roots 


contain a linear term—a multiple of x. The device for removing linear terms is worth 
knowing. It is called completing the square, and two examples will begin to explain it: 
x? -2x4+2=(x—- 1)? +1=u+1 
-x?+2x=-(x-1P+1=1-w. 
The idea has three steps. First, get the x? and x terms into one square. Here that 
square was (x ~ 1)? = x? — 2x + 1. Second, fix up the constant term. Here we recover 


the original functions by adding 1. Third, set «= x — | to leave no linear term. Then 
the integral goes forward based on the substitutions of this section: 


| dx 7 |-4- | —_— | du 
The same idea applies to any quadratic that contains a linear term 26x: 
rewrite x*+2bx~>c¢ as (x +6)? +C, with C=c~ 6? 
rewrite x?+2hx+¢ as —(x—b)?+C. with C=c+h? 
To match the quadratic with the square, we fix up the constant: 
x?+410xt+16= (x+5)*+C leads to C=16—-25= —9 
—x?+10x+ 16= —(x—- 5) + C leads to C= 16+ 25= 41. 


EXAMPLE 9 ee Oe 
x? + 10x + 16 {x + 5)? -9 u?2-9 


Here u=x+5 and du=dx. Now comes a choice—struggle on with u = 3 sec @ or 
look for [ du/(u* — a?) inside the front cover. Then set a = 3: 


du alin u—3 
uw—-9 6 |ut+3 


x+2 
x+8 


==In 
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Note If the quadratic starts with 5x? or —5x?, factor out the 5 first: 
5x? — 10x + 25 = 5(x? — 2x + 5)= (complete the square) = 5[(x — 1)? + 4]. 


Now u=x—1 produces 5[u? + 4]. This is ready for table lookup or u = 2 tan 6: 


dx 
atl ie 10x +25 — 


Stu? +4) 


du 2sec70d@ 1 
= |= am = | 
5[4 sec*8] 10 


This answer is 6/10 + C. Now go backwards: 6/10 = (tan ' $u)/10 = (tan~ + {x — 1))/10. 
Nobody could see that from the start. A double substitution takes practice, from x 
to u to @. Then go backwards from @ to u to x. 


Final remark For u? + a? we substitute u= a tan @. For u? ~ a” we substitute u= 
asec 6. This big dividing line depends on whether the constant C (after completing 
the square) is positive or negative. We either have C = a* or C= —a*. The same 
dividing line in the original x* + 2bx +c is between c > 6? and c < b?. In between, 
c = b? yields the perfect square (x + 5)’— and no trigonometric substitution at all. 


7.3 EXERCISES 


Read-through questions 


The function ./1 ~ x? suggests the substitution x =__¢ 
The square root becomes __b and dx changes to _¢_. 
The integral [(1~x7)**dx becomes {__¢__d@. The interval 
4<x<1changessto_e <@<_¢ 


For ./a?~— x? the substitution is x=_9@ with dx= 
h_. For x?-a*? weusex=__! withdx=__1 . Then 


§ dx{1+x?) becomes f d6, because 1+tan?@=__k_. The 
answer is @=tan~*x. We already knew that __|! 
derivative of tan™ ‘x, 


The quadratic x? + 2bx +¢containsa__™__ term 2bx. To 
remove it we __"__ the square. This gives (x + 5)? + C with 
C=_0__. The example x? +4x+9 becomes __p__. Then 
u=x +2. In case x? enters with a minus sign, —x?+ 4x+9 
becomes (_4__)*+__' . When the quadratic contains 


4x?, start by factoring out __3 


Jutegrate 1-20 by substitution. Change @ back to x. 
‘il dx 2 | dx 
J4—-x? Ja 
dx 
~~ 2 — 
3 [v4 x* dx ‘| 
s | x? dx «| dx 
J/l-x? x? /1—x? 
7 fesse B f/x? +a? dx (see 7.2.62) 


2 
| x 25 3 
x 


1° | x? dx 
J/o—xt 


ul [-#= 12 { x — x8 dx 
x°—x 
dx dx 

13 Fessc i4 lar 
d 2 

15 ; x = #16 Yitx dx 

{x? — 9)* x 

2 

“7 |= ax 18 aa dx 
x—1 x +4 


21 (Important) This section started with x = sin @ and 


fdx//1—x? =f dd =6 =sin™'x. 


(a) Use x =cos @ to get a different answer. 
(b) How can the same integral give two answers? 


22 Compute { dx/x,/x*— 1 with x = sec 0. Recompute with 
x=csc @, How can both answers be correct? 


23 Integrate x/(x? + 1} with x = tan 6, and also directly as a 
logarithm. Show that the results agree. 


24 Show that f dx/x /x*-— 1=4 sec” (x), 
Calculate the definite integrals 25—32. 


a 
25 { a’ —x* dx = area of 


1 
26 | (1— x)? dx 
-1 
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oe fa dx - Poy dx 
Ji fx? 2 (x? - 1)? 

a ei x dx i we dx 
Jeux +t ~, x2 49 
ie — 

32 xi] —x? dx = area of 
wi. 


33 Combine the integrals to prove the reduction formula 


(2 # QO): 
ye 1 x" yo 1 
(3 +1 aol nH \s +1 ae 


34 Integrate !icos x and 1/(1 + cos x) and \/ 1 +cos x. 


35 (a)x= gives [ dx'/x? — 1 =In(sec @ + tan 0). 
{b) From the triangle, this answer ts f=In(x + fx? — 1p. 
Check that dfidx =1/¥/x? — 1. 

(c) Verify that coshf=4 (ef +e fJ)=x. Thenf=cosh ‘x, 
the answer in Example &. 

36 (a)x= gives [ dxi, V4l= In(sec @ + tan @). 
tb) The second triangle converts this answer to ¢ = In(x + 
yon? +1), Check that dedx = Lies? + 1. 

(c) Verify that sinh g = 4(e* 


e *J=x so g=sinh 


var 


In 37-42 substitute \ = sinh ?, cosh @. or tanh f). After integ- 
ration change back to x. 


pdx dx 
37 | >> 38 | ==; 
yen! XVl-N 
ao “on 
wT 7 mee ak! 
39 | a Weds 40 — dx 
(od: Po fpae 
at | oe a2 |* ax ds 
[—x° x 


7.4 Partial Fractions 


Rewrite 43-48 as (x + bP + C or —(x — 6)? + C by completing 
the square. 


43 x?—4x48 44 -x?4+2x48 
45 x? — 6x 46 -x?+10 
47 x7 4+2x41 48 x7 +4x— 12 


49 For the three functions f(x} in Problems 43, 45, 47 
integrate 1/ f(x). 


50 For the three functions g{x) in Problems 44, 46, 48 
mtegrate 1/,// a(x). 


$1 kor { dxi(x* + 2bx + ¢) why does the answer have different 
forms for b? > ¢ and hb? <¢? What is the answer if 67 = ¢? 


52 What substitution u=x +h or u=x —b will remove the 
linear term? 


oh tb) | —** 
ax te 3x? + 6x 


dx dx 
(c) |= 10x+¢ | x 


53 Find the mistake. With x=sin@ and \/1—x? =cos 0, 
substituting dx =cos @ d0 changes 


2n o. 
cos?0 df into [ fl—x? dx. 
C 


Jo Ja 
54 (a) If x =tan @ then | 1 +x7dx =[ dé. 
(b) Convert 4fsec 0 tan # + In(sec 0 + tan @)] back to x. 
(c) [fx =sinh @ then [ 1 +x7 dx =] dé). 


(dj Convert 4[sinh @ cosh @ + #] back to x. 
These answers agree. In Section 8.2 they will give the length 
of a parabola. Compare with Problem 7.2.62. 


5§ Rescale x and y in Figure 7.5b to produce the equal area 
J ydx in Figure 7.5c. What happens to y and what happens 
to dx? 


56 Draw y-1 vl —x* and yr 1216 — x? to the same 
scale (1° across and up; 4” across and 4" up). 


57 Whatts wrong. if anything. with 


This section is about rational functions P(x):Q(x). Sometimes their integrals are also 
rational functions (ratios of polynomials). More often they are not. It is very common 
for the integral of P’Q to involve logarithms. We meet logarithms immediately in the 
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simple case 1/(x — 2), whose integral is In|x — 2| + C. We meet them again in a sum 
of simple cases: 


(ls peer $ ix= tax 2| + 3ln|x + 2|—4Injx/+C. 


Our plan is to split P/Q into a sum like this—and integrate each piece. 
Which rational function produced that particular sum? It was 
1 Es 3 4 (x +20) + Hx— 2(xX)-4x—-2Ae+2 —4x+16 


x-2 x+2 x (x — 2)(x + 2)(x) (x — 2)(x + 2})(x) 


This is P/Q. It is a ratio of polynomials, degree 1 over degree 3. The pieces of P are 
collected into — 4x + 16. The common denominator (x — 2)(x + 2)(x) = x° — 4x is Q. 
But I kept these factors separate, for the following reason. When we start with P/Q, 
and break it into a sum of pieces, the first things we need are the factors of Q. 


In the standard probiem P/@ is given. To integrate it, we break it up. The goal of 
partial fractions is to find the pieces—to prepare for integration. That is the technique 
to learn in this section, and we start right away with examples. 


EXAMPLE 4 Suppose P/Q has the same Q but a different numerator P: 


P  3x*+8x-4 A B Cc 
= = si + (1) 
Q@ (x-—2)(x+2)(x) x-2 x+2 x 
Notice the form of those pieces! They are the “partial fractions” that add to P/Q. 
Each one is a constant divided by a factor of @. We know the factors x — 2 and x + 2 


and x. We don’t know the constants A, 8B, C. In the previous case they were 1, 3, —4. 
In this and other examples, there are two ways to find them, 


Method t (slow) Put the right side of (t} over the common denominator Q: 
3x? + 8x —4 m A(x + 2)(x} + B(x — 2}{x) + C(x — 2)(x + 2) 
Q {x — 2)(x + 2)(x) 


Why is A multiplied by (x + 2)(x)? Because canceling those factors will leave A/(x — 2) 
as in equation (1). Similarly we have B/(x + 2) and C/x. Choose the numbers A, B, C 
so that the numerators match. As soon as they agree, the splitting ts correct. 


(2) 


Method 2 (quicker) Multiply equation (1) by x — 2. That leaves a space: 


3x? +8x-4 | Bix-— 2) C(x — 2) 
(x +2)(x) io ope @) 


Now set x = 2 and immediately you have A. The last two terms of (3) are zero, because 
x — 21s zero when x = 2. On the jeft side, x = 2 gives 
3(2)? + 8(2)-4 24 she 
See hich ’ 
Q+2Q) 8 3 (which is A) 
Notice how multiplying by x — 2 produced a hole on the left side. Method 2 is the 
“cover-up method.” Cover up x—2 and then substitute x = 2, The result is 3= 
A+0O+0, just what we wanted. 
In Method 1, the numerators of equation (2) must agree. The factors that multiply 
Band C are again zero at x = 2. That leads to the same A—but the cover-up method 
avoids the unnecessary step of writing down equation (2). 
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Catculation of B Multiply equation (1) by x + 2, which covers up the (x + 2): 
3x*+8x-4 A(x+2) C(x + 2) 
= + BY 
(x-2) (x) x-2 x (4) 
Now set x = — 2, so A and C are multiplied by zero: 


ICAP PSC HA 8 a 
(2-2) €2 86 # #© 


This is almost full speed, but (4) was not needed. Just cover up in Q and give x the 
right value (which makes the covered factor zero). 
Catculation of C (quickest) In equation {1), cover up the factor (x) and set x = 0: 
3(0)? + 80) — 4 —4 
—~=—=|=€C, 5 
(0 — 2}(0 + 2) —4 (5) 


To repeat: The same result 4 = 3, B= — 1, C=1 comes from Method I. 


x+2 A B 
EXAMPLE 2 at 
{x-—D(x+3) x-1 xt+3 
First cover up (x — 1) on the left and set x = 1. Next cover up (x + 3) and set x = — 3: 
1+2 3 -3+2 ~1 
——_—___-——_ = -=4 = =f, 
( \l+3) 4 {(-3- 1} ) -4 


The integral is #1n|x — | +4ln|x+3)+C. 


EXAMPLE 3 This was needed for the logistic equation in Section 6.5: 


I A B 


We-by) yen by ” 


First multiply by y. That covers up y in the first two terms and changes B to By. 
Then set y = 0. The equation becomes 1/c = A. 

To find B, multiply by c — by. That covers up c — by in the outside terms. In the 
middle, A times c — by will be zero at y=c/b. That leaves B on the right equal to 
I/y = b/e on the left. Then A=I1/ce and B=bd/c give the integral announced in 


Equation 6.5.9: 
dy dy b dy Iny In(c — by) 
(= — by? {2 (= by) oc ¢ 7) 


It is time to admit that the general method of partial fractions can be very awkward. 
First of ail, it requires the factors of the denominator Q. When Q is a quadratic 
ax? + bx +c, we can find its roots and its factors. In theory a cubic or a quartic can 
also be factored, but in practice only a few are possible—for example x*—~ 1 is 
(x? — 1)(x? + 1). Even for this good example, two of the roots are imaginary. We can 
split x? — 1 into (x + 1)(x — 1). We cannot split x* + 1 without introducing i. 

The method of partial fractions can work directly with x2 + 1, as we now see. 


EXAMPLE 4 


3x? +2547 ; : 
ee dx {a quadratic over a quadratic). 


x7 +1 


This has another difficulty. The degree of P equals the degree of Q (= 2). Partial 
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fractions cannot start until P has lower degree. Therefore I divide the leading term x? 
into the leading term 3x?. That gives 3, which is separated off by itself: 


sl a Se ax+4 
x? + | xrt+d 


(8) 


Note how 3 really used 3x? + 3 from the original numerator. That left 2x + 4. Partial 
fractions will accept a linear factor 2x + 4 (or Ax + B, not just A) above a quadratic. 

This example contains 2x/(x?+ 1), which integrates to In(x?+ 1). The final 
4/(x? + 1) integrates to 4tan~+x. When the denominator is x? + x + t we complete 
the square before integrating. The point of Sections 7.2 and 7.3 was to make that 
integration possible. This section gets the fraction ready—in parts. 


The essential point is that we never have to go higher than quadralics. Every 
denominator Q can be split inte linear factors and quadratic factors. There is no magic 
way to find those factors, and most examples begin by giving them. They go into 
their own fractions, and they have their own numerators—which are the A and B 
and 2x + 4 we have been computing. 

The one remaining question is what to do if a factor is repeated. This happens in 
Example 5. 


axt+3_ A | iB 
(x-12) &-1) (x- 1)” 


EXAMPLE 5 


The key is the new term B/(x — 1}*. That is the right form to expect. With (x — 1)(x — 2) 
this term would have been B/(x — 2). But when {x — 1} is repeated, something new is 
needed. To find B, multiply through by (x — 1)? and set x = 1: 


2x+3=A(x—1)+B becomes 5=8 when x=1. 
This cover-up method gives B. Then A=2 is easy, and the integral is 


2 In{x — 1| — 5/{x — 1). The fraction 5/(x — 1)? has an integral without logarithms. 


Dy Ox? BA! CA SB CRED. 9 E 
EXAMPLE eA . 
? ote dixe 1 x x? x*4+4 x l 


This final example has almost everything! It is more of a game than a calculus 
problem. In fact calculus doesn’t enter until we integrate (and nothing is new there). 
Before computing A, B, C, D, E, we write down the overail rules for partial fractions: 


1. The degree of P must be less than the degree of Q. Otherwise divide their leading 
terms as in equation (8) to lower the degree of P. Here 3 < 5. 

2. Expect the fractions illustrated by Example 6. The linear factors x and x + 1 
(and the repeated x”) are underneath constants. The quadratic x? + 4 is under a 
linear term. A repeated (x? + 4)? would be under a new Fx + G. 

3. Find the numbers A, 8, C, ... by any means, including cover-up. 

4. Integrate each term separately and add. 


We could prove that this method always works. It makes better sense to show that 
it works once, in Example 6. 


To find E, cover up (x—1) on the left and substitute x=1. Then E=3. 


To find B, cover up x? on the left and set x =0. Then B=4/(0+ 4)(0— 1)= —-1. 


The cover-up method has done its job, and there are several ways to find A, C, D. 
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Compare the numerators, after multiplying through by the common denominator Q: 
2x3 + 9x72 +4 = Ax(x? + 4)(x— 1)— (x? + 4)(x — 1) + (Cx + Di(x?x — 1) + 3x?(x? + 4). 
The known terms on the right, from B= — 1 and E =3, can move to the left: 
—3x4+ 3x3 — 4x2 + dx = Ax{x? + 4)(x — 1) + (Cx + D)x?(x — 1). 
We can divide through by x and x — 1, which checks that B and E were correct: 
—3x?-4= A(x? + 4) + (Cx + D)x. 
Finally x=0 yields A= —1. This leaves —2x? =(Cx+ D)x. Then C= —2 and 


D=0. 


You should mever have to do such a problem! I never intend to do another one. 
It completely depends on expecting the right form and matching the numerators. 
They could also be matched by comparing coefficients of x*, x°, x7, x, 1—to give five 
equations for A, B,C, D, E. That is an invitation to human error. Cover-up is the 
way to start, and usually the way to finish. With repeated factors and quadratic 


factors, match numerators at the end. 


7.4 EXERCISES 


Read-threugh questions 


The idea of __a_ fractions is to express P({x)/Q(x)asa__b 
of simpler terms, each one easy to integrate, To begin, the 
degree of P should be __¢_ the degree of @. Then @ is split 


into__d__ factors like x — 5 (possibly repeated) and quadratic 


factors like x?+x+1 (possibly repeated). The quadratic 
factors have two __@ _ roots, and do not allow real linear 
factors. 


A factor like x—‘5 contributes a fraction A/_f_. Its 
integral is _¢@ . To compute A, cover up _h _ in the 
denominator of P/Q. Then set x=__1_, and the rest of 
P/Q becomes A. An equivalent method puts all fractions over 
a common denominator (which is __| _}. Then match the 

k_. At the same point x=___| ___ this matching gives A. 


A repeated linear factor (x —5)? contributes not only 
Af(x — 5) but also B/_ m__. A quadratic factor like x? +x + 1 
contributes a fraction on f(x? +x + 1) involving C and D. 
A repeated quadratic factor or a triple linear factor would 
bring in (Ex + F)/(x? +x +4 1)? or Gi{x — 5}>. The conclusion 
is that any P/Q can be split into partial 0 , which can 
always be integrated. 


1! Find the numbers A and 8 to split 1/{x? — x}: 


] _A B 
x(x-1) 0 x xl 


Cover up x and set x =0 to find A. Cover up x—1 and set 
x =! to find B. Then integrate. 
2 Find the numbers A and B to split 1/(x? — 1): 

1 A B 


xe—-210 x-1 x4 


Multiply by x —1 and set x=1. Multiply by x +1 and set 
x = —1. Integrate. Then find A and B again by method 1— 
with numerator A(x + 1) + B(x — 1) equal to 1. 


Express the rational functions 3-16 as partial fractions: 


1 x 
3 DR—D *e-G-2) 
x? +] 6 I 
{x)(x + 1} (x + 2) xe—x 
4 3x4] 3x + 1 
x? (x-1P 
3x? 1 
9 Fai (divide first) 10 = Dix? + 1) 
] x 
at x*(x — 1) Ae x*—4 


1 
” x(x — 1)(x — 2)(x — 3) 


1 ] 
Cal) xt DK-NOe +) 


24] 
14 ~*~ (divide first) 
x+l 


15 


x-1) 


! A 
16 =—— (emembe the — tem] 
x*( x 


17 Apply Method 1 (matching numerators) to Example 3: 
1 AB A(c—by) 4+ By 
cy—by y em by y(e— by) 
Match the numerators on the far left and far right. Why does 
Ac = 1? Why does —bA +B =0? What are A and B? 
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18 What goes wrong if we look for A and B so that 


x? A B 
= +—? 
(x—3)(x+3) x-3 x+3 


Over a common denominator, try to match the numerators. 
What to do first? 


10 Sek eo Oe A eC 
Pe et & et txth xl xed 


(a} Cover up x — 1 and set x= 1 to find A. 
(b} Subtract A/(x — 1) from the left side. Find Bx + C. 
{c) Integrate all terms. Why do we already know 


In(x? — 1) =In(x — 1) + In(x? +x +L)? 


20 Solve dy/dt = 1 — y? by separating [dy/1—y? =[dt. Then 


—= = + 
1-y? (1—y(it+y) I-y ft+y 


=t+C, With yo =0 the con- 
. Taking exponentials gives 
. This is the S-curve, 


Integration gives 4In 
stant is C= 
The solution is y= 
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By substitution change 21--28 to integrals of rational functions. 
Problem 23 integrates 1/sin @ with no special trick. 


21 { cox 2 [inva 
e* — e 1+ /x 
sin 6 d0 at 

ape a [oy ey 
l+e 3/x—8 

dx dx 
. 23 |" 
In x+1 [ate 
29 Multiply this partial fraction by x —a. Then let x > a: 
ne. om 
Ox) x—-a 


Show that A = 1/Q'(a). When x =a is a double root this fails 
because {a} = 


+--+, Use Problem 29. 


, i I 
Kil) Find Ain SF = 


31 {for instructors only) Which rational functions P/@ are the 
derivatives of other rational functions (no logarithms)? 


“Improper” means that some part of i y(x)dx becomes infinite. It might be b or a or 
the function y. The region under the graph reaches infinitely far—to the right or left 


or up or down. (Those come from b= oo and a= 


— oo and p> oo and y> — @,) 


Nevertheless the integral may “converge.”” Just because the region is infinite, it is not 
automatic that the erea is infinite. That is the point of this section—to decide when 


improper integrals have proper answers. 


The first examples show finite area when b = oo, then a= — oo, then y= if/x at 


x= 0, The areas in Figure 7.6 are 1, 1, 2: 


-1 


Fig. 7.6 The shaded areas are finite but the regions go to infinity. 
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In practice we substitute the dangerous limits and watch what happens. When the 
integral is —1/x, substituting b= oo gives ‘“‘—1/oo =0.” When the integral is e*, 
substituting a= — 00 gives “e ~*~ =0.” I think that is fair, and I know it is successful. 
But it is not completely precise. 

The strict rules involve a limit. Calculus sneaks up on 1/co and e ® just as it 
sneaks up on 0/0. Instead of swallowing an infinite region all at once, the formal 
definitions push out to the limit: 


a b b b 
DEFINITION | y(x)dx = jim | y(x)dx | y(x)dx = lim | yW(x)dx. 


The conclusion is the same. The first examples converged to 1, 1, 2. Now come two 
more examples going out to b= oo: 


‘ 7g oo 
The area under 1/x is infinite: | = =In x |} = 0 (1) 
1 
' ' -d a dl bs 1 
The area under 1/x? is finite if p> 1: cas =—., (2) 
cx imp 2-1! 


The area under 1/x is like 1+4+4+4+-::, which is also infinite. In fact the sum 
approximates the integral—the curved area is close to the rectangular area. They go 
together (slowly to infinity). 

A larger p brings the graph more quickly to zero. Figure 7.7a shows a finite area 
1/(p — 1) = 100. The region is still infinite, but we can cover it with strips cut out of 
a square! The borderline for finite area is p= 1. I call it the borderline, but p= 1 is 
strictly on the side of divergence. 


The borderline is also p = 1 when the function climbs the y axis. At x = 0, the graph 
of y = 1/x? goes to infinity. For p = 1, the area under 1/x is again infinite. But at x = 
0 it is a small p (meaning p < 1) that produces finite area: 


1 " 1 1—py}i 
nn one 
o x 1—p 


o XP 1—p fo 


Loosely speaking ““—In 0 = oo.” Strictly speaking we integrate from the point x =a 
near zero, to get i%s dx/x = —Ina. As a approaches zero, the area shows itself as 
infinite. For y = 1/x?, which blows up faster, the area —1/x]5 is again infinite. 

For y= 1/,/x, the area from 0 to 1 is 2. In that case p = 3. For p = 99/100 the area 
is 1/(1 — p)= 100. Approaching p= 1 the borderline in Figure 7.7 seems clear. But 
that cutoff is not as sharp as it looks. 


Fig. 7.7. Graphs of 1/x? on both sides of p= 1. I drew the same curves! 
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Narrower borderline Under the graph of 1/x, the area is infinite. When we divide 
by In x or (In x)’, the borderline is somewhere in between. One has infinite area (going 
out to x = 00), the other area is finite: 


m dy . ae ee OY cae 
i rae 7 intin xP = 0 i aa oe ‘l 1. (4) 


The first is [ du/u with w= In x. The logarithm of In x does eventually make it to 
infinity. At x = 10°°, the logarithm is near 23 and In(In x) is near 3. That is slow! 
Even slower is In(In(tn x)) in Problem 11. No function is exactiy on the borderline. 

The second integral in equation (4) is convergent (to 1). It is J du/u? with u=In x. 
At first I wrote it with x going from zero to infinity. That gave an answer I couldn’t 
believe: 


© dy se Ae ane 
i x(n x =n I ve) 


There must be a mistake, because we are integrating a positive function. The area 

can’t be zero. It is true that 1/ln b goes to zero as b> oo. It is also true that I/Ina 

goes to zero as a 0. But there is another infinity in this integral. The trouble is at 
= 1, where In x is zero and the area is infinite. 


EXAMPLE 1 The factor e-* overrides any power x? (but only as x + 00). 
fg x8°e-*dx = 50! but [fp x te “dx = 0. 


The first integral is (50)(49)(48)---(1). It comes from fifty integrations by parts (not 
recommended). Changing 50 to 4, the integral defines “3 factorial.” The product 
4(- 4)(- })-- has no way to stop, but somehow 4! is 4,/n. See Problem 28. 
The integral [> x°e~ *dx = 1 is the reason behind “‘zero factorial” = 1. That seems the 
most surprising of all. 


The area under e */x is (—1)! = 00. The factor e~ * is absolutely no help at x = 0. 
That is an example (the first of many) in which we do not know an antiderivative— 
but still we get a decision. To integrate e~ */x we need a computer. But to decide that 
an improper integral is infinite (in this case) or finite (in other cases), we rely on the 
following comparison test: 


7C (Comparison test) Suppose that 0 < u(x) < v(x). Then the area under u(x) 
is smaller than the area under x(x): 


Jufxldx <0 if fulx\dx<oo if fufx)dx = & then fu(x)dx = w. 


Comparison can decide if the area is finite. We don’t get the exact area, but we learn 
about one function from the other. The trick is to construct a simple function {like 
1/x”) which is on one side of the given functton—and stays close to it: 


ene | ; . 7d 
EXAMPLE 2 { ak converges by comparison with { = = 1. 
dx 7 dx 
EXAMPLE 3 —=—— diverges by comparison with = 
| 1 /xt me re | 1 2/x 
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EXAMPLE 4 diverges by comparison with o, 00 
9 x7 + 4x td P o 5X ; 
1 1 

EXAMPLE 5 | oi converges by comparison with | = = 1. 
0 /x Sa | a} 1 


In Examples 2 and 5, the integral on the right is larger than the integral on the left. 
Removing 4x and /x increased the area. Therefore the integrals on the left are 
somewhere between 0 and 1. 

In Examples 3 and 4, we increased the denominators. The integrals on the right 
are smaller, but still they diverge. So the integrals on the left diverge. The idea of 
comparing functions is seen in the next examples and Figure 7.8. 


«o 1 oO 
EXAMPLE 6 | e*axis below | 1 ax+ | e-*dx=1+1. 


0 0 1 


EXAMPLE 7 ka oo a aoe 
, Inx ,xiInx 


1 1 dx 


t dx dx | 
EXAMPLE 8 ——— is below —+ =2+2. 
j, ./x— x? ee Oo <fL—¥ 


yo we + 


— 


Fig. 7.8 Comparing u(x) to v(x): ff dx/Inx =o and {) dx/,/x—x? <4. But 0 — #0. 


There are two situations not yet mentioned, and both are quite common. The first is 
an integral all the way from a= — «0 to b= + o. That is split into two parts, and 
each part must converge. By definition, the limits at — co and + oo are kept separate: 


oo 0 oo 0 b 
| yx) dx = | y(x) dx + | y(x) dx = lim | y(x) dx + jim | y(x) dx. 
—o -@ 0 ae. Ja *O JO 
The bell-shaped curve y = e~*’ covers a finite area (exactly ./n). The region extends 
to infinity in both directions, and the separate areas are 4./n. But notice: 
\”_, x dx is not defined even though |’, x dx =0 for every b. 


The area under y = x is +00 on one side of zero. The area is — oo on the other side. 
We cannot accept ~ — «© =0. The two areas must be separately finite, and in this 
case they are not. 
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EXAMPLE? 1/x has balancing regions left and nght of x = 0. Compute ae dx}x. 


This integral does not exist. There is no answer, even for the region in Figure 7.8c. 
(They are mirror images because 1/x is an odd function.) You may feel that the 
combined integral from —1 to | should be zero. Cauchy agreed with that—his 
“principal value integral” is zero. But the rules say no: ac — «¢ is not zero. 


7.5 EXERCISES 


Read-through questions 


An improper integral f° y(x) dx has lower limit a=__a__ or 
upper limit b=__b_ or y becomes __¢ __ in the interval 
a<x<h. The example i dx/x? is improper because __d 
We should study the limit of fj dx/x? as ___e@ _. In practice 
we work directly with —jx7*]P=__f . For p>1 the 
improper iniegral __g is finite. For p<1 the improper 
integra! __h___ is finite. For y =e * the integral from 0 to oc 
is__i 


Suppose 0 < u(x) § {x} for all x. The convergence of _i 
implies the convergence of k The divergence of 
fu(x}dx __|__ the divergence of | o(x) dx, From — x to x, 
the integral of I/(e* +e °*} converges by comparison with 

m__. Strictly speaking we split (— oc, oc}into{__n__, O)and 
(0,__o _). Changing to I /(e* — e-*} gives divergence, because 

p_. Also {” _ dx/sin x diverges by comparison with __4 


The regions left and nght of zero don’t cancel because «0 — x 
is__t 


Decide convergence or divergence in 1-16. Compute the integ- 
rals that converge. 


~ dx 1a 

ie fe 
1 * o x 

P is as 4 ik dx 
o l-x 9 [-x 

P 0 ax 6 ! dx 
nxt +i -1f/1—x? 
1 wm 

| Wy 5 | sin x dx 
.U x me 
e x= 

9 | in x dx (by parts} 10 | xe *dx (by parts) 
0 o 

Wl 4 dx 2 bs x dx 
100 x(In x){ln In x) ait (x? _ 1)? 
x ni2 

13 { cos*x dx 14 | tan x dx 
e 0 

is| & pee 

“Ty x? o (e—1P 


In 17-26, find a larger integral that converges or a smaller 
integral that diverges. 


* dx tdx 
18 —_—_ 
a | xf + 1 \, x? + 1 
x dx eo *dx 
19 | Y;— 20 
iF x- +1 \ l-x 
a x 
21 ( e*sin x dx 22 | x “dx 
1 l 
ee 1 r 
23 | ered 24 | vf —In x dx 
0 0 


w gi n2 ~ FL 1 
25 | Te ae 2% | (: - ) dx 
o x 1 \x l+x 


27 If p> 0, integrate by parts to show that 
fg xPe “dx =ply x? fe “dx. 


The first integral is the definition of p! So the equation ts p! = 

. In particular Q! = . Another notation for 
pl is T(p+ i} using the gamma function emphasizes that p 
need not be an integer. 


28 Compute (— 4)! by substituting x =u?: 


Be ge WE aD = ./rtk , 
(5 87 ede = \/m (known), 


Then apply Problem 27 to find (4)! 
29 Integrate [fF x7e7*"dx by parts. 


30 The beta function Bom n) = a x@ xy de is finite 
when m and # are greater than 


31 A perpetual annuity pays s dollars a year forever, With 
continuous interest rate ¢, its present value is yo = |g se 7 de. 
To receive $1000/year at ¢ = 10%, you deposit yg = 


32 In a perpetual annuity that pays once « year, the present 

value is yo=Siatsfar?tor= . To receive 

$1000/year at 10% (now a= 1.1) you again deposit yo = 
. Infintte sums are like improper integrals. 


33 The work to move a satellite (mass m) infinitely far from 
the Earth (radius R, mass M) is W= J, GMm dx/x*. Evaluate 
W, What escape retocity at liftoff gives an energy ime? that 
equals W? 
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34 The escape velocity for a black hole exceeds the speed of 
light: vg > 3° 10% m/sec. The Earth has GM = 4-t0'*m?/sec?. 
Hf it were compressed to radius R = , the Earth 
would be a black hole. 


35 Show how the area under y = /2* can be covered (draw 
a graph) by rectangles of area 1 +4+4+--- =2, What is the 
exact area from x =0 to x = 00? 


36 Explain this paradox: 


b © 
{ as = 0 for every 6 but { ae diverges. 
-b -a 


37 Compute the area between y=secx and y=tanx for 
0<x<zx/2. What is improper? 
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*38 Compute any of these integrals found by geniuses: 


-1f2 Pt Sar tf 
[7 cae { es dein 3 
{+x x 


0 
{ xe~* cos x dx =0 { cos x2dx = ./n/8. 
0 0 


ig Fic wich a |? 9 
Or Wil E —— = ©? 
chpit | eax 


40 Explain from Figure 7.6c why the red area is 2, when 
Figure 7.6a has red area 1. 
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Ee 868.4 Areas and Volumes by Slices 


CHAPTER 8 


Applications of the Integral 


We are experts in one application of the integral—to find the area under a curve. 
The curve is the graph of y= v(x), extending from x =a at the left to x= b at the 
right. The area between the curve and the x axis is the definite integral. 

I think of that integral in the following way. The region is made up of thin strips. 
Their width is dx and their height is v(x). The area of a strip is u(x) times dx. The 
area of all the strips is ‘4 v(x} dx. Stnetly speaking, the area of one strip is 
meaningless—genuine rectangles have width Ax. My point is that the picture of thin 
strips gives the correct approach. 

We know what function to integrate (from the picture). We also know how (from 
this course or a calculator). The new applications to volume and length and surface 
area cut up the region in new ways. Again the small pieces teli the story. In this 
chapter, what to integrate is more important than how. 


This section starts with areas between curves. Then it moves to volumes, where the 
strips become slices. We arc weighing a loaf of bread by adding the weights of the 
slices. The discussion is dominated by examples and figures—the theory is minimal. 
The real problem is to set up the right integral. At the end we look at a diflerent way 
of cutting up volumes, into thin shells. All formulas are collected into a final table. 

Figure 8.1 shows the area between two curves. The upper curve is the graph of 
y = u(x). The lower curve is the graph of y = w(x). The strip height is p(x) — w(x), from 
one curve down to the other. The width is dx (speaking informally again). The total 
area is the integral of “top minus bottom”: 


Be ns 
area between two curves = | [ v(x) - wx) | dx. (1) 


EXAMPLE 4 The upper curve is y = 6x (straight line). The lower curve is » = 3x? 
(parabola). The area lies between the points where those curves intersect. 


To find the intersection points, solve u(x) = w(x) or 6x = 3x?. 
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circle 


re oe 


¢ 
7 


/ . 
7 line wa4 


bulge 


Fig. 8.4 Area between curves = integral of v—w. Area in Example 2 starts with x > 0. 


Once crossing is at x = 0, the other is at x = 2. The area is an integral from 0 to 2: 


area = f° (v — w) dx = JG (6x — 3x?) dx = 3x? - x? | = 4. 


EXAMPLE 2 Find the area between the circle y = ./1 — x? and the 45° line w= x. 


First question: Which area and what limits? Start with the pie-shaped wedge in 
Figure 8.1b, The arca begins at the y axis and ends where the circle meets the line. 
At the intersection point we have v(x) = w(x): 

from /1- x? = x squaring gives 1 — x? = x? and then 2x? = 1. 


Thus x* = 4. The endpoint is at x= Wi/2 Now integrate the strip height v — w: 


The area is 7/8 (one cighth of the circle}. To integrate \/ 1 — x? dx we apply the 
techniques of Chapter 7: Set x=sin@/, convert to f cos? 0d) =4(0 + sin @ cos 4). 
convert back using @ = sin” ' x. It is harder than expected, for a familiar shape. 


Remark Suppose the problem is to find the whole area between the circle and the 
line. The figure shows v= w at two points, which are x = 1/2 {already used) and 


also x = - 1),/2. Instead of starting at x = 0, which gave 3 of a circle, we now include 
the area to the left. 7 - 
Main point: Integrating from x= —1/,/2 to x= 1/\/2 will give the wrong answer. 


It misses the part of the circle that bulges out over itself, at the far left. In that part. 
the strips have height 2z instead of : — w. The figure is essential, to get the correct 
area of this half-circle. 


HORIZONTAL STRIPS INSTEAD OF VERTICAL STRIPS 


There is morc than one way to shice a region. Vertical slices give x integrals. Horizontal 
slices give y integrals. We have a free choice, and sometimes the y integral is better. 


3.4 Areas and Volumes by Slices 


yelnx 


Fig. 8.2 Vertical slices (x integrals) vs. horizontal slices (y integrals). 


Figure 8.2 shows a unit parallelogram, with base 1 and height 1. To find its area from 
vertical slices, three separate integrals are necessary. You should see why! With hori- 
zontal slices of length 1 and thickness dy, the area is just So dy= 1. 


EXAMPLE3 [Find the area under y= In x (or beyond x = &’) out to x=e. 
The x integral from vertical slices is in Figure 8.2c. The y integral is in 8.2d. The area 
is a choice between two equal integrals (I personally would choose y): 

ff-, nx dx=[xInx-x]|'=1 or fig @-@) dy=[ey-e |= 1. 


, 


VOLUMES BY SLICES 


For the first time in this book, we now look at volumes. The regions are three- 
dimensional solids. There are three coordinates x, y,z—-and many ways to cut up a 
solid. 

Figure 8.3 shows one basic way—using slices. The slices have thickness dx, like 
strips in the plane. Instead of the height y of a strip, we now have the area A of a 
cross-section. This area is different for different slices: A depends on x. The volume 
of the slice is its area times its thickness: dV = A(x) dx. The volume of the whole solid 
is the integral: 


volume = integral of area times thickness = j A(x} dx. (2) 


Note An actual slice does not have the same area on both sides! Its thickness is Ax 
(not dx). Its volume is approximately A(x) Ax (but not exactly). In the limit, the 
thickness approaches zero and the sum of volumes approaches the integral. 

For a cylinder all slices are the same. Figure 8.3b shows a cylinder—not circular. 
The area is a fixed number A, so integration is trivial. The volume is A times h. The 


d¥ = A(x) dx 


Fig. 8.3 Cross-sections have area A(x). Volumes are § A(x) dx. 
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letter k, which stands for height, reminds us that the cylinder often stands on its end. 
Then the slices are horizontal and the y integral or z integral goes from 0 to A. 
When the cross-section is a circle, the cylinder has volume xr*h. 


EXAMPLE 4 The triangular wedge in Figure 8.3b has constant cross-sections with 
area A = $(3)(4) = 6. The volume is 6h. 


EXAMPLE 5 For the triangular pyramid in Figure 8.3c, the area A(x} drops from 6 
to 0. It is a general rule for pyramids or cones that their volume has an extra factor 
+ (compared to cylinders). The volume is now 2/ instead of 6h. For a cone with base 
area xr’, the volume is $2r7h. Tapering the area to zero leaves only 4 of the volume. 

Why the 4? Triangles sliced from the pyramid have shorter sides. Starting from 3 
and 4, the side lengths 3(1 — x/h) and 4{1 — x/#) drop to zero at x=h. The area is 
A= 6(1 — x/h)’. Notice: The sidc lengths go down linearly, the area drops quadrati- 
cally. The factor 4 really comes from integrating x? to get 3.x*: 


h ht x 2 x 37h 
| diy de= | 6(1- #) ix=-21(1-¥) | = 2h. 
0 rf) h h 0 


EXAMPLE 6 A half-sphere of radius R has known volume ${$7R’*). Its cross-sections 
are semicircles. The key relation is x? + r? = R*, for the right triangle in Figure 8.4a. 
The area of the semicircle is A = }ar? =42(R? — x’). So we integrate A(x): 


volume = {* , A(x) dx = 42{R?x - 4x3) |f, =4aR°. 


EXAMPLE? Find the volume of the same half-sphere using horizontal slices 
(Figure 8.4b). The sphere stili has radius R. The new right triangle gives y* + r? = R?. 
Since we have full circles the area is mr? = a(R? — 57). Notice that this is A(y)} not 
A(x). But the y integral starts at zero: 


volume = [f A(y) dy = x{R?y — iy) = 3nR° (as before). 


Fig.8.4 A half-sphere sliced vertically or horizontally. Washer area xf? — ag’. 


SOLIDS OF REVOLUTION 


Cones and spheres and circular cylinders are “‘solids of revolution.” Rotating a hori- 
zontal line around the x axis gives a cylinder. Rotating a sloping line gives a cone. 
Rotating a semicircle gives a sphere. Ifa eircle is moved away from the axis, rotation 
produces a torus (a doughnut). The rotation of any curve y=/(x) produces a solid 
of revolution, 


8.4 Areas and Volumes by Slices 


The volume of that solid is made easier because every cross-section is a circle. All 
slices are pancakes (or pizzas). Rotating the curve y=f(x) around the x axis gives 
disks of radius y, so the area is A= xy? = x[f(x)]*. We add the slices: 


b b 
volume of solid of revolution = | ny? dx = | n| f(x) | dx. 


EXAMPLE 8 Rotating y= afk with A = n(,/x)? produces a “headlight” (Figure 8.5a): 
volume of headlight = [5 A dx = [2 mx dx =4nx? Eb = 2n. 


If the same curve is rotated around the y axis, it makes a champagne glass. The slices 
are horizontal. The area of a slice is tx? not zy?. When y= /x this area is my*. 
Integrating from y= 0 to wf gives the champagne volume n(,/2)5/5. 


revolution around the y axis: volume = | mx? dy. 


EXAMPLE9 The headlight has a hole down the center (Figure 8.5b). Volume = ? 


The hole has radius 1. All of the /x solid is removed, up to the point where Jx 
reaches 1. After that, from x = 1 to x =2, each cross-section is a disk with a hole. 
The disk has radius f= a/x and the hole has radius g = 1. The slice is a flat ring or 
a ‘“‘washer.” Its area is the full disk minus the area of the hole: 


area of washer = nf* — ng? = n(./x)? — n(1)? = mx — 17. 
This is the area A(x) in the method of washers. Its integral is the volume: 


fi A dx=fj (xx — nm) dx =| $nx? = nx |! =4n. 


Please notice: The washer area is not n(f—g). It is A= nf? — ng’. 


b+ Vg? — x2 


inner radius 
of washer 


Fig.8.5 y= /x revolved; y = | revolved inside it; circle revolved to give torus. 


EXAMPLE 10 (Doughnut sliced into washers) Rotate a circle of radius a around the 
x axis. The center of the circle stays out at a distance b > a. Show that the volume 
of the doughnut (or torus) is 22?.ab. 
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8 Applications of the Integral 


The outside half of the circle rotates to give the outside of the doughnut. The inside 
half gives the hole. The biggest slice (through the center plane) has outer radius b+ a 
and inner radius 4 — a. 

Shifting over by x, the outer radius is f=b + ./a?— x? and the inner radius is 
g=h-—./a’— x’. Figure 8.5¢ shows a slice (a washer) with area nf* — xy’. 


area A=x(ht+ fa? — x*)? —a(h— \/a? — x?) = 4nd v/a? — x?. 
Now integrate over the washers to find the volume of the doughnut: 
P. , A(x) dx = 4b f° | ¥/a? — x? dx = (4nb)(5 207) = 207 7b. 


That integral $na* is the area of a semicircle. When we set x = asin @ the area is 
[ a? cos? @ df. Not for the last time do we meet cos? @. 

The hardest part is visualizing the washers, because a doughnut usually breaks the 
other way. A better description is a Aagef, sliced the long way to be buttered. 


VOLUMES BY CYLINDRICAL SHELLS 


Finally we look at a diflerent way of cutting up a solid of revolution. So far it was 
cut into slices. The slices were perpendicular to the axis of revolution. Now the cuts 
arc parallel to the axis, and each ptece is a thin cylindrical shell. The new formula 
gives the same volume, but the integral to be computed might be easier. 

Figure 8.6a shows a solid cone. A shell is inside it. The inner radius is x and the 
outer radius is x + dx. The shell is an outer cylinder minus an inner cylinder: 


shell volume a(x + dx)?h — 2x7h = xx7?h+2extdxjh + a(dxPh—ax?h. (3) 


The term that matters is 2xx(dx)h. The shell valume is essentially 27x (the distance 
around} times dx (the thickness) times h (the height). The volume of the solid comes 
from putting together the thin shells: 


solid volume = integral of shell volumes = | 2axh dx. (4) 


This is the central formula of the shell method. The rest is examples. 


Remark on this volume formula It is completely typical of integration that (dx)? and 
(Ax)? disappear. The reason is this. The number of shells grows like 1/Ax. Terms of 
order (Ax)? add up to a volume of order Ax {approaching zero). The linear term 
involving Ax or dx is the one to get right. Its limit gives the integral f 2nxh dx. The 
key is to build the solid out of shells—and to find the area or volume of each piece. 


EXAMPLE 44 Find the volume of a cone (base area zr’, height 5) cut into shells. 


A tall sheil at the center has A near b. A short shell at the outside has # near zero. In 
between the shel! height A decreases linearly, reaching zero at x =r. The height in 
Figure §.6a is h= b — bx/r. Integrating over all shells gives the volume of the cone 
(with the expected 4): 


is 3 Ir 
| amx(b— n=) ax=[ nxt AEP SNF 
0 r 3r 0 3 


8.4 Areas and Volumes by Slices 


hole radius a 
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Fig. 8.6 Shells of volume 27xh dx inside cone, sphere with hole, and paraboloid. 


EXAMPLE 12 Bore a hole of radius a through a sphere of radius b> a. 


The hole removes all points out to x = a, where the shells begin. The height of the 
shell is h = 2, /b? — x?. (The key is the right triangle in Figure 8.6b. The height upward 
is ./b? — x?—this is half the height of the shell.) Therefore the sphere-with-hole has 


volume = f° 2nxh dx = J? 4nx./b? — x? dx. 
With u = b? — x? we almost see du. Multiplying du = — 2x dx is an extra factor —2n: 
volume = —2n f \/u du = —2n(3u?”). 
We can find limits on u, or we can put back u = b? — x?: 
4 p 4 
volume = — - (b? — aad = = (b? — a?)?/. 
If a= b (the hole is as big as the sphere) this volume is zero. If a=0 (no hole) we 
have 4zb?/3 for the complete sphere. 
Question What if the sphere-with-hole is cut into slices instead of shells? 
Answer Horizontal slices are washers (Problem 66). Vertical slices are not good. 
EXAMPLE 13 Rotate the parabola y = x” around the y axis to form a bowl. 


We go out to em uho (and up to y= 2). The shells in Figure 8.6c have height 
h=2-— x*. The bowl (or paraboloid) is the same as the headlight in Example 8, but 
we have shells not slices: 


v2 4 v2 
| 2nx(2 — x”) dx = 22x? — aes | =2n. 


area between curves: A= i} (v(x) — w(x)) dx 
solid volume cut into slices: V={ A(x) dx or f A(y) dy 
AND solid of revolution: cross-section A=ny* or mx? 

Bas solid with hole: washer area A = nf” — ng? 
solid of revolution cut into shells: V={ 2xxh dx. 
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3 Applications of the integrai 


Which to use, slices or shells? Start with a vertical line going up to y = cos x. Rotating 
the line around the x axis produces a slice (a circular disk). The radius is cos x. 
Rotating the line around the y axis produces a shell (the outside of a cylinder). The 
height is cos x. See Figure 8.7 for the slice and the shell. For volumes we just integrate 
n cos’ x dx (the slice volume) or 2xx cos x dx (the shell volume). 

This is the normal choice—slices through the x axis and shells around the y axis. 
Then y=/(x) gives the disk radius and the shell height. The slice is a washer instead 
of a disk if there is also an inner radius g(x). No problem—just integrate small 
volumes. 

What if you use slices for rotation around the y axis? The disks are in Figure 8.7b, 
and their radius is x. This is x= cos~' y in the example. It is x=f~ '(y) in general. 
You have to solve y=/(x) to find x in terms of y. Similarly for shells around the x 
axis: The length of the shell is x =f —1(y). Integrating may be difficult or impossible. 

When y=cos x is rotated around the x axis, here are the choices for volume: 


(bad by shells) | 2xy cos~* y dy. 


(good by slices) | n cos? x dx 


ae x=f7l(y) = cosy 


Aig. 8.7 Slices through x axis and shells around y axis (good). The opposite way needs f ~ '(y). 


8.4 EXERCISES 


Read-through questions 


The area between y=x° and y= x* equals the integral of 

a_. If the region ends where the curves intersect, we find 
the limits on x by solving 6 . Then the areaequals_ ¢ 
When the area between y = J= and the y axis is sliced hori- 
zontally, the integral to compute is __d 


In thzee dimensions the volume of a slice is its thickness dx 
timesits_ @__. If the cross-sections are squares of side 1 — x, 
the volume comes from f_f . From x=0 to x=1, this 
gives the volume _g of a square __. If the cross-sec- 


tions are circles of radius 1—x, the volume comes from 
f__i__. This gives the volume J of a circular __* 


For a solid of revolution, the cross-sections are __! 
Rotating the graph of y = f(x) around the x axis gives a solid 
volume jf _m__. Rotating around the y axis leads to [ __1 
Rotating the area between py = f(x) and y = g(x} around the x 
axis, the slices look like ©. Their areas are_ Pp ___ so the 
volume isf_a_. 


Another method is to cut the solid into thin cylindrical 
s__. Revolving the area under y = f(x) around the y axis, 
ashell has height _» _ and thickness dx and yolume _ ¢ 


The total volume is [ __v 


Find where the curves in [-12 intersect, draw rough graphs, 
and compute the area between them. 


1 y=x?—3and y=1 2 y=x’—2and y=0 
3 y?=xandx=9 4 y?=xandx=y+2 
§ y=x*—2x? and y=2x? 6 x=y5 and y=x* 

T y=x? and y=—x? 4 18x 

8 y=1/x and y=1/x? and x =3 
9 y=cos x and y=cos*x 

40 y=sin xx and y=2x and x=0 
ll y=e and y=e**"! and x=0 
12 y=e and y=e and y=e * 


13 Find the area tnside the three lines y = 4 — x, y = 3x, and 
y=x. 


14 Find the arca bounded by y = 12 —x, y=./x, and y= 1. 


15 Does the parabola y=1~—x? out to x =1 sit inside or 
outside the unit circle x? + y? = 1? Find the area of the “skin” 
between them. 


8.4 Areas and Volumes by Slices 


16 Find the area of the largest triangle with base on the x 
axis that fits (a) inside the unit circle (b) inside that parabola. 


17 Rotate the ellipse x?/a? + y?/b? = 1 around the x axis to 
find the volume of a football. What is the volume around the 
y axis? If a=2 and b=1, locate a point (x, y, z) that is in one 
football but not the other. 


18 What is the volume of the loaf of bread which comes from 
rotating y=sin x (0<x<z) around the x axis? 


19 What is the volume of the flying saucer that comes from 
rotating y=sin x (0<x <7) around the y axis? 


20 What is the volume of the galaxy that comes from rotating 
y =sin x (0<x <7) around the x axis and then rotating the 
whole thing around the y axis? 


Draw the region bounded by the curves in 21-28. Find the 
volume when the region is rotated (a) around the x axis (b) 
around the y axis. 


21 x+y=8,x=0, y=0 

22 y-—e&=1,x=1, y=0, x=0 

23 y=x*, y=1,x=0 

24 y=sin x, y=cosx,x=0 

25 xy=1,x=2, y=3 

26 x?—y?=9, x + y=9 (rotate the region where y > 0) 
27 x2 =y3, xP =y? 


28 (x —2)?+(y—1)?=1 


In 29-34 find the volume and draw a typical slice. 


29 A cap of height h is cut off the top of a sphere of radius 
R. Slice the sphere horizontally starting at y= R —h. 


30 A pyramid P has height 6 and square base of side 2. Its 
volume is 4(6)(2)? = 8. 
(a) Find the volume up to height 3 by horizontal slices. 
What is the length of a side at height y? 
(b) Recompute by removing a smaller pyramid from P. 


31 The base is a disk of radius a. Slices perpendicular to the 
base are squares. 


32 The base is the region under the parabola y=1—x?. 
Slices perpendicular to the x axis are squares. 


33 The base is the region under the parabola y=1-— x’. 
Slices perpendicular to the y axis are squares. 


34 The base is the triangle with corners (0, 0), (1, 0), (0, 1). 
Slices perpendicular to the x axis are semicircles. 


35 Cavalieri’s principle for areas: If two regions have strips 
of equal length, then the regions have the same area. Draw a 
parallelogram and a curved region, both with the same strips 
as the unit square. Why are the areas equal? 
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36 Cavalieri’s principle for volumes: If two solids have slices 
of equal area, the solids have the same volume. Find the 
volume of the tilted cylinder in the figure. 


37 Draw another region with the same slice areas as the tilted 
cylinder. When all areas A(x) are the same, the volumes 
are the same. 


38 Find the volume common to two circular cylinders of 
radius a. One eighth of the region is shown (axes are perpen- 
dicular and horizontal slices are squares). 


39 A wedge is cut out of a cylindrical tree (see figure). One 
cut is along the ground to the x axis. The second cut is at 
angle @, also stopping at the x axis. 


(a) The curve C is part of a (circle) (ellipse) (parabola). 
(b) The height of point P in terms of x is 

(c) The area A(x) of the triangular slice is 

(d) The volume of the wedge is 


40 The same wedge is sliced perpendicular to the y axis. 
(a) The slices are now (triangles) (rectangles) (curved). 
(b) The slice area is (slice height y tan 6). 
(c) The volume of the wedge is the integral 


(d) Change the radius from 1 to r. The volume is 
multiplied by 


41 A cylinder of radius r and height h is half full of water. 
Tilt it so the water just covers the base. 
(a) Find the volume of water by common sense. 
(b) Slices perpendicular to the x axis are (rectangles) (trap- 
ezoids) (curved). I had to tilt an actual glass. 


*42 Find the area of a slice in Problem 41. (The tilt angle has 
tan 0 = 2h/r.) Integrate to find the volume of water. 
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The slices in 43—46 are washers. Find the slice area and volume. 


43 The rectangle with sides x = 1, x =3, p= 2, y = 5 is rotated 
around the x axis. 


44 The same rectangie is rotated around the y axis. 
45 The same rectangle is rotated around the line y = 1. 


46 Draw the triangle with corners (1, 0}, (t, 1), (0, 1). After 
rotation around the x axis, describe the solid and find its 
volume, 


47 Bore a hole of radius a down the axts of a cone and 
through the base of radius b. If it is a 45° cone (height also 
b), what volume is left? Check a=0 and a=8, 


48 Find the volume common to two spheres of radius r if 
their centers are 2(r —h) apart. Use Problem 29 on spherical 
caps. 


49 (Shells vs. disks) Rotate y= 3 ~ x around the x axis from 
x=Oto x=2, Write down the volume integral by disks and 
then by shells. 


50 (Shells vs. disks) Rotate y= x? around the y axis from 
y=0 to y=8, Write down the volume integral by shells and 
disks and compute both ways. 


51 Yogurt comes in a solid of revolution. Rotate the line 
y= mx around the y axis to find the volume between y=a 
and y= 6. 


52 Suppose y = f(x) decreases from f(0)=6 to f(1}=0, The 
curve is rotated around the y axis. Compare shells ta disks: 
Jo nxfl) dx = ff, w(f—"(y))? dy. 


Substitute y = {x} in the second. Also substitute dy = f'(x) dx. 
Integrate by parts to reach the first. 


53 Ifa roll of paper with inner radius 2 cm and outer radius 
10 cm has about 10 thicknesses per centimeter, approximately 
how Jong is the paper when unroiled? 


54 Find the approximate volume of your brain. OK to 
include everything above your eyes (skull too). 


Use shells to find the volumes in 55-63. The rotated regions 
lie between the curve and x axis. 


55 y=1—x?,0<x <1 (around the y axis) 


The graph of y= x*?? 


8.2 Length of a Plane Curve 


8 Appileations ct the integral 


S6 p= 1/x, 1 <x < 100 (around the y axis) 

57 y=\/1—x?,0<x <1 (around either axis) 
58 y=1/(1+ x7), 0<x <3 (around the y axis) 
59 y=sin(x?), 0<x <./n (around the y axis) 
60 y= /1—x?, 0<.x <1 (around the y axis) 
61 y=x?, 0<x <2 (around the x axis} 

62 y=e,0<x< 1 (around the x axis) 

63 y=Inx, 1<x<e (around the x axis) 


64 The region between y= x? and y=x is revolved around 
the y axis. (a) Find the volume by cutting into shells, (b) Find 
the volume by slicing into washers. 


65 The region between y=/{(x) and y=1+/{x) is rotated 

around the y axis. The shells have height . The vol- 

ume out to x =a is . It equals the volume of a 
because the shells are the same. 


6 A horizontal slice of the sphere-with-hole in Figure 8.6b 
is a washer. Its area is mx? — na? = n(b? ~ y’ — a’). 

(a) Find the upper limit on y (the top of the hole}. 

(b} Integrate the area to verify the volume in Example (2. 


67 Jf the hole in the sphere has length 2, show that the yolume 
is 4nx/3 regardless of the radii a and 6. 


“68 An upright cylinder of radius r is sliced by two parallel 
planes at angle x. One is a height 4 above the other. 


{a) Draw a picture 10 show that the yolume between the 
planes is xr?h. 


(b} Tilt the picture by x, so the base and top are flat. What 
is the shape of the base? What is its area A? What is the 
height H of the tilted cylinder? 


69 True or false, with a reason. 
(a) A cube can only be sliced into squares. 
(b} A cube cannot be cut into cylindrical sheils. 
(c) The washer with radii r and R has area n(R — 1)’, 


(d) The plane w =$ slices a 3-dimenstonal sphere out of 
a 4-dimensional sphere x? + y? + z7+w?=1. 


is a curve in the x-y plane. How long is that curve? A definite 


integral needs endpoints, and we specify x = 0 and x = 4. The first problem is to know 


what “‘length function” to integrate. 


The distance along a curve is the arc length. To set up an integral, we break the 
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problem into small pieces. Roughly speaking, senail pieces of a smooth curve are nearly 
straight. We know the exact length As of a straight piece, and Figure 8.8 shows how 
it comes close to a curved piece. 


8 

ecg 

(As)? = (Ax)’ + (Ay)* 

2 
ay Ay (ds)? = (dx)? + (2) (dx)? 

dx 

Ax 
ds = ./1+ (dy/dx)* dx 
x 
0 4 0 4 


Hg. 8.8 Length As of short straight segment. Length ds of very short curved segment. 


Here is the unofficial reasoning that gives the length of the curve. A straight piece 
has (As)? = (Ax)? + (Ay)?. Within that right triangle, the height Ay is the slope 
(Ay/Ax) times Ax. This secant slope is close to the slope of the curve. Thus Ay is 
approximately (dy/dx) Ax. 


As = ./ (Ax)? + (dy/dx)*(Ax)? = ./1 + (dp/dx)? Ax. (1) 
Now add these pieces and make them smaller. The infinitesimal triangle has (ds)? = 


(dx)? + (dy)*. Think of ds as ./1+ {dy/dx)? dx and integrate: 


length of curve = [as = | 1+ (dy/dx)? dx. (2) 


EXAMPLE 4 Keep y= x°*? and dy/dx = }x*/?. Watch out for 3 and 3: 
length = f¢ ./1 + $x dx = (3)($)(1 + $x)? [f = 8108? ~ 1°), (3) 


This answer is just above 9. A straight line from (0, 0) to (4, 8) has exact length 
./ 80. Note 4? + 8? = 80. Since ./ 80 is just below 9, the curve is surprisingly straight. 


You may not approve of those numbers {or the reasoning behind them). We can 
fix the reasoning, but nothing can be done about the numbers. This example y = x*/? 
had to be chosen carefully to make the integration possible at all. The length integral 
is difficult because of the square root. In most cases we integrate numerically. 


EXAMPLE 2 The straight line y= 2x from x = 0 to x = 4 has dy/dx =2: 
length = [5 /i+4dx= 4./5=./80 as before (just checking). 


We return briefly to the reasoning. The curve is the graph of y = f(x). Each piece 
contains at least one point where secant slope equals tangent slope: Ay/Ax =f"(c). 
The Mean Value Theorem applies when the slope is continuous—this is required 


for a smooth curve. The straight length As is exactly ./ (Ax)? + (f'{c) Axy’. Adding 
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8 Applications of the Integral 


the an pieces gives the length of the broken line (close to the curve): 


Nt fm As = >, 1+[f'(c)}° Ax;. 


As n- oo and Ax,,,, + 0 this approaches the integral that gives arc Jength. 


8A The length of the curve y=/(x) from x=a to x= bis 


s=fds=[? /I+E f(x} dx=f? /1 + (dyfdx} dx 


EXAMPLE 3 Find the length of the first quarter of the circle y= ./1 — x?. 
Here dy/dx = —x/,/1— x”. From Figure 8.9a, the integral goes from x = 0 to x = 1: 


1 ; 1 x? 1 ax 
length = 1 + (dy/dx)}* dx = f1+——5 d ={ ——. 
£ { ( yl ) { 1 = x? x {, /y — x2 


The antiderivative is sin~' x. It equals x/2 at x = 1. This length 7/2 is a quarter of 
the full circumference 27. 


EXAMPLE 4 Compute the distance around a quarter of the ellipse y? + 2x? = 2. 
The equation is y= \/2— 2x? and the slope is dy/dx = —2x/,/2— 2x. So J ds is 


: 4x? 1 J24+ 2x? 1 fl+x? 
+ ——— dx = <——— dx = —— dx. 
i Vi 2-7 esas |. Jive ©) 


That integral can’t be done in closed form. The length of an ellipse can only be 
computed numerically. The denominator is zero at x = 1, so a blind application of the 
trapezoidal rule or Simpson’s rule would give length = oo. The midpoint rule gives 
length = 1.91 with thousands of intervals. 


V2 y=Y2-2x? 
yevl-x? ds -Vt+'yPax 
ds=Vl+ (yar 


v=cost,y=¥2 sine 


Yds =+/ sin?s + 2 cos*s dt 


start at f= O~ dx Start at ?=O~__ 


Fig. 8.9 Circle and elltpse, directly by y = f(x) or parametrically by x{t) and y(r). 


LENGTH OF A CURVE FROM PARAMETRIC EQUATIONS: x(/} AND y(?) 


We have met the unit circle in two forms. One is x? + y? = 1. The other is x = cos ¢, 
y=sin t. Since cos? t + sin? t = 1, this point goes around the correct circle. One advan- 
tage of the “parameter” ¢ is to give extra information—it tells where the point is and 


8.2 Length of a Plane Curve 


also when. In Chapter 1, the parameter was the time and also the angle—because 
we moved around the circle with speed 1. 

Using t is a natural way to give the position of a particle or a spacecraft. We can 
recover the velocity if we know x and y at every time ¢. An equation y = f(x) tells the 
shape of the path, not the speed along it. 

Chapter 12 deals with parametric equations for curves. Here we concentrate on 
the path length—which allows you to see the idea of a parameter t without too much 
detail. We give x as a function of t and y as a function of t. The curve is still 
approximated by straight pieces, and each piece has (As)? = (Ax)? + (Ay)’. But instead 
of using Ay ~ (dy/dx) Ax, we approximate Ax and Ay separately: 


Ax = (dx/dt) At, Ayx(dy/dt)At, As» ./(dx/dt)? + (dy/dt?? At. 


8B The length of a parametric curve is an integral with respect to f: 


{ ds= (dsjdt)de =| /(@x/diy® + (dylaty at. 


EXAMPLE 5 Find the length of the quarter-circle using x = cos t and y=sint: 


x/2 x/2 af2 r 

| (dx/dt)* + (dy/dt)? dt = | sin? t + cos? ¢ dt = | dt=-. 
it) tt] 0 

The integral is simpler than 1/./1—x*, and there is one new advantage. We can 

integrate around a whole circle with no trouble. Parametric equations allow a path to 

close up or even cross itself. The time t keeps going and the point (x(t), y(t}) keeps 

moving. In contrast, curves y = f(x) are limited to one y for each x, 


EXAMPLE 6 Find the jength of the quarter-ellipse: x=cosr¢ and y=/2sin t: 


On this path y* + 2x? is 2sin?2+2cos?t=2 (same ellipse). The non-parametric 
equation y= ./2~— 2x? comes from eliminating t. We keep t: 


napa nj2 
length = | (dx/dt)? + (dy/dt)? dt = | ./sin? t+ 2 cos? t dt. (7) 
0 0 


This integral (7) must equal (5). If one cannot be done, neither can the other. They 
are related by x =cos t, but (7) does not blow up at the endpoints. The trapezoidal 
rule gives 1.9101 with less than 100 intervals. Section 5.8 mentioned that calculators 
automatically do a substitution that makes (5) more like (7). 


EXAMPLE 7 The path x= #7, y=t° goes from (0, 0) to (4, 8). Stop at ¢= 2. 


To find this path without the parameter t, first solve for t= x1/?. Then substitute 
into the equation for y: y=? = x*/?. The non-parametric form (with ¢ eliminated) és 
the same curve y = x*!? as in Example 1. 

The length from the t-integral equals the length from the x-integral. This is 
Problem 22. 


EXAMPLE 8 Special choice of parameter: t is x. The curve becomes x= t, y= ¢?/, 


if x =¢ then dx/dt = 1. The square root in (6) is the same as the square root in (4). 
Thus the non-parametric form ) = f(x) is a special case of the parametric form—just 
take t= x. 

Compare x = t, y= t?/? with x = 17, y= 1°. Same curve, same length, different speed. 
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324 8 Applications of the integral 


EXAMPLE9 Define “speed” by 


short distance ds dx 2 
lp a ee ee It is 
short time dt 


When a bail is thrown straight upward, dx/dt is zero. But the speed is not dy/dt. 
It is |dy/dt|. The speed is positive downward as well as upward. 


8.2 EXERCISES 


Read-through questions 


The length of a straight segment (Ax across, Ay up) is 
As=_ 4 __. Between two points of the graph of p(x), Ay is 


approximately dy/dx times _>__. The length of that piece 


is approximately ./(Ax)?+_¢ _. An infinitesimal piece of 


the curve has length ds=_ 9. Then the arc length integral 
isj_e 


For y=4—x from x=0 to x=3 the arc length is 
t = 9. For y=x? the arc Jength integral is _h 


The curve x=cos t, y=sin ¢ is the same as __! _. The 


length of a curve given by x(t), y(t) is if /_! dt. For exam- 
ple x=cost, y=sint from t=x/3 to t=2x/2 has length 


k__. The speed is ds/dt=_' . For the special case 


x=t, y=/f(t) the length formula goes back to f m dx. 


Find the lengths of the curves in Problems 1-8. 
1 y=x*? from (0, 0) to (1, 1) 


2 y= x7 from (0, 0} to (1, 1) (compare with Problem 1 or 
put u=%+x7/5 in the length integral) 


3 p=4or7? + 2p? from x =0 to x=1 
4 pot(x? — 2)? from x=2tox=4 


x 1 
= = =3 
5 y 3 +4, from x I tox 
4 
6 pat + gos from x= 1 to x=2 
7 y= x3? —$x'? from x=1 to x=4 


8 y= x? from < 0) to (1, 1) 


9 The curve given by x =cos*1, y=sin?t is an astroid (a 
hypocycloid). Its non-parametric form is x7/3+y7?=1. 
Sketch the curve from ¢ =0 to t= 2/2 and find its length. 


10 Find the length from ¢=0 to i = a of the curve given by 
x=cost+sint, y=cost—sint. Show that the curve is a 
circle (of what radius?). 


11 Find the length from ¢ = 0 to t = 7/2 of the curve given by 
x=cost, p=t—sint, 


12 What integral gives the length of Archimedes’ spiral 
x=itcost,y=fsint? 


_ Show by substituting x = 


13 Find the distance traveled in the first second (to t = 1) if 
x=$e?, p= $(2t4 LP. 

14 x=(1—4c0s 2¢)cost and y= 
4(1 — x? — y?)? = 27kx? 
to 2/4, 


(1 +4 cos 21) sin ¢ lead to 
— y*)?. Find the are jength from t =0 


Find the arc lengths in 15-18 by numerica] integration. 

15 One arch of y=sin x, from x=0 to x=7. 

16 y=e" from x=O to x=1. 

17 y=Inx from x =1 tox=e. 

18 x=cost, y=3 sin ¢,0<x<2n. 

19 Draw a rough picture of y = x!°. Without computing the 
length of y= x” from (0, 0) to (1, {), find the limit as n- x. 
20 Which is longer between (1, 1) and (2,4), the hyperbola 
y= 1/x or the graph of x+2y=3? 

21 Find the speed ds/dt on the circle x = 2 cos 3t, y = 2 sin 31. 


22 Examples | and 7 were y= x?? and x=17, y= 2°: 


length = fj /1+4x dx, length = fj 


4:7 +924 dt. 
that these integrals agree. 


23 Instead of y=/{x) a curve can be given as x = g(y), Then 


(dx)? + (dy? = ./(dx/dy? +1 dy. 
Draw x = 5y from y=0 to y= 1 and find its length. 


24 The length of x=y*? from (0,0) to (t, 1) is 
[ds=|./%y +1 dy. Compare with Problem 1: Same length? 
Same curve? 


25 Find the length of x =4{e’+e°”} from y=—1 to y=1 
and draw the curve. 


26 The length of x = g(y)} is a special case of equation oe with 
y=tand x =g(t). The length integral becomes 


27 Plot the point x=3cosit, y=4sint at the five times 
t=0, 2/2, 2, 3x/2, 2m. The equation of the curve is 
{x/3)? + (y/4)? = 1, not a circle but an . This curve 
cannot be wnitten as y=/f(x) because 


28 (a) Find the length of x =cos’t, y=sin?4,0<y<z. 
(b) Why does this path stay on the line x + y= 1? 
(c) Why isn’t the path length equal to a: 
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29 (important) The line y = x is close to a staircase of pieces 
that go straight across or straight up. With 100 pieces of length 
Ax = 1/100 or Ay = 1/100, find the length of carpet on the 
staircase. (The length of the 45° line is of 2s The staircase can 
be close when its length is not close.) 


30 The area of an ellipse its xab. The area of a strip around 
it (width A) is m(a + A)(b + A) — ab = x(a + BJA, The distance 
around the ellipse seems to be x(a + 6). But this distance is 
impossible to find—what is wrong? 


31 The point x = cos ¢, py =sin t, z = t moves on a space curve. 
(a) In three-dimensional space (ds)? equals (dx)? + 
. In equation (6), ds is now dt. 


8.3 Area of a Surface of Revolution 


(b) This particular curve has ds = . Find its 


length from t=0 to t= 2n. 
(c) Describe the curve and its shadow in the xy plane. 


32 Explain in 50 words the difference between a non-para- 
metric equation y=f{x) and two parametric equations 
x= x(t), y = y(t). 


33 Write down the integral for the length L of y= x? from 
(0, 0) to (1, 1). Show that y =4x? from (0, 0) to (2, 2) is exactly 
twice as long. If possible give a reason using the graphs. 


34 (for professors) Compare the lengths of the parabola 
y=x* and the line y= 5x from (0,0) to (b,b?). Does the 
difference approach a limit as b > w? 


This section starts by constructing surfaces. A curve y = f(x) is revolved around an 
axis. That produces a “surface of revolution,” which is symmetric around the axis. If 
we revolve a sloping line, the result is a cone. When the line is parallel to the axis we 
get @ cylinder (a pipe). By revolving a curve we might get a lamp or a lamp shade 


(or even the light bulb). 


Section 8.1 computed the volume inside that surface. This section computes the 
surface area. Previously we cut the solid into slices or shells. Now we need a good 


way to cut up the surface. 


The key idea is to revolve short straight line segments. Their slope is Ay/Ax. They 
can be the same pieces of length As that were used to find length—now we compute 
area, When revolved, a straight piece produces a “thin band’ (Figure 8.10). The curved 
surface, from revolving y = f(x), is close to the bands. The first step is to compute the 


surface area of a band. 


A small comment: Curved surfaces can also be cut into tiny patches. Each patch 
is nearly flat, like a little square. The sum of those patches leads to a double integral 
(with dx dy}. Here the integral stays one-dimensional (dx or dy or dt). Surfaces of 
revolution are special—we approximate them by bands that go all the way around. 
A band is just a belt with a slope, and its slope has an effect on its area. 


middle radius x 


area AS = 2mrAs 


area AS = 27xAs 


side As 


arta AS = 2myAs 


Fig. 8.10 Revolving a straight piece and a curve around the y axis and x axis. 
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Revolve a smail straight piece (length As not Ax). The center of the piece goes 
around a circle of radius r. The band is a slice of a cone. When we flatten it out 
(Problems 11-13) we discover its area. The area is the side length As times the middle 
circumference 2nr: 


The surface area of a band is 2nrAs = 2nr./1+ (Ay/Ax)* Ax. 


For revolution around the y axis, the radius is r= x. For revolution around the x 
axis, the radius is the height: r = y = f(x). Figure 8.10 shows both bands—the problem 
tells us which to use. The sum of band areas 2nr As is close to the area S of the curved 
surface. In the limit we integrate 2r ds: 


8C The surface area generated by revolving the curve y= f(x) between x =a 
and x =b is 


S =f? 2ny./1 + (dy/dx)? dx around the x axis (r= y) (1) 
S =f? 2nx./1+ (dy/dx)? dx around the y axis (r= x). (2) 


EXAMPLE 1 Revolve a complete semicircle y= ./R* — x* around the x axis. 


The surface of revolution is a sphere. Its area (known!) is 42R?. The limits on x are 


—R and R. The slope of y=./R* — x? is dy/dx = —x/./ R? — x?: 


R 2 R 
area $= [ 2n./R?—x? [i+ aa axe | 2nR dx = 4nR?, 
-R ~ -R 


EXAMPLE 2 Revolve a piece of the straight line y = 2x around the x axis. 


The surface is a come with (dy/dx)* = 4. The band from x=0 to x=1 has area 
2n/5: 


S=f 2ny ds=f} 2n(2x) /1 +4 dx =2n/5, 


This answer must agree with the formula 2zr As (which it came from). The line from 
(0, 0) to (1, 2) has length As = FS. Its midpoint is (4, 1). Around the x axis, the middle 
radius is r= 1 and the area is 2n/5. 


EXAMPLE 3 Revolve the same straight line segment around the y axis. Now the 
radius is x instead of y = 2x. The area in Example 2 is cut in half: 


S=f 2ax ds=§) nx /1+4 dx=2,/5. 


For surfaces as for arc length, only a few examples have convenient answers. 
Watermelons and basketballs and light bulbs are in the exercises. Rather than stretch- 
ing out this section, we give a final area formula and show how to use it. 

The formula applies when there is a parameter t. Instead of (x, f(x)) the points on 
the curve are (x(t), y(t)). As ¢ varies, we move along the curve. The length formula 
(ds)? = (dx} + (dy)? is expressed in terms of t. 

For the surface of revolution around the x axis, the area becomes a t-integral: 


8D The surface area is { 2ny ds = { 2ny(t)./(dx/dt)? + (dy/dt)* dt. 3) 
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EXAMPLE 4 The point x =cos ¢, »y=5+sint travels on a circle with center at (0, 5). 
Revolving that circle around the x axis produces a doughnut. Find its surface area. 


Solution (dx/dt}? + (dy/dt)? = sin? + cos? t = 1. The circle is complete at t = 22: 


§ 2ny ds = [2 2n{S + sin 1) dt =[ 2m(5e— cos t) |G" = 20n?. 


8.3 EXERCISES 


Read-through questions 


A surface of revolution comes from revolving a__@__ around 

b_. This section computes the __¢ . When the curve is 
a short straight piece (length As), the surface isa 4. Its 
area is AS=__e_. In that formula (Problem 13) + is the 
radiusof_f . The linefrom(0, O}to(1, IjJhaslength_g_, 


and revolving it produces area __h 


When the curve } =/{x) revolves around the x axis, the 
surface area is the integral | __. For y =x? the integral to 
compute is__J . When y = x” is revolved around the y axis, 
the area is S=__k__. For the curve given by x = 22, y =0°, 
change dsto | dt. 


Find the surface area when curves 1-6 revolve around the x 
axis. 


6 y=coshx. O<x<l. 


In 7-10 find the area of the surface of revolution around the + 
axis. 


T y=x*, O<x<2 


9y=x+l1, O<xg3 


1) A cone with base radius R and slant height s ts laid out 
flat, Explain why the angle (1n radians) is # = 22 R/s. Then the 
surface area is a fraction of a circle: 


R 
area = (= r= (7) as? =aRs. 
2n 5 


12 A band with slant height As =s —s' and radii R and R’ ts 
laid out flat. Explain in one line why its surface area is 
mRs— R's’. 


area KR’s’ 


13 By similar triangles Ris = R’/s’ or Rs’ = R's. The middle 
radius r is $(R + R'). Substitute for r and As in the proposed 
avea formula 2nr As, to show that this gives the correct area 
mRs— R's’. 


14 Slices of a basketball all have the same area of cover, 
if they have the same thickness. 
(a) Rotate ra J/1—-x? around the x axis. Show that 
dS = 2n dx. 
(b} The area between x =a and x=a+h is 
(c) | of the Earth’s area is above latitude 


15 Change the circle in Example 4 to x =a cost and y= 
b+asin¢. Its radtus is and its center is ‘ 
Find the surface arca of a torus by revolving this circle around 
the x axis, 


16 What part of the circle x=Rcost, y=Rsin¢t should 
rotate around the y axis to produce the top half of a sphere? 
Choose limits on ¢ and verify the area. 


17 The base of a lamp is constructed by revolving the quar- 
ter-circle y= ./2x—x? (x= to x =2) around the } axis. 
Draw the quarter-circle, find the area integral, and compute 
the area. 


18 The light bulb is a sphere of radius 1/2 with its bottom 
sliced off to fit onto a cylinder of radius 1/4 and length 1/3. 
Draw the light bulb and find its surface area (ends of the 
cylinder not included). 


19 The lamp shade is constructed by rotating y = 1/x around 
the y axis, and keeptng the part from } =1 to y= 2. Set up 
the definite integral that gives its surface area. 


20 Compute the area of that lamp shade. 
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21 Explain why the surface area is infinite when y= 1/x is 
rotated around the x axis (1 <x < oc). But the volume of 
“Gabriel’s horn” is . It can’t hold enough paint to 
paint its surface. 


22 A disk of radius 1” can be covered by four strips of tape 
(width 3”). If the strips are not parallel, prove that they can’t 
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cover the disk. Hint: Change to a unit sphere sliced by planes 
$" apart. Problem 14 gives surface area m for each slice. 


23 A watermelon (maybe a football) is the result of rotating 
half of the clitpse x=,/2cost, y=sin¢ (which means 
x? + 2p? = 2). Find the surface area, parametrically or net. 


24 Estimate the surface area of an egg. 


Discrete probability usually involves careful counting. Not many samples are taken 
and not many experiments are made. There is a list of possible outcomes, and a 
known probability for each outcome. But probabilities go far beyond red cards and 
black cards. The real questions are much more practical: 


t. How often will too many passengers arrive for a flight? 
2. How many random errors do you make on a quiz? 
3. What is the chance of exactly one winner in a big lottery? 


Those are important questions and we will set up models to answer them. 

There is another point. Discrete models do not involve calculus. The number of 
errors or bumped passengers or lottery winners is a small whole number. Calculus 
enters for continuous probability. Instcad of results that exactly equal | or 2 or 3, 
calculus deals with results that fall in a range of numbers. Continuous probability 
comes up in at least two ways: 


(A) An experiment is repeated many times and we take averages. 
(B) The outcome lies anywhere in an interval of numbers. 


In the continuous case, the probability p, of hitting a particular value x = n becomes 
zero. Instead we have a probability density p(x}— which is a key idea. The chance that 
arandom X falls between a and b is found by integrating the density p(x): 


Probtas ¥ <b = i p(x) dx. (t) 


Roughly speaking, p(x} dx is the chance of falling between x and x + dx. Certainly 
p(x) 2 0. Ifa and 6 are the extreme limits — oc and +x, including all possible outcomes, 
the probability is necessarily one: 

Prob}- 4% <X<+~x%:=]’, plx)dx al. (2) 


This is a case where infinite limits of integration are natural and unavoidabie. In 
studying probability they create no difficulty—areas out to infinity are often easier. 
Here are typical questions involving continuous probability and calculus: 


4. How conclusive is a 53%-47% poll of 2500 voters? 
5. Are 16 random football players safe on an elevator with capacity 3600 pounds? 
6. How Jong before your car is in an accident? 


It is not so traditional for a calculus course to study these questions. They necd extra 
thought, beyond computing integrals {so this section is harder than average). But 
probability is more important than some traditional topics, and also more interesting. 
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Drug testing and gene identification and market research are major applications. 
Comparing Questions 1-3 with 4—6 brings out the relation of discrete to continuous— 
the differences between them, and the parallels. 

It would be impossible to give here a full treatment of probability theory. 1 believe 
you will see the point (and the use of calculus) from our examples. Frank Morgan’s 
lectures have been a valuahle guide. 


DISCRETE RANDOM VARIABLES 


A discrete random variable X has a list of possible values. For two dice the outcomes 
are X = 2,3,..., 12. For coin tosses (see below), the list is infinite: X = 1, 2, 3, .... 
A continuous variable lies in an interval a< X <b. 


EXAMPLE 1 Toss a fair coin until heads come up. The outcome X is the aumber of 
tosses. The value of X is 1 or 2 or 3 or ..., and the probability ts } that ¥ = 1 (heads 
on the first toss). The probability of tails then heads is p, = 4. The probability that 
X =n is p, =(3)'—this is the chance of n — 1 tails followed by heads. The sum of all 
probabilities is necessarily 1: 


Pit Pat Pst EAE EH a 


EXAMPLE 2 Suppose a student (not you) makes an average of 2 unforced errors per 
hour exam. The number of actual errors on the next exam is X =O or 1 or2or.... 
A reasonable model for the probability of m errors—when they are random and 
independent— is the Poisson model (pronounced Pwason): 
is 
P, = probability of n errors = nie ~ 


The probabilities of no errors, one error, and two errors are po, p;, and p,: 


2° -2_! -2 2 -2 ms -2 
Po= ore 7 m 135 a= FW my 27 Pa= ae 27. 
The probability of more than two errors is 1 — .135— .27 — .27 = .325. 
This Poisson model can be derived theoretically or tested experimentally. The total 
probability is again 1, from the infinite series (Section 6.6) for e*: 
7 ae Sma eee ee 
att ate =e*e *=1, 3) 


Pot Pit Pot - -( 


EXAMPLE 3 Suppose on average 3 out of 100 passengers with reservations don’t 
show up for a flight. If the plane holds 98 passengers, what is the probability that 
someone will be bumped? 


If the passengers come independently to the airport, use the Poisson model with 2 
changed to 3. X is the number of no-shows, and X = n happens with probability p,: 


=¢ P= Te = 3e7>, 


There are 98 seats and 100 reservations. Someone is bumped if X¥ =0 or X = 1: 
chance of bumping = py + p, = e7°> + 3e7? = 4/20. 


We will soon define the average or expected value or mean of X—this model has yp = 3. 
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CONTINUOUS RANDOM VARIABLES 


If X is the lifetime of a VCR, all numbers X > 0 are possible. If X is a score on the 
SAT, then 200 < X < 800. If X is the fraction of computer owners in a poll of 600 
people, X is between 0 and 1. You may object that the SAT score is a whole number 
and the fraction of computer owners must be 0 or 1/600 or 2/600 or .... But it is 
completely impractical to work with 601 discrete possibilities. Instead we take X to 
be a continuous random variable, falling anywhere in the range X > 0 or [200, 800] or 
0< X <1, Of course the various values of X are not equally probable. 


EXAMPLE 4 The average lifetime of a VCR is 4 years. A reasonable model for break- 
down time is an exponential random variable. Its probability density is 
p(x)=fe ** for 0O<x<o. 
The probability that the VCR will eventually break is 1: 
2 Lex dx =[ —e 4 | =0-CI)=1. (4) 
The probability of breakdown within 12 years (X from 0 to 12) is .95: 
f3? 4e-** dx =| —e7*4 |? = 0-3 +1 95, (5) 


An exponential distribution has p(x)=ae~™. Its integral from 0 to x is F(x)= 
1—e-™. Figure 8.11 is the graph for a = 1. It shows the area up to x=1. 
To repeat: The probability that a < X <b is the integral of p(x) from a to b. 


2 3 
2/e 1 * 4.5¢ t ’ 1 
Qe 4 ” 
le2 @ p= 3e p= 
" ale eo 1 nte> 
ut uf 
0 1 2 3 oO : 2 3 


Fig. 8.44 Probabilities add to £ p, = 1. Continuous density integrates to { p(x) dx = 1. 


EXAMPLES We now define the most important density function. Suppose the 
average SAT score is 500, and the standard deviation (defined below—it measures the 
spread around the average) is 200. Then the normal distribution of grades has 


ee 500)2/2{200)7 


1 
= for -o<x< x, 
PO) 300 jan 


This is the normal {or Gaussian) distribution with mean 500 and standard deviation 
200. The graph of p(x) is the famous bell-shaped curve in Figure 8.12. 

A new objection is possible. The actual scores are between 200 and 800, while the 
density p(x) extends all the way from — oo to oo. I think the Educational Testing 
Service counts all scores over 800 as 800. The fraction of such scores is pretty small— 
in fact the normal distribution gives 


x 


Prob {X > 800} = | ge &~ $00)7/2(200)? dy =~ 0013. (6) 


I 
800 200,/2n 
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Fig. 8.142 The nonnal distribution (bell-shaped curve) and its cumulative density F(x). 


~x2 


Regrettably, e~* has no elementary antiderivative. We need numerical integration. 
But there is nothing the matter with that! The integral is called the “error function,” 
and special tables give its value to great accuracy. The integral of e~*”? from — oo 
to oo is exactly /2n. Then division by Jn keeps [ p(x) dx = 1. 

Notice that the normal distribution involves two parameters. They are the mean 
value (in this case 4 = 500) and the standard deviation {in this case o = 200). Those 
numbers mu and sigma are often given the “normalized” values p = 0 and a = 1: 


-y2 
eel 


p(x) = J e #720? becomes p(x) = 
CG 

The beli-shaped graph of p is symmetric around the middle point x = 4. The width 
of the graph is governed by the second parameter o—which stretches the x axis and 
shrinks the y axis (leaving total area equal to 1). The axes are labeled to show the 
standard case 4 = 0, o = 1 and also the graph for any other y and o. 

We now give a name to the integral of p(x). The limits will be — oo and x, so the 
integral F(x} measures the probability that a random sample ts below x: 


Prob{X <x}=J* , p(x) dx = cumulative density function F(x). (7) 


F(x) accumulates the probabilities given by p(x), so dF/dx = p(x). The total prob- 
ability is F(oo) = 1. This integral from — 00 to,oo covers all outcomes. 

Figure 8.12b shows the integral of the bell-shaped nonmal distribution. The middle 
point x = yz has F = §. By symmetry there is a 50-50 chance of an outcome below the 
mean. The cumulative density F(x) is near .16 at 2 —o and near .84 at y+o. The 
chance of falling in between is .84— .16=.68. Thus 68% of the outcomes are less 
than one deviation o away from the center y. 

Moving out to 4~ 20 and p+ 20, 95% of the area is in between. With 95% 
confidence X is less than two deviations from the mean. Only one sample in 20 is 
further out (less than one in 40 on each side). 

Note that ¢ = 200 is not the precise value for the SAT! 


MEAN, VARIANCE, AND STANDARD DEVIATION 


In Example 1, X was the number of coin tosses until the appearance of heads. The 
probabilities were p; = 3, pp =%, P3= 4, .... What is the average number of tosses? 
We now find the “mean” yz of any distribution p(x)—not only the normal distribution, 
where symmetry guarantees that the built-in number 4 is the mean. 

To find ., multiply outcomes by probabilities and add: 


= mean = )) np, = 1(p1} + 2(p2) + 3(p3) + (8) 
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The average number of tosses is 1(3)+ 2(4)+ 3(4)+ +--+. This series adds up (in 
Section 10.1) to «= 2. Please do the experiment 10 times, I am almost certain that 
the average will be near 2. 


When the average is 4 = 2 quiz errors or A = 3 no-shows, the Poisson probabilities 
are p, = A"e */n!t Check that the formula 7 = np, does give A as the mean: 


A gk ak mn AB Se rer 
ijt25 sage |e wife bsFe le =he*e*=A, 


For continuous probability, the sum »=Znp, changes to u=J xp(x}dx. We 
multiply outcome x by probability p(x) and integrate. In the VCR model, integration 
by parts gives a mean breakdown time of u = 4 years: 


J x p(x) dx = fp x(Ze~*!*) dx = [ ~xe-* ~ de-** | = 4. (9) 


Together with the mean we introduce the variance. It is always written o”, and in 
the normal distribution that measured the “width” of the curve. When a” was 2007, 
SAT scores spread out pretty far. If the testing service changed to a? = 1”, the scores 
would be a disaster. 95% of them would be within +2 of the mean. When a teacher 
announces an average grade of 72, the variance should also be announced—if it is 
big then those with 60 can relax. At least they have company. 


8E The mean zp is the expected value of X. The variance o” is the expected 
value of (X — mean)* = (X — y)?. Multiply outcome times probability and add: 


p= np, 0? =) (n— uy" Da (discrete) 


w=J®,, xp(x) dx a? = [2 (x— )* p(x) dx (continuous) 


The standard deviation (written c) is the square root of a?. 
EXAMPLE 6 (Yes-no poll, one person asked) The probabilities are p and 1 — p. 


A fraction p = 4 of the population thinks yes, the remaining fraction 1 — p = 3 thinks 
no. Suppose we only ask one person. If X = 1 for yes and X = 0 for no, the expected 
value of X is p= p=4. The variance is a? = p(1 — p)=8: 


aQ)aGb me eI) 


The standard deviation is ¢ = Jf 2/9. When the fraction p is near One or near zero, 
the spread is smailer—and one person is more likely to give the right answer for 
everybody. The maximum of a? = p(1— p) is at p= 4, where o = 4. 

The table shows yp and a? for important probability distributions. 


Mode! Mean Variance Application 
Pi=P,Po=l—p Pp p{l — p) yes-no 
Poisson p, = 4’e “/n! A A random occurrence 
Exponential p(x)= ae %* tla L/a? waiting time 
Normal p(x) ‘= ee BP 20? H a? distribution 


around mean 


8.4 Probability and Calculus 
THE LAW OF AVERAGES AND THE CENTRAL LIMIT THEOREM 


We come to the center of probability theory (without intending to give proofs). The 
key idea is to repeat an experiment many times—poll many voters, or toss many 
dice, or play considerable poker. Each independent experiment produces an outcome 
X, and the average from N experiments is X. It is called “X bar’’: 


gatithat + ade! +X 
N 

All we know about p(x) is its mean y and variance a’. It is amazing how much 

information that gives about the average X: 


= average outcome. 


8F Law of Averages: X is almost sure to approach yp: as N -> oo. 


Central Limit Theorem: The probability density py{x) for X approaches 
a normal distribution with the same mean y and with variance o?/N. 


No matter what the probabilities for X , the probabilities for X move toward the normal 
bell-shaped curve. The standard deviation is close to o}./N when the experiment is 
repeated N times. In the Law of Averages, “almost sure” means that the chance of 
X not approaching x is zero. It can happen, but it won't. 


Remark I The Boston Globe doesn’t understand the Law of Averages. I quote from 
September 1988: “What would happen if a giant Red Sox slump arrived? What would 
happen if the fabled Law of Averages came into play, reversing al] those can’t miss 
decisions during the winning streak?” They think the Law of Averages evens every- 
thing up, favoring heads after a series of tails. See Problem 20. 


EXAMPLE 7 Yes-no poll of N = 2500 voters. Is a 53%-47% outcome conclusive? 


The fraction p of “yes” voters in the whole population is not known. That is the reason 
for the poil. The deviation a = ./p{1 — p) is also not known, but for one voter this is 
never more than } (when p= 4). Therefore a}./N for 2500 voters is no larger than 
$/./2500, which is 1%. 

The result of the poll was ¥ = 53%. With 95% confidence, this sample is within 
two standard deviations (here 2%) of its mean. Therefore with 95% confidence, the 
unknown mean j= p of the whole population is between 51% and 55%. This poll is 
conclusive. 

If the true mean had been p = 50%, the poll would have had only a .0013 chance 
of reaching 53%. The error margin on each side of a poll is amazingly simple; it is 
always 1/./N . 


Remark 2 The New York Times has better mathematicians than the Globe. Two 
days after Bush defeated Dukakis, their poll of N = 11,645 voters was printed with 
the following explanation. “In theory, in 19 cases out of 20 [there is 95%] the results 
should differ by no more than one percentage point [there is 1/./N] from what 
would have been obtained by seeking out all voters in the United States.” 


EXAMPLE 8 Football players at Caltech (if any) have average weight 4 = 210 pounds 
and standard deviation ¢ = 30 pounds. Are N = 16 players safe on an elevator with 
capacity 3600 pounds? 16 times 210 is 3360. 


333 


334 


8 Applications of the Integral 


The average weight X is approximately a normal random variable with 7 = 210 and 
T= 30/,/N = 30/4. There is only a 2% chance that X is above p+ 2¢ = 225 (see 
Figure 8.12b—weights below the mean are no problem on an elevator). Since 16 
times 225 is 3600, a statistician would have 98% confidence that the elevator is safe. 
This is an example where 98% is not good enough——I wouldn’t get on. 


EXAMPLE9 (The famous Weldon Dice) Weldon threw 12 dice 26,306 times and 
counted the 5’s and 6’s. They came up in 33.77% of the 315,672 separate rolls. Thus 
X = .3377 instead of the expected fraction p =} of 5’s and 6’s. Were the dice fair? 


The variance in each roll is 7 = p(1—p)= 2/9. The standard deviation of X is 
g= of /N= /2/9/ /315672 = 00084. For fair dice, there is a 95% chance that X 
will differ from 4 by less than 2¢. (For Poisson probabilities that is false. Here X is 
normal.) But .3377 differs from .3333 by more than 5¢. The chance of falling 5 standard 
deviations away from the mean is only about 1 in 10,000.T 

So the dice were unfair. The faces with 5 or 6 indentations were lighter than the 
others, and a little more likely to come up. Modern dice are made to compensate for 
that, but Weldon never tried again. 


8.4 EXERCISES 


Read-through questions 


Discrete probability uses counting, 9 probability uses 
calculus. The function p(x) ts the probability _b . The 
chance that a random variable fails between a and bis _c 
The total probability is fou p(ix)dx=_d_. Jn the dis- 
crete case Lp,=__9 . The mean (or expected value} 
is z= {in the continuous case and p=EZ ap, in 
the g . 


The Poisson distribution with mean 4 has p,=__h _. The 


sum X p,= 1 comes from the __t__ series, The exponential 
distribution has p(x) = e7* or 2e7?* or __j__. The standard 
Gaussian (or __k__} distribution has / 2a pix) =e "2 Its 
graph is the well-known __|__ curve. The chance that the 


variable falls below x is F(x) =__m_. F isthe n__ density 


function, The diflerence F(x +dx)— F(x} is about _o 
which is the chance that X ts between x and x + dx. 


The variance, which measures the spread around x, is 
o7=§__p_ in the continuous case and 0? =£__q_ in the 
discrete case. Its square root ¢ is the _t . The normal 
distribution has p(x)=__s If Xisthe__t of N samples 
from any population with mean py and variance o?, the Law 
of Averages says that X will approach _u__. The Central 


Limit Theorem says that the distribution for X¥ approaches 
y__, Its mean is_w__ and its vananceis _ x 


In a yes-no poll when the voters are 50-50, the mean for 
one voter is p=O(4)+14)=_y_. The variance is 
(O— u)*po +(1—pyp, =__2_. Fora poll with N = 100, @ ts 


A_, There is a 95% chance that X (the fraction saying yes) 
will be between Band _¢ 


1 If p, =4, pp =4, p3 =4, ..., What is the probability of an 
outcome X <4? What are the probabilities of X =4 and 
X>@ 


2 With the same p, =(3)", what is the probability that X is 
odd? Why is p,=(4)" an impossible set of probabilities? 
What multiple c(4)" is possible? 


3 Why is p(x) =e 2* not an acceptable probability density 
for x 20? Why is p(x}=4e ** —e~* not acceptable? 


*4 If p, =(4)", show that the probability P that X is a prime 
number satisfies 6/16 < P <7/16. 


§ If p(x) =e * for x > 0, find the probability that X > 2 and 
the approximate probability that 1 <x <1.01. 


6 If p(x) =C/x? is a probability density for x 21, find the 
constant C and the probabitity that X < 2. 


7 If you choose x completely at random between 0 and z, 
what is the density p(x) and the cumulative density F(x)? 


tJoe DiMaggio’s 56-game hitting streak was much more improbable—] think it is statistically 


the most exceptional record in major sports. 
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In 8-13 find the mean value 4 = np, or p = | xp(x) dx. 
8 po = 1/2, py = 1/4, p2 = 1/4 
9 py = 1/7, po = 1/7, ..., po = 1/7 

10 p, = \fnte (po = 1/e, py = Ve, po = 1/2e, ...) 

tl p(x)=2/z(14x7), x20 

12 p(x) =e * {integrate by parts) 

13 p(x) = ae~™ (integrate by parts) 


14 Show by substitution that 


[7 eT dx=/20 f?, eo du = fro. 


15 Find the cumulative probability F (the integra] of p) in 
Problems 11, 12, 13. In terms of F, what is the chance that a 
randoin sample lies between a and 5? 


16 Can-Do Airlines books [00 passengers when their plane 
only holds 98. If the average number of no-shows is 2, what 
is the Poisson probability that someone will be bumped? 


17 The waiting time for a bus has probability density 
(1/10)e~*/1°, with 4 = 10 minutes. What is the probability of 
waiting longer than {0 minutes? 


18 You make a 3-minute telephone call. If the waiting time 
for the next incoming call has p(x)=e-*, what is the prob- 
ability that your phone will be busy? 


19 Supernoyas are expected about every 100 years. What is 
the probability that you will be alive for the next one? Use a 
Poisson model with A = .O1 and estimate your lifetime. (Super- 
novas actually occurred in 1054 (Crab Nebula), 1572, 1604, 
and 1987. But the future distribution doesn’t depend on the 
date of the fast one.) 


20 {a} A fair coin comes up heads [0 times in a row, Will 
heads or tails be more likely on the next toss? 


(b) The fraction of heads after N tosses is x. The expected 
fraction afler 2N tosses is 


23 Show that the area between » and +o under the bell- 
shaped curve is a fixed number (near 1/3), by substituting 
;= 


le 1 ‘ eis: 5 I, 1 
en xo ar x= 
# o/2n oy 2a 


What is the area between —o and yz? The area outside 
(zo, p+0)? 


ev”? dy. 


22 For a yes-no poll of two yoters, explain why 
Po=(1—p), pi = 2p — 2p, p2 =p. 


Find y and ¢”. N voters give the “binomial distribution.” 


23 Explain the last step in this reorganization of the formula 
for o?: 


a? =f (x—p)* p(x) dx = J (x? — xp + 7) p(x) dx 
= § x? p(x) dx — 2p f xp(x) dx + np? J p(x) dx 
=f x? p(x) dx — p*. 


24 Use f (x —p)* p(x) dx and also { x? p(x) dx — y? to find a? 
for the uniform distribution: p(x) = 1 for O<x< 1. 


25 Find a? if po = 1/3, p, = 1/3, pp = 1/3. Use L(n — pz)? p, and 
also En? p, — 47. 


26 Use Problem 23 and integration by parts (equation 7.1.10) 
to find o? for the exponential distribution p(x) = 2e~?* for 
x20, which has mean 4, 


27 The waiting time to your next car accident has probability 
density p(x) =te~*?. What is x? What is the probability of 
no accident in the next four years? 


28 With p=}, }, f, ..., find the average number yp of coin 
tosses by writing p;+2p.+3p3+--: aS (pit+pat+Pp3t---}+ 
(P2+P3+ pat ---}+(P3+ Pat Pst oe) + --. 

29 In a poll of 900 Americans, 30 are in favor of war. What 


range can you give with 95% confidence for the percentage 
of peaceful Americans? 


30 Sketch rough graphs of p(x) for the fraction x of heads in 
4 tosses of a fair coin, and in 16 tosses. The mean value is }. 


31 A judge tosses a coin 2500 times. How many heads does 
it take to prove with 95% confidence that the coin is unfair? 


32 Long-life bulbs shine an average of 2000 hours with stan- 
dard deviation 150 hours. You can have 95% confidence that 
your bulb will fail between ss arn hours. 


33 Grades have a normal distribution with mean 70 and stan- 
dard deviation 10. If 300 students take the test and passing is 
55, how many are expected to fail? (Estimate from 
Figure 8.12b.) What passing grade will fail 1/10 of the class? 


34 The average weight of luggage is x = 30 pounds with devi- 
ation o = 8 pounds. What is the probability that the luggage 
for 64 passengers exceeds 2000 pounds? How does the answer 
change for 256 passengers and 8000 pounds? 


35 A thousand people try independently to guess a number 
between 1 and 1000. This is like a lottery. 


(a) What is the chance that the first person fails? 
(b) What is the chance Po that they all fail? 
(c) Explain why Py is approximately L/e. 
36 (a) In Problem 35, what is the chance that the first person 
is right and all others are wrong? 


(b} Show that the probability P, of exactly one winner is 
also close to 1/e. 


(c) Guess the probability P, of n winners (fishy question). 
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Re §88.5 Massesand Moments 


This chapter concludes with two sections related to engineering and physics. Each 
application starts with a finite number of masses or forces. Their sum is the total 
mass or total force. Then comes the ‘‘continuous case,”’ in which the mass is spread 
out instead of lumped. Its distribution is given by a density function p (Greek rho}, 
and the sum changes to an integral. 

The first step (hardest step?) is to get the physical quantities straight. The sccond 
step is to move from sums to integrals (discrete to continuous, lumped to distributed). 
By now we hardly stop to think about it—although this is the key idea of integral 
calculus. The third step is to evaluate the integrals. For that we can use substitution 
or integration by parts or tables or a computer. 

Figure 8.13 shows the one-dimensional case: masses along the x axis. The total 
mass is the sum of the masses. The new idea is that of moments—when the mass or 
force is multiplied by a distance: 


moment of mass around the » axis = mix = (mass) times (distance to axis). 


mass | Mt) + fis 


ta eM a) sa a 


rs 
torque =fi 


Py + Fix, +f ov, 


iM 


iy +t, FAV = Myx 


moment 7) 


Fig. 8.43 The center of mass ts at ¥ = (total moment):{total mass) = average distance. 


The figure has masses 1, 3, 2. The total mass ts 6. The “lever arms” or “moment 
arms” are the distances x = 1, 3,7. The masses have moments | and 9 and 14 (since 
mx is 2 times 7). The total moment is t + 9+ 14= 24. Then the balance point is at 
x= M,/M = 24/6=4. 

The total mass is the sum of the m’s. The total moment is the sum of m, times x, 
(negative on the other side of x =0). If the masses are children on a seesaw, the 
balance point is the center of gravity x—also called the center of mass: 


__ ¥im,x, _ total moment 
g= = ba 


DEFINITION 
Ym, total mass 


(1) 
If all masses are moved to ¥, the total moment (6 times 4) is still 24. The moment 
equals the mass Z m, times x. The masses act like a single mass at x. 

Also: If we move the axis to Xx, and leave the children where they are, the seesaw 
balances. The masses on the left of <= 4 will offset the mass on the right. Reasoit: 
The distances to the new axis are x, -- ¥. The moments add to zero by equation (1): 


moment around new axis =} m,(x,— ©) = ¥ nt,X_— Ym, X = 0. 


Turn now to the continuous case. when mass is spread out along the line. Each 
piece of length Ax has an average density p, = (mass of piece)/(length of picce) = 
Am;Ax. As the pieces get shorter, this approaches dmidx—the density at the point. 
The limit of (small mass)}(small length) is the density p(x). 

Integrating that derivative p = dmi‘dx, we recover the total mass: L p, Ax becomes 


total mass M = | p(x) dx. (2) 


When the mass is spread evenly, p is constant. Then M = pL = density times length. 


8.5 Masses and Moments 


The moment formula is similar. For each piece, the moment is mass p, Ax 
multiplied by distance x—and we add. In the continuous limit, p(x) dx is multiplied 
by x and we integrate: 


total moment around y axis = M,= j xp(x) dx. (3) 


Moment is mass times distance. Dividing by the total mass M gives “average 
distance”’: 


._™moment M, J xp(x) dx 
center of mass x = =e Mo Tpehde: 


(4) 


Remark If you studied Section 8.4 on probability, you will notice how the formulas 
match up. The mass § p(x)dx is like the total probability J p(x) dx. The moment 
§ xp(x) dx is like the mean J xp(x) dx. The moment of inertia { (x — x)’ p(x) dx is the 
variance. Mathematics keeps hammering away at the same basic ideas! The only 
difference is that the total probability is always 1. The mean really corresponds to 
the center of mass %, but in probability we didn’t notice the division by f p(x) dx = 1. 


EXAMPLE 4 With constant density p from 0 to L, the mass is M = pL. The moment 
is 


L 
M, = Jo xe dx = tox? | =jpL’. 
The center of mass is x= M,/M = L/2. It is halfway along. 


EXAMPLE 2 With density e~* the mass is 1, the moment is 1, and x is 1: 


~® 


ae dx=|-e* |P=1 and ff xe * dx =| —xe*-e* | =1. 
MASSES AND MOMENTS IN TWO DIMENSIONS 


Instead of placing masses along the x axis, suppose m, is at the point (x,, y,) in the 
plane. Similarly m, is at (x,, y,). Now there are two moments to consider. Around the 
yaxis M,= 2 m,x, and around the x axis M, = £ m,y,. Please notice that the x’s go 
into the moment M,—because the x coordinate gives the distance from the y axis! 

Around the x axis, the distance is y and the moment is M,. The center of mass is 
the point (x, 7) at which everything balances: 


‘M > (3) 


In the continuous case these sums become two-dimensional integrals. The total 
mass is {jf p(x, y)dx dy, when the density is p= mass per unit area. These ‘double 
integrals” are for the future (Section 14.1), Here we consider the most important case: 
p=constant. Think of a thin plate, made of material with constant density (say 
p= 1). To compute its mass and moments, the plate is cut into strips (Figure 8.14): 


mass M = area of plate {6) 
moment M, = § (distance x) (length of vertical strip} dx (7) 
moment M, = | (height y) (length of horizontal strip) dy. (8) 
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centroid y=4-20 _ =_af,2_.2 ae ee 
y= 73, y= 430 ay =? xs-¥r?-y =yr7-y 
centroid @ 
y=4r/3n 
dx 2 2 -r 0 r 


Fig. §.14 Plates cut into strips to compute masses and moments and centroids. 


The mass equals the area because p = 1. For moments, all points in a vertical strip 
are the same distance from the y axis. That distance is x. The moment is x times area, 
or x times length times dx—and the integral accounts for all strips. 

Similarly the x-moment of a horizontal strip is y times strip length times dy. 


EXAMPLE 3 A plate has sides x= 0 and y=0 and y=4-— 2x. Find M, M,, M,. 
mass M = area = {2 y dx = {9 (4— 2x) dx= [4x- x? ]}= 4. 


The vertical strips go up to y = 4 — 2x, and the horizontal strips go out to x = (4 — y): 
2 24 Pk 
moment M, = | x(4 — 2x) dx = [27 - | == 
0 30 lo 3 


vie 2_! ,/°_ 16 
moment M,=} y=(4-y)dy=|y?--y®] =—. 
0 2 6 0 3 


The “center of mass” has xX = M,/M = 2/3 and ¥ = M,/M = 4/3. This is the centroid 
of the triangle (and also the ‘center of gravity”). With p = 1 these terms all refer to 
the same balance point (x, y). The plate will not tip over, if it rests on that point. 
EXAMPLE 4 Find M, and M, for the half-circle below x? + y? =7?. 


M, = 0 because the region is symmetric— Figure 8.14 balances on the y axis. In the 
x-Moment we integrate y times the length of a horizontal strip (notice the factor 2): 


M,= fy 2fF = ¥F dy = — 3? - yy? [= 9r?. 
Divide by the mass (the area $xr?) to find the height of the centroid: y= M,/M = 


4r/3x. This is less than 4r because the bottom of the semicircle is wider than the top. 


MOMENT OF INERTIA 
The moment of inertia comes from multiplying each mass by the square of its 
distance from the axis. Around the y axis, the distance is x. Around the origin, it is r: 
I,=ZLx2m, and I,=Ly2m, and Io=L remy. 


Notice that I, + 1, = [I because x3 + yz; =r}. In the continuous case we integrate. 

The moment of inertia around the y axis is 1, = {{ x’ p(x, y) dx dy. With a constant 
density p = 1, we again keep together the points on a strip, On a vertical strip they 
share the same x. On a horizontal strip they share y: 


1, =f (x) (vertical strip length) dx and 1, = {y*) (horizontal strip length) dy. 


8.5 Masses ond Moments 


In engineering and physics, it is retation that leads to the moment of inertia. Look 


at the energy of a mass m going around a circle of radius r. It has Ig = mr?. 


kinetic energy = mv? = 4m(rw)? =f 1a. (9) 


The angular velocity is w (radians per second). The speed is » = rw (meters per second). 

An ice skater reduces I, by putting her arms up instead of out. She stays close to 
the axis of rotation {r is small). Since her rotational energy +/,«” does not change, 
@ increases as Ig decreases. Then she spins faster. 


Another example: It takes force to turn a revolying door. More correctly, it takes 
torque, The force is multiplied by distance from the turning axis: T= Fx, so a push 
further out is more effective. 

To see the physics, replace Newton’s law F = ma = m dv/dt by its rotational form: 
T= 1 de/dt. Where F makes the mass move, the torque T makes it turn. Where m 
measures unwillingness to change speed, / measures unwillingness to change rotation. 


EXAMPLE 5 Find the moment of inertia of a rod about (a) its end and (b) its center. 


The distance x from the end of the rod goes from 0 to L. The distance from the center 
goes from — L/2 to L/2. Around the center, turning is easier because / is smaller: 


Bena =§o x7 AX=4L5 — Moemter = $2 x? dx = (10) 


df=atdm 


df=xtdm 


pI px lived 


Fig. 8.45 Moment of inertia for rod and propeller. Rolling balls beat cylinders. 


MOMENT OF INERTIA EXPERIMENT 


Experiment. Roll a solid cylinder {a coin), a hollow cylinder (a ring), a solid ball (a 
marble). and a hollow ball (not a pingpong ball) down a slope. Galileo dropped things 
from the Leaning Tower—. this experiment requires a Leaning Table. Objects that fall 
together from the tower don’t roll together down the table. 


Question 1 What is the order of finish? Record your prediction first! 
Question 2. Does size make a diffcrence if shape and density are the same? 
Question 3. Does density make a difference if size and shape are the same? 


Question 4 Find formulas for the velocity v and the finish time T. 


To compute v, the key is that potential energy plus kinetic energy is practically 
constant. Energy loss from rolling friction is very smatl. If the mass is m and the 
vertical drop is h, the energy at the top (all potential) is mgh. The energy at the bottom 
{all kinetic) has two parts: }mz? from movement along the plane plus 4/w? from 
turning. /mportant fact: v= wr for a rolling cylinder or ball of radius r. 
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Equate energies and set w = v/r: 


! 
mgh = 3 me + y lw 


2 


i i 
= i ———* 
3m ( =) (11) 


The ratio I/mr? is critical. Call it J and solve (11) for v?: 


= 2gh 
i+t/J 


{smaller J means larger velocity). (12) 


The order of J's, for different shapes and sizes, should decide the race. Apparently 
the density doesn’t matter, because it is a factor in both f and m—so it cancels in 


J =I mr’. A hollow cylinder has J = 1, which is the largest possible 


all its mass is 


at the full distance r from the axis. So the hollow cylindcr should theoretically come 
in last. This experiment was developed by Daniel Drucker. 

Problems 35--37 find the other three J's. Problem 40 finds the time T by integration. 
Your experiment will show how close this comes to the measured time. 


8.5 EXERCISES 


Read-through questions 


If masses m, are at distances x,, the total mass is M= 9 
The total moment around x =Q0is M,=__® _. The center of 
mass ig alx=_ © __. In the continuous cuse, the mass distri- 
bution is given by the _ 4 p(x). The total mass is M = 

6 and the center of mass ts atx=_ ¢ . With p=x, 
the integrals from 0 to L give M=_9@ and { xp{x)dx = 

hand x=__J__. The total moment is the same if the 
whole mass M is placed at __1| 


Ina plane, with masses m, at the points (x,. 1',), the moment 
around the y axis is__k __. The center of mass has X = __! 
and ;=__™_. For a plate with density p= 1, the mass M 
equals the _ f__. [f the plate is divided into vertical strips of 
height j(x), then M =f y(x)dx and M,=[_ 0 dx. Fora 


square plate O< x, )}< L. the mass is M=_ © and the 
moment around the yaxisis M,= _ a. The center of mass 
is at (X,f)=_ t . This point is the ___$__, where the plate 
balances. 


A mass m at a distance x from the axis has moment of 
inertia /=_ t  . A rod with p=1 from x =a to x =b has 
i,=_4__. For a plate with » =1 and strips of height (1), 
this becomes i=J v__. The torque T is _¥ _ times 

x 


Compute the mass M along the x axis, the moment Af, around 
x =0, and the center of mass * = M,/M. 


1m, =2 atx, =1, m1, 
2 m=3atx=O, 1, 2,6 


3 p=! for -I<x<3 


4atx,=2 


4 p=x? forO5x<h 


$ p=l|forO<x<1,p=2forl<x<2 


6 p=sinxfor0<x<z 


Find the mass M, the moments M, and M,, and the center of 
mass (X, #). 


7 Unit masses at (x, ¥) = (1, 0), (0, 1), and (1, 1) 
8 m, = [ at (1,0), m, =4 at (0, 1) 
9 p=7in the square0Sx<lO<}<l. 


10 » = 3 in the triangle with vertices (0, 0), (2, 0}, and (0, d). 


Find the area M and the centroid (¥, \') inside curves 11-16. 
Woy= laa y=0,x=0 (quarter-circle) 

12 y=x.y=2- x,y =0 (triangle) 

13 y=e°?* + +0,x=0 (infinite dagger) 

14 y=x?, y= (lens) 

15 x?4y7=1, x7 +1? =4 (ring) 


16 x? 44% = 1x74 57 =4, ¢=0  thall-ring). 


Verify these engineering formulas for /, with p = L: 


17 Rectangle bounded by x=0Q x=a, y=0, -=b: 
i= asp. 


18 Square bounded by x=—}a, x=4a, y=—4a, yada: 
I, =a*12. 


19 Triangle bounded by x=0. y=O0, x+y =a f,= 04/12. 


20 Disk of radius a centered at x =y=0: I, = nas 4, 
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21 The moment of inertia around the point x =r of a rod 
with density p(x) is I = | (x — 1)? p(x) dx. Expand (x — 1)? and 
I into three terms. Show that d/J/dt=0 when t=x. The 
moment of inertia is smallest around the center of mass. 


22 A region has x=0 if M,=f x(height of strip) dx =0. 
The moment of inertia about any other axis x=c is 
I=f(x—c)(height of strip)dx. Show that /=I,+ 
(area)(c?). This is the parallel axis theorem: I is smallest 
around the balancing axis c = 0. 


23 (With thanks to Trivial Pursuit) In what state is the center 
of gravity of the United States—the “geographical center” or 
centroid? 


24 Pappus (an ancient Greek) noticed that the volume is 
V =f 2ny(strip width) dy = 2xM, = 2njM 


when a region of area M is revolved around the x axis. In the 
first step the solid was cut into 


\ volume 2m y M 


ee 


25 Use this theorem of Pappus to find the volume of a torus. 
Revolve a disk of radius a whose center is at height j = b >a. 


26 Rotate the triangle of Example 3 around the x axis and 
find the volume of the resulting cone—first from V = 22yM, 
second from 42r7h. 


27 Find M, and M, for a thin wire along the semicircle 
y=,/1—x?. Take p=1s0 M =length =z. 


28 A second theorem of Pappus gives A = 2xjL as the surface 
area when a wire of length L is rotated around the x axis. 
Verify his formula for a horizontal wire along y=3 (x =0 
to x = L) and a vertical wire (y= 1 to y=L+ 1). 


length L 


(r2)(2n rdr) om 


29 The surface area of a sphere is A= 4x whenr=1. So A= 
2nyL leads to p= for the semicircular wire in 
Problem 27. 


30 Rotating y=mx around the x axis between x =0 and 
x =1 produces the surface area A = 


31 Put a mass m at the point (x, 0). Around the origin the 
torque from gravity is the force mg times the distance x. This 
equals g times the mx. 


32 If ten equal forces F are alternately down and up at 
x =1,2,..., 10, what is their torque? 


33 The solar system has nine masses m, at distances r, with 
angular velocities w,. What is the moment of inertia around 
the sun? What is the rotational energy? What is the torque 
provided by the sun? 


34 The disk x? + y? <a” has Ig = [¢, r?2nr dr =4na*. Why is 


this different from J, in Problem 20? Find the radius of 
gyration F =./I9/M. (The rotational energy $/)w? equals 
4Mr? w?—when the whole mass is turning at radius 7.) 


Questions 35—42 come from the moment of inertia experiment. 


35 A solid cylinder of radius r is assembled from hollow cylin- 
ders of length /, radius x, and volume (27x)(I)(dx). The solid 
cylinder has 

mass M =f), 2xxlp dx and 1 =f), x?2nxlp dx. 
With p =7 find M and J and J = 1/Mr?. 
36 Problem 14.4.40 finds J = 2/5 for a solid ball. It is less 


than J for a solid cylinder because the mass of the ball is 
more concentrated near 


37 Problem 14.4.39 finds J =4 [} sin? @ dp = for a 
hollow ball. The four rolling objects finish in the order 


38 By varying the density of the ball how could you make it 
roll faster than any of these shapes? 


39 Answer Question 2 about the experiment. 


40 For a vertical drop of y, equation (12) gives the velocity 
along the plane: v? =2gy/(1+J). Thus v=cy'? for c= 
. The vertical velocity is dy/dt =v sin a: 
dy/dt=cy'?sina and fy '?dy=fc sina dt. 
Integrate to find y(t). Show that the bottom is reached 
(y=h) at time T= 2,/h/e sin a. 
41 What is the theoretical ratio of the four finishing times? 


42 True or false: 
(a) Basketballs roll downhill faster than baseballs. 
(b) The center of mass is always at the centroid. 
(c) By putting your arms up you reduce /, and /,. 


(d) The center of mass of a high jumper goes over the bar 
(on successful jumps). 
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Re = 8.6 ~Force, Work, and Energy | 


Chapter | introduced derivatives df/dt and df/dx. The independent variable could be 
t or x. For velocity it was natural to use the letter r. This section is about two 
important physical quantities—foerce and werk—for which x is the right choice. 

The basic formula is W= Fx. Work equals force times distance moved (distance in 
the direction of F). With a force of 100 pounds on a car that moves 20 feet, the work 
is 2000 foot-pounds. If the car is rolling forward and you are pushing backward, the 
work is — 2000 foot-pounds. If your force is only 80 pounds and the car doesn’t 
move, the work is zero. In these examples the force is constant. 

W = Fx is completely parallel to f= vt. When v is constant, we only need muilti- 
plication. It is a changing velocity that requires calculus. The integral { v(t) dt adds 
up small multiplications over short times. For a changing force, we add up small 
pieces of work F dx over short distances: 


W=Fx (constant force) W= j F(x) dx (changing force). 


In the first case we lift a suitcase weighing F = 30 pounds up x = 20 feet of stairs. 
The work is W = 600 foot-pounds. The suitcase doesn’t get heavier as we go up—it 
only seems that way. Actually it gets lighter (we study gravity below). 

In the second case we stretch a spring, which needs more force as x increases. 
Hooke’s law says that F(x) = kx. The force is proportional to the stretching distance x. 
Starting from x = 0, the work increases with the square of x: 


F=kx and We [fj kx dx = fkx?. (1) 


In metric units the force is measured in Newtons and the distance in meters. The unit 
of work is a Newton-meter (a joule). The 600 foot-pounds for an American suitcase 
would have been about 800 joules in France. 


EXAMPLE 4 Suppose a force of F = 20 pounds stretches a spring ! foot. 
{a) Find k. The elastic constant is k = F/x = 20 pounds per foot. 
{b) Find W. The work is 4kx* = $+ 20- 1? = 10 foot-pounds. 
(c) Find x when F=—10 pounds. This is compression not stretching: x = — § foot. 


Compressing the same spring through the same distance requires the same work. For 
compression x and F are negative. But the work W=4kx? is still positive. Please 
note that W does not equal kx times x! That is the whole point of variable force 
(change Fx to { F(x) dx). 


May I add another important quantity from physics? It comes from looking at the 
situation from the viewpoint of the spring. In its natural position, the spring rests 
comfortably. It feels no strain and has no energy. Tension or compression gives it 
potential energy. More stretching or more compression means more energy. The 
change in energy equals the work. The potential energy of the suitcase increases by 
600 foot-pounds, when it is lifted 20 feet. 

Write V(x) for the potential energy. Here x is the height of the suitcase or the 
extension of the spring. {n moving from x = a to x = 6, work = increase in potential: 


W = ff F(x) dx = V(b)~ V{a). (2) 


This is absolutely beautiful. The work W is the definite integral. The potential V is 
the indefinite integral. If we carry the suitcase up the stairs and back down, our total 
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work is zero. We may feel tired, but the trip down should have given back our energy. 
(It was in the suitcase.) Starting with a spring that is compressed one foot, and ending 
with the spring extended one foot, again we have donc no work. V = >kx? is the same 
for x = —1 and x= 1. But an extension from x = 1 to x = 3 requires work: 


W = change in V = $k(3)* — $k(1)’. 


Indefinite integrals like V come with a property that we know well. They include 
an arbitrary constant C. The correct potential is not simply $kx’, it is $kx* + C. To 
compute a change in potential, we don’t need C. The constant cancels. But to deter- 
mine V itself, we have to choose C. By fixing V=0 at one point, the potential 
is determined at all other points. A common choice is V¥=0 at x= 0. Sometimes 
V=0at x= <x (for gravity). Electric fields can be “grounded” at any point, 

There is another connection between the potential V and the force F. According 
to (2), V is the indefinite integral of F. Therefore F(x) is the derivative of V{x). The 
fundamental theorem of calculus is also fundamental to physics: 


force exerted on spring: F= dV /dx (3a) 
force exerted by spring: F = — dV/dx (3b) 


Those lines say the same thing. One is our force pulling on the spring, the other is 
the “restoring force” pulling back. (3a) and (3b) are a warning that the sign of F 
depends on the point of view. Electrical engineers and physicists use the minus sign. 
In mechanics the plus sign is more common. It is one of the ironies of fate that 
F = V’, while distance and velocity have those letters reversed: v = f’. Note the change 
to capital letters and the change to x. 


TE. mM ie _GMm Q om 
—_ ne . ' : Motion 
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Fig. 8.16 Stretched spring; suitcase 20 feet up; moon of mass m; oscillating spring. 


EXAMPLE 2 Newton’s faw of gravitation {inverse square law): 
force to overcome gravity = GMm/x? force exerted by gravity = — GMm/x? 


An engine pushes a rocket forward. Gravity pulls it back. The gravitational constant 
is G and the Earth’s mass is Mf. The mass of the rocket or satellite or suitcase is m, 
and the potential is the indefinite integral: 


Vix) =f F(x) dx = — GMm|x + C. (4) 


Usually C = 0, which makes the potential zero at x = oc. 


Remark When carrying the suitcase upstairs, x changed by 20 feet. The weight was 
regarded as constant—which it nearly is. But an exact calculation of work uses the 
integral of F(x), not just the multiplication 30 times 20. The serious difference comes 
when the suitcase is carried to x = oc. With constant force that requires infinite work. 
With the correct (decreasing) force, the work cquals V at infinity (which is zero) minus 
V at the pickup point x). The change in V is W= GMm/x. 
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KINETIC ENERGY 


This optional paragraph carries the physics one step further. Suppose you release the 
spring or drop the suitcase. The external foree changes to F = 0. But the internal 
force still acts on the spring, and gravity still acts on the suitcase. They both start 
moving. The potential energy of the suitcase is converted to kinetic energy, until it 
hits the bottom of the stairs. 

Time enters the problem, either through Newton’s law or Einstein’s: 


(Newton) F = ma= mo (Einstein) F = “ (me) (5) 
é 


Here we stay with Newton, and pretend the mass is constant. Exercise 21 follows 
Einstein; the mass increases with velocity. There m= mo/,/ 1 — v?/c? goes to infinity 
as v approaches c, the speed of light. That correction comes from the theory of 
relativity, and is not needed for suitcascs. 

What happens as the suitcase falls? From x = a at the top of the stairs to x = b at 
the bottom, potential energy is lost. But kinetic energy 4mv? is gained, as we see from 
integrating Newton’s law: 


de dv dx dt 


force Frm7 =m 7 =m 
° b db Be ith 

work Fdx= | mve— dx = —mv*(b)— = mv*(a). (6) 
e . dx 2 2 


This same force F is given by —dV/dx. So the work is also the change in V: 


b fb 
work | F dx = (- | dx = — V(b) + V(a). (7) 


Since (6) = (7), the total energy 4mv? + V (kinetic plus potential) is constant: 
sme? (b) + Vib) = 4me7(a) + Vea). (8) 


This is the law of conservation of energy. The total cnergy is conserved. 


EXAMPLE 3 Attach a mass m to the end of a stretched spring and let go. The spring's 
energy V =4kx? is gradually converted to kinetic energy of the mass. At x = 0 the 
change to kinetic energy is complete: the original $kx* has become 3mv?. Beyond 
x = 0 the potential energy increases, the force reverses sign and pulls back, and kinctic 
energy is lost. Eventually all energy is potential— when the mass reaches the other 
extreme. It is simple harmonic motion, exactly as in Chapter 1 (where (he mass was 
the shadow of a circiing ball). The equation of motion is the statement that the rate 
of change of energy is zero (and we cancel v = dx/dt): 

d 


é . . 2y 
dt (; me + : k*) = Wie . + kx - =) or m “s +kx =O. (9) 


That is F = ma in disguise. For a spring, the solution x =cos ./k/mt will be found 
in this book. For more complicated structures, engineers spend a billion dollars a 
year computing the solution. 
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PRESSURE AND HYDROSTATIC FORCE 


Our forces have been concentrated at a single points. That is not the case for pressure. 
A fluid exerts a force all over the base and sides of its container. Suppose a water 
tank or swimming pool has constant depth h (in meters or feet). The water has weight- 
density w + 9800 N/m? ~ 62 lb/ft?. On the base, the pressure is w times h. The force 
is wh times the base area A: 


F = whA (pounds or Newtons) p = F/A= wh (1b/ft* or N/m’). (10) 


Thus pressure is force per unit area. Here p and F are computed by multiplication, 
because the depth h is constant. Pressure is proportional to depth (as divers know). 
Down the side wall, h varies and we need calculus. 

The pressure on the side is still wh—the same in all directions. We divide the side 
into horizontal strips of thickness Ah. Geometry gives the length /(h) at depth h 
(Figure 8.17). The area of a strip is [(h) Ah. The pressure wh is nearly constant on the 
strip—the depth only changes by Ah. The force on the strip is AF = whlAh. Adding 
those forces, and narrowing the strips so that Ah > 0, the total force approaches an 
integral: 


total force F ={ whl(h) dh (11) 


/=60 
h=0 
I(h) 
ag 
length /=2nr h=20 
—— area A=nr? 1=50 


pressure p=wh 


Fig. 8.47 Water tank and dam: length of side strip =/, area of layer = A. 


EXAMPLE 4 Find the total force on the trapezoidal dam in Figure 8.17. 


The side length is |= 60 when h=0. The depth h increases from 0 to 20. The main 
problem is to find / at an in-between depth h. With straight sides the relation is 
linear: |= 60+ ch. We choose c to give |= 50 when h= 20. Then 50 = 60 + c(20) 
yields c = — 3. 

The total force is the integral of whl. So substitute |= 60 —$h: 


F = [2° wh(60— 4h) dh =| 30wh? — dwh? [2° = 12000w — 3(8000w). 


With distance in feet and w = 62 lb/ft?, F is in pounds. With distance in meters and 
w = 9800 N/m, the force is in Newtons. 

Note that (weight-density w)=(mass-density p) times (g) = (1000)(9.8). These SI 
units were chosen to make the density of water at 0°C exactly p = 1000 kg/m*. 


EXAMPLE 5 Find the work to pump water out of a tank. The area at depth h is A(h). 


Imagine lifting out one layer of water at a time. The layer weighs wA(h) Ah. The 
work to lift it to the top is its weight times the distance h, or whA(h) Ah. The work 
to empty the whole tank is the integral: 


W= i) whA(h) dh. (12) 
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Suppose the tank is the bottom half of a sphere of radius R. The cross-sectional area 
at depth h is A = n(R? ~ h). Then the work is the integral (12) from 0 to R. It equals 


W = nwR4/4. 


Units: w = force/(distance)* times R* = (distance)* gives work W = (force}(distance). 


8.6 EXERCISES 


Read-through questions 


Work equals __a__sitimes _b__. For a spring the force is 
F=__c_, proportional to the extension x (thisis__d__Jaw). 


With this variable force, the work in stretching from 0 to x is 


W={_e =_t__. This equals the increase in the __g 
energy V. Thus W is a__h __ integral and V is the corre- 
sponding __| integral, which includes an arbitrary jj - 


The derivative dV/dx equals _k . The force of gravity is 
F=_ 1 and the potential ts V=__m 


In falling, V is converted to__n energy K=__o_. The 


total energy K+Vis__p_ (this is the law of _q when 
there is no external force). 


Pressure is force per untt + . Water of density w in 
a pool of depth A and area A exerts a downward force 
F=_ son the base. The pressure ts p=_+t_. On the 
sides the __u___ is still wh at depth hk, so the total) force is 
j whi dh, where [is__v__. In a cubic pool of side s, the force 
on the base is F =_w_, the length around the sides is 
!=__ x __,and the total force on the four sidesis F= _y__. 
The work to pump the water out of the pool is 


W=|whddh=_ 2 


1 (a) Find the work W when a constant force F = 12 pounds 
moves an object from x = .9 feet to x = 1.1 feet. 


(b) Compute W by integration when the force F = 12/x? 
varies with x. 


2 A 12-inch spring is stretched to 15 tnches by a force of 75 
pounds. 
(a) What is the spring constant k in pounds per foot? 
(b) Find the work done in stretching the spring. 
{c) Find the work to stretch it 3 more inches. 


3 A shock-absorber is compressed { inch by a weight of [ 
ton. Find its spring constant & in pounds per foot. What 
potential energy is stored in the shock-absorber? 


4 A force F = 20x — x? stretches a nonlinear spring by x. 
(a} What work is required to stretch it from x=0 to 
x=2? 

{b) What is its potential energy V at x = 2, if V(0) = 5? 
(c) What is k=dF/dx for a small additional stretch at 
x=2? 


§ (a) A 120-lb person makes a scale go down x inches. How 
much work is done? 
(b) If the same person goes x inches down the stairs, how 
much potential energy is lost? 


6 A rocket burns its 100 kg of fuel at a steady rate to reach 
a height of 25 km. 


(a) Find the weight of fuel left at height 4. 
(b) How much work is done lifting fuel? 


7 Integrate to find the work in winding up a hanging cable 
of length 100 feet and weight density 5 lb/ft. How much addi- 
tional work is caused by a 200-pound weight hanging at the 
end of the cable? 


& The great pyramid (height 500’—you can see it from 
Cairo) has a square base 800’ by 800’. Find the area A at 
height h, If the rock weighs w = 100 Ib/ft?, approximately how 
much work did it take to lift all the rock? 


9 The force of gravity on a mass m is F = —GMm/x?. With 
G=6-+10 '7 and Earth mass M =6: 1074 and rocket mass 
m= 1000, compute the work to lift the rocket from x = 6400 
to x = 6500. (The units are kgs and kms and Newtons, giving 
work in Newton-kms.) 


10 The approximate work to lift a 30-pound suitcase 20 feet 
is 600 foot-pounds, The exact work is the change in the poten- 
tial V= —GmM/x. Show that AV is 600 times a correction 
factor R*/(R* — 107), when x changes from R — 10 to R + 10. 
(This factor is practically 1, when R = radius of the Earth.) 


11 Find the work to lift the rocket in Problem 9 from 
x =6400 out to x=oc. If this work equals the original 
kinetic energy }mv?, what was the original p (the escape 
velocity)? 


12 The kinetic energy $mv? of a rocket is converted into 
potential energy — GMm/x. Starting from the Earth’s radius 
x = R, what x docs the rocket reach? If it reaches x = «© show 
that v =./2GM/R. This escape velocity is 25,000 miles per 
hour. 


13 It takes 20 foot-pounds of work to stretch a spring 2 feet. 
How much work to stretch it one more foot? 


14 A barrel full of beer is 4 feet high with a { foot radius and 


an opening at the bottom. How much potential energy is lost 
by the beer as it comes out of the barrel? 
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15 A rectangular dam js 40 feet high and 60 feet wide. Com- 
pute the total side force F on the dam when (a) the water is 
at the top (b) the water level is halfway up. 


16 A triangular dam has an 80-meter base at a depth of 30 
meters. If water covers the triangle, find 


(a) the pressure at depth & 
(b) the length ? of the dam at depth A 
{c} the total force on the dam. 


17 A cylinder of depth H and cross-sectional area A stands 
full of water (density w), (a} Compute the work W = { wAh dh 
to lift all the water to the top. (b) Check the units of W. 
(c) What is the work W if the cylinder is only half full? 


18 In Problem 17, compute W in both cases if H = 20 feet, 
w = 62 Ib/ft?, and the base is a circle of radius r = 5 feet. 


19 How much work is required to pump out a swimming 
pool, if the area of the base is 800 square feet, the water ts 4 
feet deep, and the top is one foot above the water level? 


20 For a cone-shaped tank the cross-sectional area increases 
with depth: A = zr*h?/H?. Show that the work to empty it is 
half the work for a cylinder with the same height and base. 
What is the ratio of volumes of water? 


21 In relativity the mass is m = mo/,/ 1 — v?/c?. Find the cor- 
rection factor in Newton’s equation F = moa to give Einstein’s 
equation F = d(mv)/dt = (d(mv)/dv)(dv/dt) = Moa. 


22 Estimate the depth of the Titanic, the pressure at that 
depth, and the force on a cabin door. Why doesn’t every door 
collapse at the bottom of the AUlantic Ocean? 


23 A swimming pool is 4 meters wide, 10 meters long, and 2 
meters deep. Find the weight of the water and the total force 
on the bottom. 


24 If the pool in Problem 23 has a shallow end only one 
meter deep, what fraction of the water is saved? Draw a cross- 
section (a trapezoid) and show the direction of force on the 
sides and the sloping bottom. 


25 In what ways is work like a definite integral and energy 
like an indefinite integral? Their derivative is the 
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CHAPTER 9 


Polar Coordinates and Complex 
Numbers 


Eee §=86©69.1 Polar Coordinates ER! 


Up to now, points have been located by their x and y coordinates. But if you were 
a flight controller, and a plane appeared on the screen, you would not give its 
position that way. Instead of x and y, you would read off the direction of the plane 
and its distance. The direction is given by an angle 6. The distance is given by a 
positive number r. Those are the polar coordinates of the point, where x and y ate 
the rectangular coordinates. 

The angle @ is measured from the horizontal. Suppose the distance is 2 and the 
direction is 30° or 2/6 (degrees preferred by flight controllers, radians by mathemati- 
cians). A pilot looking along the x axis would give the plane’s direction as “11 
o'clock.” This totally destroys our system of units, by measuring direction in hours. 
But the angle and the distance locate the plane. 

How far to a landing strip at r = 1 and @ = — x/2? For that question polar coordi- 
nates are not good. They are perfect for distance from the origin (which equals r), 
but for most other distances I would switch to x and y. It is extremely simple to 
determine x and y from r and @, and we will do it constantly. The most used formulas 
in this chapter come from Figure 9.1—where the right triangle has angle @ and 
hypotenuse r. The sides of that triangle are x and y- 


x=rcos 0 and y=r sin 6. (1} 


The point at r=2, @=2/6 has x =2cos(x/6) and y= 2sin(x/6). The cosine of 
7/6 is /3/2 and the sine is 3. So x= J/3 and y= 1. Polar coordinates convert 
easily to xp coordinates—now we go the other way. 

Always x? + y?=r?, In this example (/3)? + (1)? =(2)?. Pythagoras produces 
r from x and y. The direction @ is also available, but the formula is not so beautiful: 


r= /x+y? and tang = and (almost) @=tan™*?. (2) 


348 Our point has y/x = 1//3. One angle with this tangent is @=tan~! (1/,/3) = n/6. 
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y=rsin® 


8 =-45° =- 1/4 


r= V2 \B =(1,-1) 


Fig. 9.4 Polar coordinates r, # and rectangular coordinates x =r cos #, y=r sin 8. 


EXAMPLE 4 Point B in Figure 9.1c is at a negative angle 8= —n/4. The x coordinate 
rcos(—1/4) is the same as rcos 2/4 (the cosine is even). But the y coordinate 
rsin(— 2/4) is negative. Computing r and @ from x=1 and y=1, the distance is 
r=, /1+1 and tan@is —1/1. 


Warning Yo any angle @ we can add or subtract 2x—-which goes a full 360° circle 
and keeps the same direction. Thus — 2/4 or — 45° is the same angle as 72/4 or 315°. 
So is 152/4 or 675°. 

If we add or subtract 180°, the tangent doesn’t change. The point (1, —1) is on the 
~ 45° line at r= /2. The point (—1, 1) is on the 135° line also with r= Jos Both 
have tan 6 = — 1, We had to write “almost” in equation (2), because a point has many 
8s and two points have the same r and tan @. 

Even worse, we could say that B=(1, —1} is on the 135° line but at a negative 
distance r= — ,/2. A negative r carries the point backward along the 135° line, which 
is forward to B. In giving the position of B, I would always keep r > 0. But in drawing 
the graph of a polar equation, r < 0 is allowed. We move now to those graphs. 


THE CIRCLE r= cos 2 


The basis for Chapters 1-8 was y=/(x). The key to this chapter is r = F(@). That is 
a relation between the polar coordinates, and the points satisfying an equation like 
r=cos 8 produce a polar graph. 

It is not obvious why r=cos@ gives a circle, The equations r=cos 2@ and 
r=cos*@ and r= | + cos @ produce entirely different graphs—not circles. The direct 
approach is to take @ = 0°, 30°, 60°, ... and go out the distance r = cos @ on each ray. 
The points are marked in Figure 9.2a, and connected into a curve. It seems to be a 
circle of radius 3, with its center at the point (}, 0). We have to be able to show 
mathematically that r = cos @ represents a shifted circle, 


One point must be mentioned. The angles from Q to x give the whole circle. The 
number r= cos @ becomes negative after 7/2, and we go backwards along each ray. 


[9 


Fig. 9.2 The circle r= cos @ and the switch to x and y. The circle r = sin 0. 
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At @=2 (to the left of the origin) the cosine is —1. Going backwards brings us to 
the same point as = 0 and r= + 1— which completes the circle. 

When @ continues from x to 2% we go around again. The polar equation gives the 
circle twice. (Or more times, when @ continues past 27.) If you don’t like negative r’s 
and multiple circles, restrict @ to the range from — 7/2 to 2/2. We still have to see 
why the graph of r= cos @ is a circle. 


Methed1 Multiply by r and convert to rectangular coordinates x and y: 
r=cos9 => r=rcos@ => x?+y? =x. (3) 


This is a circle because of x? + y*. From rewriting as (x — 4)? + y? = (4)? we recognize 
its center and radius. Center at x = 5 and y = 0; radius 3. Done. 


Method2 Write x and y separately as functions of 6. Then @ is a “parameter”: 


x=rcos@=cos*@ and y=rsin#=sin Ocos@. (4) 


These are not pelar equations but parametric equations. The parameter @ is the angle, 
but it could be the time—the curve would be the same. Chapter 12 studies parametric 
equations in detaii—here we stay with the circle. 

To find the circle, square x and y and add. This produces x? + y? = xin Problem 26. 
But here we do something new: Start with the circle and find equation (4), In case you 
don’t reach Chapter 12, the idea is this. Add the vectors OC to the center and CP 
out the radius: 


The point P in Figure 9.2 has (x, y)}= OC + CP = (4, 0)+ & cos ¢, } sin 2). 


The parameter t is the angle at the center of the circle. The equations are 
x=4$+4cos¢ and y=}sint. A trigonometric person sees a double angle and sets 
t = 28. The result is equation (4) for the circle: 


x= 3+40cos20=cos?@ and y=4sin 26=sin 4 cos 0. (5) 


This step rediscovers a basic theorem of geometry: The angle ¢ at the center is twice 
the angle @ at the circumference. End of quick introduction to parameters. 


A second circle is r = sin 8, drawn in Figure 9.2c. A third circle is r = cos @ + sin 6, 
not drawn. Problem 27 asks you to find its xy equation and its radius. All calculations 
go back to x =r cos @ and y=r sin 6—the basic facts of polar coordinates! The last 
exercise shows a parametric equation with beautiful graphs, because it may be pos- 
sible to draw them now. Then the next section concentrates on r = F(@)—and goes 
far beyond circles. 


9.4 EXERCISES 


Read-through questions 


Polar coordinates r and @ correspond tox =__a__ and y= 
b__. The points with r>Q and d=n are located _ 
The points with r=1 and O<@<z are located _g_. Re- 


yersing the sign of @ moves the point (x, y) to ___e 


Given x and y, the polar distance isr =__f _. The tangent 
of @is__q__. The point (6,8) hasr=_ nm and@=__i 
Another point with the same 0 is _j _. Another point with 
the sameris__k__. Another point with the same r and tan 0 


1$ i 


The polar equation r = cos 6 produces a shifted _m__. The 
top point is at @=__n_, which gives r=__o__. When @ 
goes from 0 to 27, we go _p__ times around the graph. 
Rewriting as r? =r cos @ leads to the xyequation __q __. Sub- 
stituting r =cos 0 into x =r cos 6 yields x=__r__and simi- 
larly y=__s __. In this form x and y are functions of the 

t_ 98. 


Find the polar coordinates r > 0 and 0 < @ < 2x of these points. 


1 (x, y}=(0, 1} 2 (x, y) =(—4, 9} 
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3 & W=6/2, /2) 


§ (x, y=(C1, -1) 


4 (x, =(-1, /3) 

6 (x, y)=(3, 4) 

Find rectangular coordinates (x, ») from polar coordinates. 
7 (, €)=(2, x/2) 8 {r, 9} =(1, 3x/2) 
9, 4= (/20, 7/4) 10 (r, 8) =(3x, 3x) 

11 (@, #)=(2, —2/6) 12 (r, 8) =(2, 52/6) 

13 What is the distance from {x, y}= (/3, 1) to (1, —f3) 

14 How far is the point r = 3, 8= 2/2 from r= 4, 0 =x? 


15 How far is (x, y) =(r cos 6, r sin &) from (X, Y) =(R cos ¢, 
R sin )? Simplify (x — X)? +(y— Y)? by using cos(@ — ¢) = 
cos 6 cos ¢ + sin 6 sin ¢. 
16 Find a second set of polar coordinates (a different r or 8) 
for the points 
(r, é) = (-l, n/2), (- 1, 37/4), (1, —n/2), (0, 0). 

17 Using polar coordinates describe (a) the half-plane x > 0; 
(b) the half-plane y <0; {c) the ring with x? + y? between 4 
and 5; (d) the wedge x > |yl. 
18 True or false, with a reason or an example: 

{a) Changing to —r and —@ produces the same point. 


{b) Each point has only one r and 6, when ¢ <0 is not 
allowed. 


(c) The graph of r = f/sin 6 is a straight line. 
19 From x and 6 find y and r. 


20 Which other point has the same r and tan 9 as x= ./3, 
y=! in Figure 9, 1b? 


21 Convert from rectangular to polar equations: 
(ajy=x  (b)xt+y=1 (x? +y?=x4y 


22 Show that the triangle with vertices at (0, 0), (r,, @,), and 
(rz, 6.) has area A= 47,r, sin(6, —6,). Find the base and 
height assuming 0< 6, <8, <n. 


Probiems 23-28 are about polar equations that give circles. 
23 Convert r=sin @ into an xy equation. Multiply first by r. 
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24 Graphr =sin 6 at @ =0°, 30°, 60°, ..., 360°. These thirteen 
values of ? gtve different points on the graph. What 
range of @’s goes once around the circle? 


25 Substitute r =sin @ into x =r cos 8 and y=rsin @ to find 
x and y in terms of the parameter 6. Then compute x? + y? 
to reach the xy equation. 


26 From the parametric equations x=cos7@ and y= 
sin@cos @ in (4), recover the xy equation. Square, add, 
eliminate @. 


27 (a} Multiply r=cos 6+ sin @ by r to convert into an xy 
equation, (b) Rewrite the equation as (x — 4)? +(y —4)? = R? 
to find the radius R. (c) Draw the graph. 


28 Find the radius of r = a cos @ + b sin 6. (Multiply by r.) 


29 Convert x + y=1 into an ré equation and solve for r. 
Then substitute this r into x =r cos 6 and y=r sin @ to find 
parametric equations for the line. 


30 The equations x=cos?@ and y=sin’@ also lead to 
x+y=1—but they are different from the answer to 
Problem 29. Explanation: 6 is no longer the polar angle and 
we should have written ¢. Find a point x =cos”@, y = sin? 
that is not at the angle @, 


31 Convert r=cos’@ into an xy equation (of sixth degree!) 


32 If you have a graphics package for parametric curves, 
graph some fhypocyeloids. The equations are x= 
(1 — 4) cos t+ b cos(1 — b)t/b, y = (1 — b)sin ¢ — b sin(1 — b)t/b. 
The figure shows 5 = 7; and part of 5 = .31831. 


The most important equation in polar coordinates, by far, is r= 1. The angle @ does 
not even appear. The equation looks too easy, but that is the point! The graph is a 
circle around the origin (the unit circle). Compare with the line x = 1. More important, 


compare the simplicity of r= 1 with the complexity of y= + 


./1—x?. Circles are so 


common in applications that they created the need for polar coordinates. 
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This section studies polar curves r = F(@). The cardioid is a sentimental favorite— 
maybe parabolas are more practical. The cardioid is r= 1+ cos 6, the parabola is 
r= 1/1 + cos 6}. Section 12.2 adds cycloids and astroids. A graphics package can 
draw them and so can we. 

Together with the circles r = constant go the straight lines @ = constant. The equa- 
tion @ = 2/4 is a ray out from the origin, at that fixed angle. If we allow r <0, as we 
do in drawing graphs, the one-directional ray changes to a full line. Important: The 
circles are perpendicular to the rays. We have “orthogonal coordinates” —more inter- 
esting than the x — » grid of perpendicular lines. In principle x could be mixed with 
0 (non-orthogonal), but in practice that never happens. 

Other curves are attractive in polar coordinates—we look first at five exampies. 
Sometimes we switch back to x =rcos @ and y=rsin 6, to recognize the graph. 


EXAMPLE 4 The graph of r = 1/cos @ is the straight line x = | (because r cos @ = 1). 


EXAMPLE 2 The graph of r= cos 26 is the four-petaf flower in Figure 9.3. 


The points at @ = 30° and — 30° and 150° and —150° are marked on the flower. They 
all have r=cos 20=4. There are three important symmetries—across the x axis, 
across the y axis, and through the origin. This four-petal curve has them all. So does 
the vertical flower r = sin 26—but surprisingly, the tests it passes are different. 


(Across the x axis: y to —y) There are two ways to cross. First, change 8 to — @. 
The equation r = cos 2@ stays the same. Second, change @ to zx — @ and also r to —r. 
The equation r = sin 20 stays the same. Both flowers have x axis symmetry. 


(Across the y axis: x to —x) There are two ways to cross. First, change @ to x — 0. 
The equation r= cos 28 stays the same. Second, change @ to —@ andrto —r. Now 
r= sin 26 stays the same (the sine is odd), Both curves have y axis symmetry. 


(Through the origin) Now we change r to —r er 0 to 8+ 2x. The flower equations 
pass the second test only: cos 2(@+ 2)=cos 28 and sin 2(¢+x)=sin 20. Every 
equation r? = F(6) passes the first test, since (—r)? = r?. 


The circle r= cos @ has x axis symmetry, but not y or r. The spiral r= 0° has 
y axis symmetry, because — r = (— 6)> is the same equation. 


Question What happens if you change r to —r and also change @ to + x? 
Answer Nothing—because {r, 6) and (—r, 9+ 2) are always the same point. 


Ag.9.3 The four-petal flower r= cos 28 and the spiral r = @ (r > 0 in red). 


HOO 


r=1+—teos 6 r=1+ cos r=1+cos6 r= 1+ cos 0 


3 


no dimple dimple cardiwid inner loop 


9.2 Polar Equations and Graphs 


EXAMPLE 3 The graph of r= @ is a spiral of Archimedes—or maybe two spirals. 


The spiral adds new points as @ increases past 2x. Our other examples are 
“periodic” —@ = 2n gives the same point as 4 = 0. A periodic curve repeats itself. The 
spiral moves out by 2z each time it comes around. If we allow negative angles and 
negative r = @, a second spiral appcars. 


EXAMPLE 4 The graph of r= 1+ cos @ is a cardioid. It is drawn in Figure 9.4c. 


The cardioid has no simple xy equation. Still the curve is very attractive. It has a 
cusp at the origin and it is heart-shaped (hence its name). To draw it, plot r= 1 + cos 0 
at 30° intervals and connect the points. For this curve r is never negative, since cos 0 
never goes below —1. 

It is a curious fact that the electrical vector in your heart almost traces out a 
cardioid. See Section 11.1 about electrocardiograms. If it is a perfect cardioid you are 
in a little trouble. 
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Fig. 9.4 Limacons r = 1 + bcos f), including a cardioid and Mars seen from Earth. 


EXAMPLE 5 The graph of r=1+ bcos @ is a dimagon {a cardioid when b= 1). 


Limagon {soft ¢) is a French word for snail—not so well known as escargot but just 
as inedible. (J am only referring to the shell. Excusez-moi!) Figure 9.4 shows how a 
dimple appears as b increases. Then an inner loop appears beyond b = 1 (the cardioid 
at b= 1 is giving birth to a loop). For large 6 the curve looks more like two circles. 
The limiting case is a doubie circle, when the inner loop is the same as the outer 
loop. Remember that r = cos 0 goes around the circle twice. 

We could magnify the limacgon by a factor c, changing to r=c(l + bcos 0}. We 
could rotate 180° to r= 1 — bh cos @. But the real interest is whether these Agures arise 
in applications, and Donald Saari showed me a nice example. 


Mars secn from Earth The Earth gocs around the Sun and so docs Mars. Roughly 
speaking Mars is 13 times as far out, and completes its orbit in two Earth years. 

We take the orbits as circles: r = 2 for Earth and r = 3 for Mars. Those equations 
teli where but not when. With time as a parameter, the coordinates of Earth and Mars 
are piven at every instant f: 


Xp = 2 cos 2nt, yp =2 sin 2nt and Xy = 3 cos Ar, Vy = 3 sin Rt. 
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At t= 1 year, the Earth completes a circle (angle = 2x) and Mars is halfway. 
Now the key step. Subtract to find the position of Mars relative to Earth: 


Xm_—p = 3 cos mt ~ 2 cos 2nt and Ya—-E= 3 Sin xt ~—- 2 sin 2x1. 
Replacing cos 2nt by 2 cos?nt — 1 and sin 2xt by 2 sin nt cos xt, this is 
Xm-E=(3—4 cos atkosnt+2 and Yu-E=(3—4 cos at)sin xt. 


Seen from the Earth, Mars does a loop in the sky! There are two ?’s for which 
3—4cos at =0 (or cos xt = 3). At both times, Mars is two units from Earth (xy_- = 
2 and yy4--¢ = 0). When we move the origin to that point, the 2 is subtracted away— 
the M— E coordinates become x = r cos xt and y =r sin xt with r = 3 — 4 cos xt. That 
is a limacon with a loop, like Figure 9.4d. 


Note added in proof 1 didn’t realize that a 3-to-2 ratio is also responsible for heating 
up two spots on opposite sides of Mercury. From the newspaper of June 13, 1990: 


“Astronomers today reported the first observations showing that Mercury 
has two extremely hot spots. That is because Mercury, the planet closest to the 
Sun, turns on Its axis once every 59.6 days, which is a day on Mercury. It goes 
around the sun every 88 days, a Mercurian year. With this 3-to-2 ratio between 
spin and revolution, the Sum appears to step in the sky and move backward, 
describing a loop over each of the hot spots.” 


CONIC SECTIONS IN POLAR COORDINATES 


The exercises include other polar curves, like lemniscates and 200-petal flowers. But 
get serious. The most important curves are the ellipse and parabola and hyperbola. 
In Section 3.5 their equations involyed 1, x, y, x?, xy, y*. With one focus at the origin, 
their polar equations are even better. 


9A The graph of r= A/(1 + e cos 8) is a conic section with “eccentricity” e: 


circle ife=O ellipse ifO0<e<t parabolaife=1 hyperbola if e> 1, 


EXAMPLE 6 (e= 1} The graph of r=1/(1 + cos 6) is a parabola. This equation is 
r+rcos@= 1 orr=1-—x. Squaring both sides gives x? + y? = 1— 2x + x”. Cancel- 
ing x? leaves y? = 1 — 2x, the parabola in Figure 9.5b. 


The amplifying factor A blows up all curves, with no change in shape. 


EXAMPLE 7 (¢ = 2) The same steps lead from r{t + 2 cos #) = 1 to r= 1 — 2x. Squar- 
ing gives x? + y?>=1—4x+ 4x? and the x? terms do not cancel. Instead we have 
y? — 3x? = 1 — 4x. This is the hyperbola in Figure 9.5c, with a focus at (0, 0). 


The hyperbola y? — 3x? = 1 (without the —4x) has its center at (0, 0). 


EXAMPLE 8 (e = 4) The same steps lead from r{1 + 4 cos 6)= 1 to r= 1 — 4x. Squar- 
ing gives the ellipse x? + y? = 1—x + 4x’. Polar equations look at conics in a new 
way, which happens to match the sun and planets perfectly. The sun at (0, 0) is not 
the center of the system, but a focus. 


Finally e = 0 gives the circle r = 1. Center of circle = both foci = (0, 0). 


9.2 Polar Equations and Graphe 


Ag.9.5 r= 1/(1 +e cos 6) is an ellipse for e=4, a parabola for e = 1, a hyperbola for e =2. 


The directrix The figure shows the line d (the “‘directrix’’) for each curve. All points 

P on the curve satisfy r = |PF| = e|Pd|. The distance to the focus is e times the distance 

to the directrix. (e is still the eccentricity, nothing to do with exponentials.) A geometer 

would start from this property r= e|Pd| and construct the curve. We derive the 
property from the equation: 

A 
r SS 
1+ecos@ 


The directrix is the line at x = A/e. That last equation is exactly |PF| = e|Pd]. 


-=> rt+ex=A => r=d2-s), (1} 


Notice how two numbers determine these curves. Here the numbers are A and e. 
In Section 3.5 they were a and b. (The ellipse was x?/a? + y?/b? = 1.) Using A and e 
we go smoothly from ellipses through parabolas (at e= 1) and on to hyperbolas. 
With three more numbers we can move the focus to any point and rotate the curve 
through any angle. Conics are determined by five numbers. 


9.2 EXERCISES 


Read-through questions 


The circle of radius 3 around the origin has polar equation 

a_. The 45° line has polar equation__b _. Those graphs 
meet atanangle of__¢__. Multiplying r = 4 cos @ by r yields 
the xy equation __d__. Its graph its a__e _ with center at 


t__. The graph of r= 4/cos 0 is the line x=__g__. The 


equation r? = cos 26 is not changed when 8 > — @ (symmetric 
across__h )and when8+2+@(orr—_ i). The graph 
ofr=1l+cos@isa_ |. 


The graph of r= A/(_k_} is a conic section with one 
focus at __1__. It is an ellipse if __m__ and a hyperbola if 

n__. The equationr = 1/(1 + cos 8) leads tor + x =1 which 
givesa__o_. Then r = distance from origin equals |! —x= 
distance from __p_. The equations r= {!—x} and r= 
4(1—x)representsa__q andan__-r_. Including a shift 
and rotation, conics are determined by __s__ numbers. 


Convert to xy coordinates to draw and identify these curves. 
i1rsin@=1 2 Ncos @—sin #)=2 
3 r=2 cos 0 4r=—2sin6 


5 r=1}(2+cos 8) 6 r=1/(14+2 cos 6) 


In 7-14 sketch the curve aad check for x, y, and y symmetry. 


7 r>=4 cos 26 (lemniscate) 
g r? =4 sin 26 (lemniscate} 
9 r=cos 38 (three petals) 


10 r?= 10+ 6cos 40 


ll r=e° (logarithmic spiral) 
12 r=1/0 (hyperbolic sptral) 
13 r=tan@ 


14 r=1—2 sin 38 (rose inside rose) 


15 Convert r =6 sin # + 8 cos @ to the xy equation of a circle 
(what radius, what center?)}, 


*16 Squaring and adding in the Mars—Earth equation gives 
xh-e+¥e-e=13—12cosm, The graph of r?=13- 
12 cos 6 is not at ali like Figure 9.4d. What went wrong? 


In 17-23 find the points where the two curves meet. 


17 r=2cos@ and r=1+cos0 

Warning: You might set 2 cos @ = [ + cos @ to find cos @ = t. 
But the graphs have another meeting point—they reach it at 
different @’s. Draw graphs to find all meeting points. 
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18 r? =sin 20 and r* = cos 26 

19 r=1+ cos 6 and r=1—sin @ 

20 r=1+ cos @ and r=1—-—cos@ 

21 r=2 and r=4 sin 20 

22 r?=4 cos ? and r= 1—cos @ 

23 rsin @ = 1 and rcos (6 — /4) = \/2 (straight lines) 


24 When is there a dimple in r=1+6cos@ From x= 
(1 + bcos O)cos @ find dx/d@ and d*x/d0? at @= x. When that 
second derivative is negative the limacon has a dimple. 


25 How many petals for r = cos 59? For r=cos @ there was 
one, for r = cos 26 there were four. 


26 Explain why r = cos 1008 has 200 petals but r=cos 1010 
only has 101, The other 101 petals are . What about 
r=cos 47? 


27 Find an xy equation for the cardioid r = I + cos 0. 


28 {a} The flower r =cos 26 is symmetric across the x and y 
axes. Does that make it symmetric about the origin? (Do 
two symmetries imply the third, so —r = cos 24 produces 
the same curve?) 


(b}) How can r=1, @=2/2 lie on the curve but fail to 
satisfy the equation? 
29 Find an xy equation for the flower x = cos 28. 


30 Find equations for curves with these properties: 
(a) Symmetric about the origin but not the x axis 
(b) Symmetric across the 45° line but not symmetric in x 
oryorr 


(c) Symmetric in x and y and + (like the flower} but 
changed when x+ y (not symmetric across the 45° line). 


Problems 31~37 are about conic sections—especially ellipses. 


31 Find the top point of the ellipse in Figure 9,5a, by maxi- 
mizing y =r sin @ =sin 0/(1 +4 cos 8). 


9.3 Slope, Length, and Area for Polar Curves 


32 (a) Show that all conics r=1/{1+ecos go through 
x=O0,y=1. 
(b) Find the second focus of the ellipse and hyperbola, For 
the parabola (e = t} where is the second focus? 


33 The point @ in Figure 9.5c has y = 1. By symmeiry find x 
and then r (negative!). Check that x? + y? =r? and |QF|= 
2I@d\. 


34 The equations r= A/(1 + ecos 6) and r=1/(C + Dcos 0) 
are the same if C= and D= . For the 
mirror image across the y axis replace @ by . This 
gives r={/((C —Dcos@) as in Figure 12.t0 for a planet 
around the sun. 


35 The ellipse r = A/(1 + e cos 8) has length 2a on the x axis. 
Add r at =O to r at 0=72 to prove that e= A/(1 —e). The 
Earth’s orbit has @ = 92,600,000 miles = one astronomical 
unit (AU). 

36 The maximum height 4 occurs when y =r sin = A sin 6/ 
(1 + ecos 6) has dy/d@ = 0. Show that 5 = y,,,, = A/,/1 —e?. 
37_Combine a and 4 from Problems 35-36 to find ¢= 
(a? — b? = Aef(1 — e*). Then the eccentricity e is c/a, Halley’s 
comet is an ellipse with a= 18.1 AU and b=46AU so 
e= . 


Comets have large eccentricity, planets have much smaller e: 
Mercury .21, Venus .01, Earth .02, Mars .09, Jupiter .05, Sat- 
urn .05, Uranus .05, Neptune .01, Pluto .25, Kohoutek .9999. 


38 If you have a computer with software to do polar graphs, 
start with these: 
1. Flowers r= A+cos nO for n=4, 3,7, 8; A=O, 1,2 


2. Petals r=(cos m0 +4 cos nO)/cos @, (m, n) =(5, 3), (3, 5), 

(9, t}, (2, 3) 

3. Logarithmic spiral r =e 

4, Nephroid r= 1+2sin 30 from the bottom of a teacup 

5. Dr. Fay’s butterfly 7 = e°*° — 2 cos 40 + sin*(0/12) 
Then create and name your own curve. 


of2a 


The previous sections introduced polar coordinates and polar equations and polar 
graphs. There was no calculus! We now tackle the problems of area (integral calculus) 
and stope (differential calculus), when the equation is r = F(@). The use of F instead 
of f is a reminder that the slope is not dF/d@ and the area is not | F(6)d6. 

Start with area. The region is always divided into small pieces—what is their 
shape? Look between the angles @ and @ + A@ in Figure 9.6a. Inside the curve is a 
narrow wedge—almost a triangle, with A@ as its small angle. If the radius is constant 


9.3 Slope, Length, and Area for Polar Curves 


r=cos 6 


area 


Fig. 9.6 Area of a wedge and a circle and an intersection of circles. 


the wedge is a sector of a circle. It is a piece of pie cut at the extremely narrow angle 
A@. The area of that piece is a fraction (the angle A@ divided by the whole angle 27) 
of the whole area zr? of the circle: 


0 
area of wedge = ad tr? = 2A0= 5 CFOPA0, (1) 


2n 2 
We admit that the exact shape is not circular. The true radius F(@) varies with @— 


but in a narrow angle that variation is small. When we add up the wedges and let 
A@ approach zero, the area becomes an integral. 


9B The area inside the polar curve r = F(@) is the limit of S4r2A0 = Y4F2A0: 


aa | ; rd0 = | 5 CRO? 40. 


EXAMPLE 4 Find the area inside the circle r = cos 0 of radius } (Figure 9.6). 


sa | Osind+0@)?" 2 
area= [costo qg == Oem O70)" _ =. 


That is wrong! The correct area of a circle of radius 4 is 2/4. The mistake is that we 
went twice around the circle as @ increased to 27. Integrating from 0 to z gives 7/4. 


EXAMPLE 2 Find the area between the circles r= cos 6 and r= }4. 


The circles cross at the points where r = cos @ agrees with r = }. Figure 9.6 shows these 
points at +60°, or @= +7/3. Those are the limits of integration, where cos 0 = 4. The 
integral adds up the difference between two wedges, one out to r= cos 6 and a smaller 


one with r=}4: 
n/3 1 9 " 1 2 40 
area = ii ~ 5 (os | (5) | ; (3) 


Note that chopped wedges have area }(F ? — F3)A0 and not }(F , — F,)°A0. 
EXAMPLE 3 Find the area between the cardioid r= 1 + cos 0 and the circle r= 1. 


n/2 
area -| st + cos 0)? — 17]d0 (imi d=+ 5 where 1+cos@= 7 
—n/2 


> 


| cos? @— (2) 
3 (cos 9-(5) )A® 
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SLOPE OF A POLAR CURVE 


Where is the highest point on the cardioid r = 1 + cos 6? What is the slope at #= 
n/4? Those are not the most important questions in calculus, but stij] we should 
know how to answer them. I will describe the method quickly, hy switching to 
rectangular coordinates: 


x=rcos 8=(1+ cos 8os 6 and y=rsin @=({1+ cos Opin 6. 
For the highest point, maximize y by setting its derivative to zero: 
dy/d@ = (1 + cos 6){cos 6) + (—sin 8)(sin 6) = 0. (3) 
Thus cos @ + cos 28= 0, which happens at 60°. The height is y=(1+ 4)(,/3/2). 
For the slope, use the chain rule dy/d6 = (dy/dx)(dx/d@): 


ax dx/d8 ~ (1+ cos 8)(—sin 6) + (—sin 4) cos 6 (4) 


Equations (3) and (4) avoid the awkward (or impossible) step of eliminating 8. Instead 
of trying to find y as a function of x, we keep x and y as functions of 8. At 9= 7/4, 
the ratio in equation (4) yields dy/dx = — 1/(1+ «/ 2). 

Problem 18 finds a general formula for the slope, using dy/dx = (dy/d@)/(dx/d6). 
Problem 20 finds a more elegant formula, by looking at the question differently. 


LENGTH OF A POLAR CURVE 
The length integral always starts with ds= ./(dx)* + (dy)*. A polar curve has x = 
r cos 6 = F(6) cos 6 and y = F(6) sin 6. Now take derivatives by the product rule: 
dx = (F(8)cos # — F(8)sin 6)dé and dy = (F'(8)sin 6+ F(8)cos 66. 
Squaring and adding (note cos? + sin?) gives the element of length ds: 


ds = ./(F(8)]? + [F(@)}? 48. (5) 


The figure shows (ds)? = (dr)? + (rd@)*, the same formula with different letters. The 
total arc length is f ds. 

The area of a surface of revolution is J2zny ds {around the x axis) or j2nx ds 
(around the y axis). Write x, y, and ds in terms af 0 and dO. Then integrate. 


EXAMPLE 4 The circle r = cos @ has ds = fl d6. So its length is x (not 2x!!—don't 
go around twice). Revolyed around the y axis the circle yields a doughnut with no 
hole. Since x = 7 cos @ = cos?@, the surface area of the doughnut is 


[ax ds= | 2n cos?@ dO = n?. 


0 


EXAMPLE 5 The length of r= 1+ cos @ is, by symmetry, double the integral from 
0 to x: 


length of cardioid = 2] (—sin 9)? + (1+ cos 6)? dé 
Q 


n 5 é 
=2{ VIF F008 6 d= 4) cos 5 d= 8. 
6 0 


{ds)* = (dr)? + (rd6y 
Fig. 9.7 
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We substituted 4 cos? 4@ for 2+2cos @ in the square root. It is possible to skip 
symmetry and integrate from 0 to 2x—but that needs the absolute value |cos $8] to 


maintain a positive square root. 


EXAMPLE 6 The logarithmic spiral r= e~° has ds = ./e~?° + e~ 8 dé, It spirals to 
zero as @ goes to infinity, and the total length is finite: 


fds=[% /2e-* do= ~/2e~* |® = ./2. 


Revolve this spiral for a mathematical seashell with atea [°(2me ~* cos 6),/2e~°d8. 


9.3. EXERCISES 


Read-through exercises 


A circular wedge with angle A@ is a fraction__a___ of a whole 
circle. If the radius is r, the wedge area is __b__. Then the 
area inside r = F(@) ts {__¢ _. The arca inside r = 0? from 0 
tonis_d__. That spiral meets the circler=1at@=_ 6 

The area inside the circle and outside the spiralis __t .A 
chopped wedge of angle Af betweenr, andr, has area__g_. 


andy=_i_. 
The slope dy/dx is dy/d@ divided by__| _. For length (ds)? = 
(dx + (dy)? =__k _. The length of the spiral r=@10 =x 
is J_1__ (not to compute integrals), The surface area when 
r= @ is revolved around the x axis is [2xyds=[_m__. The 


volume of that solid is fny?dx =f __n 


The curve r= F(@}hasx=rcos9?=_h 


In 1-6 draw the curve and find the area inside. 

1 r=1+4+cos@ 

2 r=sin 8+cos 0 from 0 to x 

3 r=2+ cos 0 

4 r=1+2 cos 0 (inner loop only) 

§ r=cos 20 (onc petal only) 

6 r=cos 38 (one petal only) 
Find the area between the curves in 7-12 after locating their 
intersections (draw them first). 

7 circle r=cos 6 and circle r=sin 0 

8 spiral r= @ and y axis (first arch) 

9 outside cardioid r = 1 + cos 0 inside circle r = 3 cos 0 
10 lemniscate r? = 4 cos 26 outside r = \/2 
11 circle r= 8 cos @ beyond line r cos 8 = 4 
12 circle r= 10 beyond line ¢ cos 6 =6 


13 Locate the mistake and find the correct area of the lemnis- 
cater? =cos 20: area = [5 4x? dé = {5 $cos 26 d@ = 0. 


14 Find the area between the two circles in Example 2. 


15 Compute the area between the cardioid and circle in 
Example 3. 


16 Find the complete area (carefully) between the spiral r = 
e° (@ > 0) and the origin. 

17 At what 6s does the cardioid x = 1 +cos @ have infinite 
slope? Which points are furthest to the left (minimum x)? 


18 Apply the chain rule dy/dx =(dyp/d@)/(dx/d@) to x= 
F(@) cos 6, y = F(6) sin & Simplify to reach 

dy __F+tan 0 dF/d0 

dx —F tan 0+dF/d0 


19 The groove in a record is nearly a spiral r = cé: 


length =f ./r? + (dr/dO)* dO = [09 ¥/r? +c? dric. 


Take c=.002 to give 636 turns between the outer radius 
14 cm and the inner radius 6 cm (14 — 6 equals .002(636}27). 
(a) Omit ¢? and just integrate r dric. 
(b} Compute the length integral. Tables and calculators 
allowed. You will never trust integrals again. 
20 Show that the angle wy between the ray from the origin 
and the tangent line has tan py = F/(dF/d0). 


Hint: If the tangent line is at an angle @ with the horizontal, 
then tan ¢ is the slope dy/dx in Problem 18, Therefore 


tan @ — tan @ 
1+tan@ tan 


tan wy = tan(¢d — A= 


Substitute for tan @ and simplify like mad. 


21 The circle r= F(#)=4sin@ has %=6. Draw a figure 
including 0, ¢, ¥ and check tan w. 


22 Draw the cardioid r = 1 —cos @, noticing the minus sign. 
Include the angles 8, ¢, y and show that ¢ = 8/2. 


23 The first limagon in Figure 9.4 looks like a circle centered 
at (4, 0). Prove that it isn’t. 


24 Find the equation of the tangent line to the circle r = cos 6 
at O= 7/6. 
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In 25-28 compute the length of the curve. 
25 r= (0 from 0 to 22) 

26 r=sec @ (0 from 0 to 2/4) 

27 r =sin*(0/3) (@ from 0 to 3x) 

28 r= (f from 0 to x) 


29 The narrow wedge in Figure 9.6 is almost a triangle. It 
was treated as a circular sector but triangles are more familiar. 
Why is the area approximately $77A0? 


30 In Example 4 revolve the circle around the x axis and find 
the surface area. We really only revelve a semicircle. 


31 Compute the seashell area 2a ./2|% ¢~® cos 0 d0 using 
two integrations by parts. 


32 Find the surface area when the cardioid r=!+ cos @ 
is revolved around the x axis, 


33 Find the surface area when the lemniscate r? =cos 26 is 
revolved around the x axis. What is @ after one petal? 
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34 When y =/(x) is revolved around the x axis, the volume 
is [ny*dx. When the circle r =cos 0 is revolved, switch to a 
0-integral from 0 to 2/2 and check the volume of a sphere. 


35 Find the volume when the cardioid r = 1 + cos 0 is rotated 
around the x axis. 


36 Find the snrface area and volume when the graph of r= 
I/cos § is rotated around the y axis (0 <0 < 2/4}. 


37 Show that the spirals r= @ and r = 1/0 are perpendicular 
when they meet at ? = 1. 


38 Draw three circles of radius ] that touch each other and 
find the area of the curved triangle between them. 


39 Draw the unit squarcO<x¥<1,0< 1. In polar coordi- 
nates its right side ts r = . Find the area from f$r7d0. 


40 (Unravel the paradox) The areu of the ellipse x = 4 cos #, 
y=3sin Cis 2°4°3 = 122. But the integral of $r7d0 is 


2r 1 1 
= {16 cos??= 9 sin?0)d@ = 12=n. 
o 2 2 


Real numbers are sufficient for most of calculus. Starting from x? + 4, its integral 
t4x3+4x+C is also real. If we are given x*— 1, its derivative 3x? is real. But the 
roots (or zeres) of those polynomials are complex numbers: 


we+4=0 and 


x7 -1=0 have complex solutions. 


We expect two square roots of — 4. There are three cube roots of 1. Complex numbers 
are unavoidable, in order to find m roots for each polynomial of degree n. 

This section explains how to work with complex numbers. You will see their 
relation to polar coordinates. At the end, we use them to solve differential equations. 

Start with the imaginary number i. Everybody knows that x?= — 1 has no reai 
solution. When you square a real number. the result is never negative. So the world 
has agreed on a Solution cailed /. (Except that electrical engineers call it j.) Imaginary 
numbers follow the normal rules of addition, subtraction, multiplication. and division. 
with one difference: Whenever i* appears it is replaced by ~1. In particular —i times 
—igives +i? = — 1. In other words. —7 is also a square root of —1], There are two 
solutions (i and —i) to the equation x? + 1=0. 

Finding cube roots of | will stretch us further. We need complex numbers—real 


plus imaginary. 


9B A complex numsber (say 1+ 3i) is the sum of a real number (1) and a pure 
imaginary number (3). Addition keeps those parts separate; multiplication uses 


i=-1: 


Addition: (1+ 3/)+ (14+ 3)=14+1+4i13+3)=2+ 63 
Multiplication: (1 + 3/)(1 + 3) = 1+ 31+ 3i+ 97? = —8 + Gi. 
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Adding 1+ 3i to 5—i is easy (6 + 2i}. Multiplying is longer, but you see the rules: 
(1+ 3i)(S5—i}=5+ 15i—i— 31? =8 + 143. 


The point is this: We don’t have to imagine any more new numbers. After accepting 
i, the rest is straightforward. A real number is just a complex number with no 
imaginary part! When 1 + 3i combines with its “‘complex conjugate” | — 3i—adding 
or multiplying—the answer is real: 


(1+3i)+(1-3/)=2 (real) 
(1+ 3i)(1 — 31) =1-31+ 31-977 =10. (real) 


The complex conjugate offers a way to do division, by making the denominator real: 


(1) 


1+3i 1+311-3) 10 xt+iy xtiyx—-iy x2+y"" 


9C The complex number x + iy has real part x and imaginary part y. Its 


complex conjugate is x ~ iy. The product (x + iy){x — iy) equals x? + y? = r?. 
The absolute pafue (or modulus) is r = |x + iyj = _/x* + py’. 


THE COMPLEX PLANE 


Complex numbers correspond to points in a plane. The number | + 3i corresponds to 
the point (1, 3). Similarly x + iy is paired with (x, y}—which is x units along the ‘‘real 
axis” and y units up the “imaginary axis.” The ordinary plane turns into the complex 
plane. The absolute value r is the same as the polar coordinate r (Figure 9.8a). 

The figure shows two more copies of the complex plane. The one in the middle is 
for addition and subtraction. It uses rectangular coordinates. The one on the right is 
for multiplication and division and squaring. It uses polar coordinates. In squaring 
a complex number, 7 is squared and @ is doubled—as the right figure and equation 
(3) both show. 


SUM» 2 + 2i 
-” x 


ad +i? 
34+i r2=2 { 


-l+i 


F ; 3-i 
imaginary axis conjugate r,8—>r?, 26 


Fig. 9.6 The complex plane shows x, y, 7, @. Add with x and y, multiply with r and @. 


Adding complex numbers is like adding vectors (Chapter 11). The real parts give 
3—1 and the imaginary parts give 1+ 1. The vector sum (2, 2) corresponds to the 
complex sum 2+ 2i. The complex conjugate 3 — i is the mirror image across the real 
axis (i reversed to —i). The connection to r and @ is the same as before (you see it in 
the triangle): 


x=rcos@ amd y=rsin@ sethat x+iy=rcos 6+ isin 8). (2) 


In the third figure, 1+ i has 7 = /2 and @=x/4. The polar fonn is J/2 cos r/4 + 
J2i sin 7/4. When this number is squared, its 45° angle becomes 90°. The square is 
(1+ i)? = 1 + 2i— 1 = 2i. Its polar form is 2 cos n/2 + 2i sin 7/2. 
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9D Multiplication adds angles, division subtracts angles, and squaring doubles 
angles. The absolute values are multiplied, divided, and squared: 


(r cos 0+ ir sin 6 =r? cos 26 + ir? sin 26. (3) 


For nth powers we reach r" and né. For square roots, r goes to a/r and @ goes to 
49. The number —1 is at 180°, so its square root i is at 90°. 
To see why.6 is doubled in equation (3), factor out r? and multiply as usual: 


(cos 6 +i sin 9)(cos @ + i sin 6) = cos*@ — sin?6 + 2i sin 4 cos 0. 


The right side is cos 20+ isin 20, The double-angle formulas from tngonometry 
match the squaring of complex numbers. The cube would be cos 3@ + i sin 3@ (because 
2@ and @ add to 38, and r is still 1). The nth power is in de Moivre’s formula: 


{cos 6+ i sin 6)" =cos n+ i sin nO. (4) 


With n= —1 we get cos(~@)+isin(—6)—which is cos@—isin 6, the compiex 
conjugate: 
1 1 cos @—isin@ cos @—isin @ 


= ee (5) 


cos @+isin@ cos @+isin @ cos @-isin 0? t 


We are almost touching Euler’s formuia, the key to ail numbers on the unit circle: 


Euler’s formula. cos#+isin@=e". (6) 


Squaring both sides gives (e!)(e) = e?"*, That is equation (3). The — 1 power is 1/e = 
e **, That is equation (5). Multiplying any e by e* produces e****), The special case 
¢ = @ gives the square, and the special case ¢ = — @ gives ee = 1. 

Euler’s formula appeared in Section 6.7, by changing x to i@ in the series for e*: 

x? x ie , FF & - 
e bree tet becomes e“=1+ 16 5 izt 
A highlight of Chapter 10 is to recognize two new series on the right. The real terms 
1-467 ++: add up to cos @. The imaginary part @—%0°+-- adds up to sin @. 
Therefore e'* equals cos + isin 0. It is fantastic that the most important periodic 
functions in all of mathematics come together in this graceful way. 

We learn from Euler (pronounced oiler) that e?*' = 1. The cosine of 27 is 1, the sine 
is zero. If you substitute x = 27i into the infinite series, somehow everything cancels 
except the i—this is almost a miracle. From the viewpoint of angles, @ = 2x carries 
us around a full circle and back to e?" = }. 

Muitiplying Euler’s formula by r, we have a third way to write a complex number: 


Every complex number is x+ip=rcos 6+ir sin é=re®, (7) 
EXAMPLE 4 2e’® times 3e’* equals 6e?. For @= 1/2, 2i times 31 is —6. 


EXAMPLE 2 Find w? and w* and w® and w?* when w= e'™’*. 
Solution e’*/* is 12 + if,/2. Note that r? =4+4= 1. Now watch angles: 
w=¢@"2=jp wr=ei*™=—-1 ws = 1 w2> = wowewew = w. 


Figure 9.9 shows the eight powers of w. They are the eighth roots of 1. 
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Fig.9.9 The eight powers of w and the cube roots of 1. 


EXAMPLE 3 (x* + 4=0) The square roots of —4 are 2i and — 2i. Instead of (ii) = 
— 1 we have (2i)(27) = — 4. If Euler insists, we write 2i and —2i as 2e"*? and 2e!3*/. 


EXAMPLE 4 (The cube roots of 1) In rectangular coordinates we have to solve 
(x + iy = 1, which is not easy. In polar coordinates this same equation is r°e*!”= 1. 
Immediately r= 1. The angle @ can be 22/3 or 4x/3 or 62/3—the cube roots in the 
figure are evenly spaced: 


(gttt)2 _ e2ri =] (ee)? =- ett =t (eonry = oot =]. 
You see why the angle 82/3 gives nothing new. It completes a full circle back to 27/3. 


The nth roots of 1 are e?*"", et", «1. There are n of them. 
They fie at angles 2n/n, 4ajn, ....2n around the unit circle. 


SOLUTION OF DIFFERENTIAL EQUATIONS 


The algebra of complex numbers is now applied to the calculus of complex functions. 
The complex number is c, the complex function is e“. It will solve the equations 
y" = —4y and y” = y, by connecting them to c? = — 4 and c* = 1. Chapter 16 does 
the same for all linear differential equations with constant coefficients—this is an 
optional previcw. 

Please memorize the one key idea: Sudstitute y =e into the differential equation 
and solve for c. Each derivative brings a factor ¢, so y’ = ce’ and y” = c*e": 


d?yide? = - 4y leads to c?e" = — 4e", which gives c? = — 4. (8) 


For this differential equation, c must be a square root of — 4. We know the candidates 
(¢ = 2i and c= — 2i). The equation has two “pure exponential solutions” e*: 


y=e" and y=e2#, (9) 


Their combinations y = Ae?" + Be ~*" give all solutions. In Chapter 16 we will choose 


the two numbers A and B to match two initial conditions at ¢ = 0. 
The solution y = e** = cos 22 + i sin 2t is complex. The differential equation is real. 
For real y’s, take the real and imaginary parts of the complex solutions: 


Yreat = COS 2f aNd Yimaginary = SiN 2¢. (10) 


These are the “‘pure oscillatory solutions.’ When y =e?" travels around the unit 
circle, its imaginary part sin 2t moves up and down. (It is like the ball and its shadow 
in Section 1.4, but twice as fast because of 2.) The real part cos 2t goes backward 
and forward. By the chain rule, the second derivative of cos 2t is —4 cos 2t. Thus 
d*y/dt? = — 4y and we have real solutions. 


363 


364 


9 Polar Coordinates and Complex Numbers 


EXAMPLE 5 Find three solutions and then three reaf solutions to d*y/dt? = y. 


Key step: Substitute y = ¢'. The result is c%e* = e*. Thus c? = 1 and ¢ is a cube root 
of 1. The candidate c= 1 gives » = ¢ (our first solution), The next c is complex: 
P | 3 ce 
c= ets = — 5 +i 2 yields y= et =e tei 3t2, (1) 
The real part of the exponent leads to the absolute value |y| = ee It decreases as t 
gets larger, so y moves toward zero. At the same time, the factor e'¥3"? goes around 
the unit circle. Therefore y spirals in to zero (Figure 9.10). So does its complex 
conjugate, which is the third exponential. Changing i to —i in (11) gives the third 
cube root of 1 and the third solution e~‘/e~‘¥3"?, 
The first real solution is y =e’. The others are the two parts of the spiral: 


Yreat = €~ !? cos J3t/2 and — imaginary = € "7 sin J 31/2. (12) 


That is r cos @ andr sin 0. It is the ultimate use (until Chapter 16) of polar coordinates 
and complex numbers. We might have discovered cos 2¢ and sin 2¢ without help, for 
y” = — 4y. I don’t think these solutions to y” = y would have been found. 


EXAMPLE 6 Find four solutions to d*y/dt* = y by substituting y = e“. 


Four derivatives lead to ct = 1. Therefore c is ior —1 or —i or 1. The solutions are 
y=e'",e-',e-", and e’. If we want real solutions, e” and e~" combine into cos t and 


ane 


sin ¢. In all cases y”” = y. 


Fig. 9.40 Solutions move in the complex plane: y" = —4y and y” = y and y"" = y. 


9.4 EXERCISES 


Read-through questions 


The complex number 3 + 4i has real part __@ _ and imagi- 
nary part_> __. Its absolute value isr=_ ¢ and its com- 
plex conjugate is__9 ___. Its position in the complex plane is 


at(_@__). ts polar form isrcos@+irsinfO=_t  e. Its 
squareis 9 +i 4. Itsnthpoweris | ei”. 
The sum of 1+iand1—/is__i__. The product of 1 +i 


and 1—jis__k _. In polar form this is ,/2e™'* times __! 


The quotient (1+ i)/{1—i) equals the imaginary number 
m__. The number (1 + i)® equals _" _. Aneighth root of 


lisw=_ 9. The other eighth roots are __P 


To solve d®y/dr® = y, look for a solution of the form ) = 
q__. Substituting and canceling e leads to the equation 


t__. There are _ § choices for c, one of which is 
(-l + df./2, With that choice |e|=_!  . The real 
solutions are Ree*=_ 4 = andIme*=_ ¥ 


In 1-6 plot each number in the complex plane. 


1 2+ and its complex conjugate 2 —i and their sum and 
product 


2 1 +i and its square (1 + i)? and its reciprocal 1/(t + i) 
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3 2¢*'® and its reciprocal 4e~*/° and their squares 
4 The sixth roots of { (six of them) 

§ cos 3x/4 + isin 3x/4 and its square and cube 

6 4e* and its square roots 


7 For complex numbers c=x+iy=re® and their con- 
jugates €=x—iy=re~™, find all possible locations in the 
complex plane of {l}c+é@ (Qe-—é (3)cé (Aec/é. 


8 Find x and y for the complex numbers x + iy at angles 
= 45°, 90°, 135° on the unit circle, Verify directly that the 
square of the first is the second and the cube of the first is the 
third. 


9 Ife =2+iand d=4 + 3i find ca and c/d, Verify that the 
absolute value |cd| equals {c} times |d|, and |c/d| equals |c| 
divided by |d]. 


10 Find a solution x to e* =i and a solution to e* = 1/e. 
Then find a second solution. 


Find the sum and product of the numbers in 11-14. 

11 e” and e~*, also e773 and e**/3 

12 €” and e’*, also e*'* and e~*/* 

13 The sixth roots of 1 (add and multiply all six) 

14 The two roots of c?—4c+5=0 

15 if c=re® is not zero, what are c* and c7! and c™“? 


16 Multiply out (cos 6 + i sin ? = e'9°, to find the real part 
cos 3@ and the imaginary part sin 3@ in terms of cos @ and 
sm 8. 

17 Plot the three cube roots of a typical number re’®, Show 
why they add to zero. One cube root is re, 


18 Prove that the four fourth roots of re® multiply to give 
id 
—re’, 


In 19-22, find all solutions of the form y =e”. 
20 y”+y=0 
22 y" + 6y'+ Sy =0 


19 y"+y=0 
21 y"—y'=0 
Construct two real solutions from the real and imaginary parts 
of ¢* (first find c): 
23 y’+49y=0 24 y" —2y' +2y=0 

Sketch the path of y= 2s ¢ increases from zero, and mark 
y=e; 

26 c=1—-i 26 c=—1I +i 27 c=ni/4 


28 What is the solution of dy/dt =iy starting from yo =I? 
For this solution, matching real parts and imaginary parts of 
dy/dt = iy gives and : 


29 In Figure 9.10b, at what time t does the spiral cross the 
real axis at the far left? What does y equal at that time? 


30 Show that cos 6 = He + e-**) and find a similar formula 
for sin @. 
31 True or false, with an example to show why: 
{a) If c, + cz is real, the c’s are complex conjugates. 
(b) Hf Jc,| =2 and |c.| = 4 then c,cz has absolute value 8. 
{c} If jc,{ =1 and jc,|={ then jc, +c,| is (at least 1) (at 
most 2} (equal to 2}. 
(d) If e* approaches zero as t + oo, then {c is negative) (the 
real part of c is negative) ( c| is less than 1). 


32 The polar form of re times Re* is . The rectan- 
gular form is Circle the terms that give 


rR cos(9 + ). 
33 The complex number 1/(re‘) has polar form and 
rectangular form and square roots 


34 Show that cos ix =cosh x and sin ix=jsinh x, What is 
the cosine of i? 
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CHAPTER 10 


Infinite Series 


Infinite series can be a pleasure (sometimes). They throw a beautiful light on sin x 
and cos x. They give famous numbers like x and e. Usually they produce totally 
unknown functions—which might be good. But on the painful side is the fact that 
an infinite series has infinitely many terms. 

It is not easy to know the sum of those terms. More than that, it is not certain 
that there is a sum. We need tests, to decide if the series converges. We also need 
ideas, to discover what the series converges to. Here are examples of convergence, 
divergence, and oscillation: 


T+ht+hte=2 tl tlt--=a0 1-1+1-1-=? 


The first series converges. Its next term is 1/8, after that is 1/16—and every step 
brings us halfway to 2. The second series (the sum of 1’s) obviously diverges to infinity. 
The oscillating example (with I's and —1’s) also fails to converge. 

All those and more are special cases of one infinite series which is absolutely the 
most important of ail: 


1 
1-x 


The geometric series is 1+ x+x?+x3+--= 


This is a series of functions. It is a “‘power series.” When we substitute numbers for 
x, the series on the left may converge to the sum on the nght. We need to know when 
it doesn’t. Choose x = 4 and x=! and x=—1: 


; ' . 1 
i+4+ (%)*+--- is the convergent series. Its sam is —; = 2 


1-3 
ett . i 1 
1+1+1+---1s divergent. Its sum is T-1 = 0 = 00 
ery : 1 1 
t+(-1)+(-1)* + -- is the oscillating series. Its sum should be i-cy => 


The last sum bounces between one and zero, so at least its average is 7. At x= 2 
there is no way that 1+2+4+8+ -- agrees with 1/(l — 2). 
366 This behavior is typical of a power series—to converge in an interval of x’s and 


10.4 The Geometric Series 


to diverge when x is large. The geometric series is safe for x between —1 and 1. 
Outside that range it diverges. 
The next example shows a repeating decimal 1.111...: 


aD F se sng gel LN ef de 
Set x= 10° @ geometric series is rT (3) (7s) 
The decimal 1.111... is also the fraction 1/(1— 7), which is 10/9. Every 
fraction leads to a repeating decimal, Every repeating decimal adds up (through the 
geometric series) to 2 fraction. 

To get 3.333..., just multiply by 3. This is 10/3. To get 1.0101..., set x = 1/100. 
This is the fraction 1/(1 — 745), which is 100/99. 


Here is an unusual decimal (which eventually repeats). I don’t really understand it: 


= 004 115.226 337 448 ... 


243 
Most numbers are not fractions (or repeating decimals). A good example is z: 
1 4 1 5 
=3+—+—+—_+——}+--, 
™=3+75* too * tooo * 10000 


This is 3.1415..., a series that certainly converges. We happen to know the first billion 
terms (the billionth is given below). Nobody knows the 2 billionth term. Compare 
that series with this one, which also equais zx: 


4 
=4--+2-2+4 
melee i ie 
That alternating series is really remarkable. It is typical of this chapter, because its 
pattern is clear. We know the 2 billionth term (it has a minus sign). This is not a 
geometric series, but in Section 10.1 it comes from a geometric series. 


Question Does this series actually converge? What if all signs are +? 
Answer The alternating series converges to x (Section 10.3). The positive series 
diverges to infinity (Section 10.2). The terms go to zero, but their sum is infinite. 


This example begins to show what the chapter is about. Part of the subject deals 
with special series, adding to 10/9 or x or e*. The other part is about series in general, 
adding to numbers or functions that nobody has heard of. The situation was the 
same for integrals—they give famous answers like In x or unknown answers like 
J x* dx. The sum of 1 + 1/8 + 1/27 + --- is also unknown—although a lot of mathema- 
ticians have tried. 

The chapter is not long, but it is full. The last half studies power series. We begin 
with a linear approximation like 1+ x. Next is a quadratic approximation like 
1+ x + x?. In the end we match all the derivatives of f(x). This is the “ Taylor series,” 
a new way to create functions—not by formulas or integrals but by infinite series. 


No example can be better than 1/(1 — x), which dominates Section 10.1. Then we 
define convergence and test for it. (Most tests are really comparisons with a geometric 
series.) The second most important series in mathematics is the exponential series 
@=1]+x+4x*+4x3+4+ --. It includes the series for sin x and cos x, because of the 
formula e* = cos x+isin x. Finally a whole range of new and old functions will 
come from Taylor series. 

In the end, all the key functions of caiculus appear as “infinite polynomials” (except 
the step function). This is the ultimate voyage from the linear function y = mx + b. 
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We begin by looking at both sides of the geometric series: 


1 


lt+xtx?txeters 
1-x 


() 


How does the series on the left produce the function on the right? How does {1 — x) 
produce the series? Add up two terms of the series, then three terms, then n terms: 


— x? 1-x? fx 
Steet e teeth 
—x 1-x Lx 


Ghee (2) 
1 

For the first, 1 + x times 1 — x equals 1 — x? by ordinary algebra. The second begins 
to make the point: 1+ x +x? times 1— x gives |—x+x—x?+x*—x?*. Between 
1 at the start and ~ x? at the end, everything cancels. The same happens in all cases: 
{+-++x""! times t — x leayes 1 at the start and —x” at the end. This proves 
equation (2)—-the sum of n terms of the series. 

For the whole series we will push n towards infinity. On a graph you can see what 
is happening. Figure 10.1 shows n= 1 and n= 2 and n=3 and n=. 


Aig. 10.4 Two terms, then three 
terms, then full series: 


1 
l+xtx?+ +. =~. 
l—x 
ltxtx?4-- 
J—x/1 
l-x 
x 
x—-X 
x? 
x? x3 


The infinite sum gives a finite 
answer, provided x is between 
—1 and 1. Then x" goes to zero: 


1- x" 1 


-l -1/2 0 1/2 2/3 } 


Now start with the function 1/(1 — x). How does it produce the series? One way is 
elementary but brutal, to do “long division” of {— x into | (next to the figure). 
Another way is to look up the binomial formula for (1 — x)~'. That is cheating—we 
want to discover the series, not just memorize it. The successful approach uses cal- 
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culus. Compute the derivatives of f(x) = 1/(1 — x): 
falesy? f"=%Al-x3 f” =6(1—x)"* ves (3) 


At x = 0 these derivatives are 1, 2, 6, 24, .... Notice how —1 from the chain rule keeps 
them positive. The nth derivative at x = 0 is n factorial: 


fO=1 f@=1 f@M=2 f"=6 -- — fM™O=nt. 
Now comes the idea. Te match the series with 1/(1 — x), match all those derivatives at 
x = 0. Each power x" gets one derivative right. Its derivatives at x = 0 are zero, except 
the nth derivative, which is n! By adding all powers we get every derivative right— 
so the geometric series matches the function: 


1+xt+x?+x3+ ++ has the same derivatives at x=0 as 1/(1 — x). 


The linear approximation is 1 + x. Then comes $f”(0)x? = x”. The third derivative 
is supposed to be 6, and x? is just what we need. Through its derivatives, the function 
produces the series. 

With that example, you have seen a part of this subject. The geometric series 
diverges if |x| 21. Otherwise it adds up to the function it comes from (when 
—1<x< 1). To get familiar with other series, we now apply algebra or calculus—to 
reach the square of 1/(1 — x) or its derivative or its integral. The point is that these 
operations are applicd to the series. 

The best I know is to show you eight operations that produce something useful. 
At the end we discover series for In 2 and z. 


1. Multiply the geometric series by a or ax: 


2 ee 2 Sa. hece 
ataxt+ax® +o = — ax tax’ tax’ +e — (4) 
L=% b= x 
The first series fits the decimal 3.333.... In that case a = 3. The geometric series for 
x =75 gave 1.111... = 10/9, and this series is just three times larger. Its sum is 10/3. 
The second series fits other decimals that arc fractions in disguise. To get 12/99, 
choose a = 12 and x = 1/100: 


12 12 12 12/100 12 

121212... =—— + —, + 4 es, 

100 100? 100° 1-1/100 99 
Problem 13 asks about .8787... and .123123.... It is usual in precalculus to writc 
a+artar? +--=a/(1—r). But we use x instead of r to emphasize that this is a 


function—which we can now diflerentiate. 


2. The derivative of the geometric series 1+ x+x?+--- is Lf(1 — x)?: 


d | | 
P+ 2x4 3x? + 4x3 +0 = — = : 
en eae 2(4) (1 — x? 0) 
At x =75 the left side starts with 1.23456789. The right side is 1/(1 — 25)? = 1/(9/10)?, 
which is 100/81. If you have a calculator, divide 100 by 81. 
The answer should aiso be near (1.11111111)?, which is 1.2345678987654321. 
3. Subtract t+x4+x7+-> from 1+ 2x + 3x? +++ as you subtract functions: 
1 l x 


d-xP (l-x) (l-x) 


Curiously, the same series comes from multiplying (5) by x. It answers a question left 
open in Section 8.4— the average number of coin tosses until] the result is heads. This 


xt2x*4 3x2 4% 


(6) 
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is the sum 1(p,} + 2(p,)+ --- from probability, with x = 4: 


(9) + 2) + 3G)? + = (7) 


me 
(1-4) 

The probability of waiting until tbe nth toss is p, = (3)’. The expected value is two 
tosses. 1 suggested experiments, but now tbis mean value is exact. 


4. Multiply series: the geometric series times itself is 1/(1 — x) squared: 
(lta xtx? tlt xtx?t abt ixt 3x72 +e. (8) 


The series on the right is not new! In equation (5) it was the derivative of y = 1/(1 — x). 
Now it is the square of the same y. The geometric series satisfies dy/dx = y”, so the 
function does too. We have stumbled onto a differential equation. 

Notice how the series was squared. A typical term in equation (8) is 3x”, coming 
from | times x? and x times x and x? times 1 on the left side. It is a lot quicker to 
square 1/(1 — x}—but other series can be multiplied when we don’t know what func- 
tions they add up to. 


5. Solve dy/dx = y’ from any starting value—a new application of series: 


Suppose the starting value is y= 1 at x =0. The equation y’= y? gives 1? for the 
derivative. Now a key step: The derivative of the equation gives y” = 2yy’. At x =0 
that is 2-1-1. Continuing upwards, the derivative of 2yy’ is 2yy’ + 2(y')?. At x=0 
that is y”=4+2=6. 

All derivatives are factorials: 1, 2, 6, 24, .... We are matching the derivatives of the 
geometric series 1 + x+x?+x°+.... Term by term, we rediscover the solution to 
y = y*. The solution starting from (0) = 1 is y= 1/1 — x). 


A different starting value is —1. Then y’ = (—1)? = 1 as before. The chain rule gives 
y” = 2py' = — 2 and then y” = 6. With alternating signs to match these derivatives, the 
solution starting from —1 is 


y= —ltx-x? 4x9 = 1 +x). (9) 


It is a smali challenge to recognize the function on the right from the series on the 
left. The series has — x in place of x; then multiply by —1. The sum y= — 1f(1+ x) 
also satisfies y' = y*. We can solve differential equations from all starting values by 
infinite series. Essentially we substitute an unknown series into the equation, and 
calculate one term at a time. 


6. The integrals of 1+x+x?7+° and 1-x+x*--~ are logarithms: 
* d 
rt dete bores [TS = —mg—x (10a) 
oe * dx 
=e _s — see = = + + 
Xx + 3x { a In (1 + x) {10b) 


The derivative of (10a) brings back the geometric series. For logarithms we find 1/n 
not 1/n! The first term x and second term 4x? give linear and quadratic approxi- 
mations. Now we have the whole series. I cannot fail to substitute 1 and 4, to find 
in(1 — 1) and In{1 + 1) and In(1 — 4): 


x=h l+4t+44+54+--=-n0=+o (11a) 
x=h 1-$+$-h4--= In2=.693 (11b) 
x=} $4+$44+4+-°=—-Inf=In2. (12) 
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The first series diverges to infinity. This Aarmonic series 1+ 4 + 4+ --- came into the 
earliest discussion of limits (Section 2.6). The second series has alternating signs and 
converges to In 2. The third has plus signs and also converges to In 2. These will be 
examples for a major topic in infinite series—tests for convergence. 

For the first time in this book we are able to compute a logarithm! Something 
remarkable is involved. The sums of numbers in (11) and (12) were discovered from the 
sums of functions in (10). You might think it would be easier to deal only with numbers, 
to compute In 2, But then we would never have integrated the series for 1/(1 ~ x} and 
detected (10). It is better to work with x, and substitute special values like } at the 
end. 

There are two practical problems with these series. For in 2 they converge slowly. 
For In e they blow up. The correct answer is In e = 1, but the series can’t find it. Both 
problems are solved by adding (10a) to (10b), which cancels the even powers: 

Il+x 


x3 x? 
x4 5+ 5+) ain +x) in == In . (13) 
3 5 I-x 


At x = 4, the right side is In $ — in 4 = In 2. Powers of } are much smaller than powers 
of 1 or }, so In 2 is quickly computed. All logarithms can be found from the improved 
series (13). 
7. Change variables in the geometric series (replace x by x? or — x7): 
Leet et xe ae (1 x") (14} 
L—x?+x4-—x84--= 141+ x?) (15) 


This produces new functions (always our goal). They involve even powers of x. The 
second series will soon be used to calculate x. Other changes are valuable: 


x. x. psy 1 2 

= : b+i4¢f{s} 4-4 = 

5 in place of x: | (5) T= (3/2) 2=x (16) 
1 1 1 1 x 

—in place of x: 1+-+ = +--= =——. 

x se cee a 1-(l/x) x-1 aM?) 


Equation (17) is a series of negative powers x", It converges when |x| is greater than 
1. Convergence in (17) is for large x. Convergence in (16) is for |x| < 2. 


8. The integral of | — x? +x*—x°+--- yields the inverse tangent of x: 


I ! J 
wage gta ne | = tans ha (18) 


7 
We integrated (15) and got odd powers. The magical formula for x (discovered by 
Leibniz} comes when x = |. The angle with tangent | is 2/4: 

1 1 1 t 

ee 

3 5 7 4 
The first three terms give 7 3.47 (not very close). The 5000th term is still of size 
-0001, so the fourth decimal is still not settled. By changing to x = 1/3, the astrono- 
mer Halley and his assistant found 7! correct digits of 2/6 (while waiting for the 
comet). That is one step in the long and amazing story of calculating x. The Chudnov- 
sky brothers recently took the latest step with a supercomputer—they have found 
more than one billion decimal places of x {see Science, June 1989). The digits look 
completely random, as everyone expected. But so far we have no proof that all ten 

digits occur 3/5 of the time. 


1+x? 


(19) 
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Historical note Archimedes located x above 3.14 and below 34. Variations of his 
method (polygons in circles) reached as far as 34 digits—but not for 1800 years. Then 
Hailey found 71 digits of 2/6 with equation (18). For faster convergence that series 
was replaced by other inverse tangents, using smaller values of x: 


aes “a dee apis ees os oe ae 
4 tan 3 tan 3 4 tan 5 tan 339° (20) 


A prodigy named Dase, who coujd multiply 100-digit numbers in his head in § hours. 
finally passed 200 digits of 2. The climax of hand calculation came when Shanks 
published 607 digits. I am sorry to say that only 527 were correct. (With years of 
calculation he went on to 707 digits, but still only 527 were correct.) The mistake was 
not noticed unttl 1945! Then Ferguson reached 808 digits with a desk calculator. 

Now comes the computcr. Three days on an ENIAC (1949) gave 2000 digits. A 
hundred minutes on an IBM 704 (1958) gave 10,000 digits. Shanks (no relation) 
reached 100,000 digits. Finally a million digits were found in a day in 1973, with a 
CDC 7600. Ail these calculations used variations of equation (20). 

The record after that went between Cray and Hitachi and now IBM. But the 
method changed. The calculations rely on an incredibly accurate algorithm, based 
on the “‘arithmetic-geometric mean iteration’’ of Gauss. It is also incredibly simple, 
all things considered: 


a, 4 by / = 2 
an+1 = 3 batt = VS UnDn tym Dabs j( 1~ Om 2a — hi) 


The number of correct digits more than doubles at every step. By n=9 we are far 
beyond Shanks {the hand calculator). No end is in sight. Almost anyone can go past 
a billion digits, since with the Chudnovsky method we don’t have to start over again. 


It is time to stop. You may think (or hope) that nothing more could possibly be 
done with geometric series. We have gone a long way from 1;({1 — x), but some 
functions can never be reached. One is e* (and its relatives sin x, cos x, sinh x, cosh x). 
Another is ./1 — x (and its relatives 1/.)/1 — x’, sin” 'x, sec” 'x, ...). The exponentials 
are in 10.4, with serics that converge for all x. The square roots are in 10.5, closer to 
geometric series and converging for [x| < 1. Before that we have to say what con- 
vergence means. 

The series came fast, but I hope you see what can be done (subtract, multiply, 
differentiate, integrate). Addition is easy, division is harder, all are legal. Some un- 
expected numbers are the sums of infinite series. 


Added in proof By e-mail | just learned that the record for 7 is back in Japan: 
2°° digits which is more than 1.07 billion. The elapsed time was 100 hours (75 hours 
of CPU time on an NEC machine). The billionth digit after the decimal point is 9. 


40.4 EXERCISES 


Read-through questions equals the fraction _g__. The decimal .666... multiplics this 
by _h__. The decimal .999... is the same as __{ 


The geometric series 1 +x +x?4+-~ adds to _a__. It con- The derivative of the geometric series is _]) = kk. 
verges provided |x|<__6 . The sum of terms is _¢ This also comes from squaring the I __ series. By choosing 
The derivatives of the series match the derivatives of 1,;(1 — x) x=.01, the decimal 1.02030405 is close to _m_. The 
at the point x=_d__, where the nth derivative is 6. differential equation dy/dx =)" is solved by the gcomctric 


The decimal! 1.111... is the geometric series alx =__f and series, going term by term starting from y(0)=__n 
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The integral of the geometric series is __9 =  P_. At 
x=1 this becomes the _@__ series, which diverges. At x = 
t__wefindlIn2=_ 3s __. The change from x to —x produ- 


ces the series 1/(t+x)=_t and In(f+x}=__u 


In the geometric series, changing to x? or —x? gives 


Wd—x4y=_ and (1+ x?)=_w_. Integrating the 
last one yields x—4x?+4x5:--=_ x. The angle whose 
tangent isx=1listan-'1=_ y__. Then substituting x =1 
gives the seriesn=_ 2 

1 The geometric series is 1+ x+x?+- =G. Another way 


to discover G is to multiply by x. Then x +x? 4x34 = 
xG, and this can be subtracted from the original series. What 
does that Jeave, and what is G? 


2 A basketball is dropped 10 feet and bounces back 6 feet. 
After every fall it recovers $ of its height. What total distance 
does the ball travel, bouncing forever? 


3 Find the sums of $+ ¢+75+- and 1-4+;— "> and 
10-—1+.{—.01... and 3.040404... 


4 Replace x by 1 —x in the geometric series to find a series 
for 1/x. Integrate to find a series for In x. These are power 
series “around the point x = t.” What is their sum at x = 0? 


§ What is the second derivative of the geometric series, and 
what is its sum at x = 4? 


6 Multiply the series (1+ x+x7+--)(1—x+x?—--} and 
find the product by comparing with equation (14). 


7 Start with the fraction 4. Divide 7 into 1.000... (by long 
division or calculator) until the numbers start repeating. 
Which is the first number to repeat? How do you know that 
the next digits will be the same as the first? 


Note about the fractions 1/q, 10/g, 100/g, ... All remainders are 
less than q so eventually two remainders are the same, By 
subtraction, q goes evenly into a power 10% minus a smaller 
power 10%". Thus ge = 10% — t0%~" for some ¢ and 1/¢ has 
a repeating decimal: 

I c c 1 


= ae i + t + 1 fue 
10% 19” “ 10?" 
Conclusion: Every fraction equals a repeating decimal. 


8 Find the repeating decimal for y and read off c. What is 
the number a of digits before it repeats? 


9 From the fact that every q goes evenly into a power 10” 
minus a smaller power, show that all primes except 2 or 5 go 
evenly into 9 or 99 or 999 or «+. 


10 Explain why .010010001... cannot be a fraction (the 
number of zeros increases}. : 


11 Show that .123456789101112... is not a fraction. 


12 From the geometric series, the repeating decimal 
1.065065... equals what fraction? Explain why every repeating 
decimal] equals a fraction. 


13 Write .878787... and .123123... as fractions and as geo- 
metric series. 


14 Find the square of 1.111... as an infinite series. 


Find the functions which equal the sums 15~24. 


158 x¢xP4x54- 16 1—2x4+4x2_—-» 
17 xP 4x8 4x94 18 $x—4x?74+4x9--- 
19 In x+(n x)? +(in x+- | 20 x—2x?243x3—- 


1 1 


1 x x 
aotatat 22 =P ae” (aay. 


23 tan x—4tan?x+dtan'x—-- Mert +e +e 4-- 


25 Multiply the series for I/(1 —x) and 1A{1 + x) to find the 
coefficients of x, x?, x? and x’. 


26 Compare the integral of 1 + x? + x* + +; to equation (13) 
and find fdx/(1 ~ x2). 


27 What fractions are close to .2468 and .987654321? 

28 Find the first three terms in the series for 1/{t — x). 
Add up the series 29-34. Problem 34 comes from (18). 

WwW .14+,02+.0034+-- 

32 .t —4(.01) + 4(.00t) — -- 


yey Sane A 
1) eee eee Wee ears 
oS tat at 


BM .L4+4.01)4+4{.001) 4+: 


33 .1+4.001)+4.00001}+-* HM I-~ eo +o5-"* 


35 Compute the nth derivative of 1 +2x+3x?+--atx=0, 
Compute also the ath derivative of (1 —x)7?. 


36 The differential equation dy/dx = y? starts from (0) = b. 
From the equation and its derivatives find y’, y’, y” at x =0, 
and construct the start of a series that matches those deriva- 
tives. Can you recognize y{x)? 


37 The equation dy/dx = y* has the differential form dy/y? = 
dx. Integrate both sides and choose the integration constant 
so that y=b at x=0, Solve for y{x) and compare with 
Problem 36. 


38 In a bridge game, what is the average number yp of deals 
until you get the best hand? The probability on the first deal 
is p, =}. Then p, =((4)=(probability of missing on the 
first) times (probability of winning on the second). Generally 
p, =(P"*4)}. The mean value p is py +2p,+3p,+ "= 


39 Show that (Za,)(Zb,) = Xa,5, is ridiculous. 


40 Find a series for In 4 by choosing x in (10b). Find a series 
for In 3 by choosing x in (13). How is In 4 related to In 3, and 
which series converges faster? 
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41 Compute In 3 to its second decimal place without a calcu- 
lator (OK to check). 


42 To four decimal places, find the angle whose tangent is 
x=. 


43 Two tennis players move to the net as they volley the ball, 
Starting together they each go forward 39 feet at 13 feet per 
second. The ball travels back and forth at 26 feet per second. 
How far does it travel before the collision at the net? (Look 
for an easy way and also an infinite series.) 


44 How many terms of the series 1—4+4-—4+4-" are 
needed before the first decimal place doesn’t change? Which 
power of 4 equals the 100th power of 4? Which power !/a" 
equals t/2?0°9 


40.2 Convergence Tests: Positive Series 


45 If tany=4 and tanz=4, then the tangent of y+z is 
(tan y + tan z)/(1 —tan y tan z)=1. If tan y=¢ and tanz= 

, again tan(y+z)= 1. Why is this not as good as 
equation (20), to find 2/4? 


46 Find one decimal of z beyond 3.14 from the series for 
4tan™! 4 and 4 tan”! 4. How many terms are needed in each 
series? 

47 (Calculator) In the same way find one decimal of 2 
beyond 3.14159. How many terms did you take? 


48 From equation (10a) what is Le"/n? 


49 Zeno’s Paradox is that if you go half way, and then half 
way, and then half way..., you will never get there. In your 
opinion, does $+4+4+ + add to 1 or not? 


This is the third time we have stopped the calculations to deal with the definitions. 
Chapter 2 said what a derivative is. Chapter 5 said what an integral is. Now we say 
what the sum of a series is—if it exists. In all three cases a Hmit is involved. That is 
the formal, careful, cautious part of mathematics, which decides if the active and 
progressive parts make sense. 

The series 4+ 4+ 4+ --- converges to 1. The series 1 + 4+4+ -* diverges to infin- 
ity. The series 1 - $+ 4-— --- converges to In 2. When we speak about convergence or 
divergence of a series, we are really speaking about convergence or divergence of its 
“ partialsums.” 


DEFINITION 4 The partial sum s, of the series a, +a, +a,+ -* stops at a,: 
5S, = sum of the first n terms = a, + a,+-' +4,. 
Thus s, is part of the total sum. The example 4 ++ 4+4+ --- has partial sums 
a qe s=i== 
4 8 2" 


Those add up larger and larger parts of the series—what is the sum of the whole 
series? The answer is: The series 4+ 4+ ... converges to | because its partial sums s, 
converge to 1. The series a, + a, +a, + ... converges to s when its partial sums— 
going further and further out—approach this limit s. Add the a’s, not the s’s. 


: | oe 


DEFINITION 2 The sum of a series is the limit of its partial sums s,,. 


We repeat: if the limit exists. The numbers s, may have no limit. When the partial 
sums jump around, the whole series has no sum. Then the series does not converge. 
When the partial sums approach s, the distant terms a, are approaching zero. More 
than that, the sum of distant terms is approaching zero. 

The new idea (2 a, = s} has been converted to the old idea (s, — s). 


EXAMPLE 4 The geometric series 45 + r§5 + roan + ** converges to s=4. 


The partial sums s,, 5, 83, 54 are .1, 11, .111, .1111. They are approaching s= 4. 


40.2 Convergence Tests: Positive Series 


Note again the difference between the series of a’s and the sequence of s’s. The series 
1+1+1+-- diverges because the sequence of s’s is 1, 2, 3, .... A sharper example is 
the harmonic series: 1 + $+ 4+ -- diverges because its partial sums 1, 14, ... eventu- 
ally go past every number s. We saw that in 2.6 and will see it again here. 

Do not confuse a, > 0 with s, + s. You cannot be sure that a series converges, just 
on the basis that a, > 0. The harmonic series is the best example: a, = 1/n + 0 but 
still s, + oc. This makes infinite series into a delicate game, which mathematicians 
enjoy. The line between divergence and convergence is hard to find and easy to cross. 
A slight push will speed up a, +0 and make the s, converge. Even though a, +0 
does not by itself guarantee convergence, it is the first requirement: 


40A_ Ifa series converges (s, -> s) then its terms must approach zero (a, — 0). 


Proof Suppose s, approaches s (as required by convergence). Then also s,_, 
approaches s, and the difference s, — s,_, approaches zero. That difference is a,. So 
a, > 0. 


EXAMPLE 4 (continued) For the geometric series 1 + x + x? + -+-, the test a, > 0 is 
the same as x*-—> 0. The test is failed if |x| > 1, because the powers of x don’t go to 
zero. Automatically the series diverges. The test is passed if -1 <x < 1. But to prove 
convergence, we cannot rely on a, > 0. It is the partial sums that must converge: 


_1-x" 1 


Se=ltxte tx"! and 5, This is s. 


I-x l-x 

For other series, first check that a, +0 {otherwise there is no chance of con- 
vergence). The a, will not have the special form x*—so we need sharper tests. 

The geometric series stays in our mind for this reason. Many convergence tests are 


comparisons with that series, The right comparison gives enough information: 
If |a,| <4 and |a,| <4 and ..., then a, +a, +... converges faster than$+4+.... 


More generally, the terms in a,+a,+a4,;+... may be smaller than 

ax + ax? +ax*+.... Provided x < 1, the second series converges. Then ¥ a, also 

converges. We move now to convergence by comparison or divergence by comparison. 
Throughout the rest of this section, all numbers a, are assumed positive. 


COMPARISON TEST AND INTEGRAL TEST 


In practice it is rare to compute the partial sums s, = a, + -- + a,. Usually a simple 
formula can’t be found. We may never know the exact limit s. But it is still possible 
to decide convergence— whether there is a sum—by comparison with another series 
that is known to converge. 


10B (Comparison test) Suppose that 0<a,<b, and )b, converges. Then 


>}. a, converges. 


The smaller terms a, add to a smaller sum: ¥. a, is below ¥ 5, and must converge. 
On the other hand suppose a, > c, and }' ¢, = oc. This comparison forces : a, = 00. 
A series diverges if it is above another divergent series. 

Note that a series of positive terms can only diverge ‘‘to infinity.” It cannot oscillate, 
because each term moves it forward. Either the s, creep up on s, passing every number 
below it, or they pass ali numbers and diverge. If an increasing sequence s, is bounded 
above, it must converge. The line of real numbers is complete, and has no holes. 
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The harmonic series 1 +3+4+4+ ... diverges to infinity. 


A comparison series is 1+4+44+4+4+4444+4+.... The harmonic series is 
larger. But this comparison series is really 1+ 4+3+3+ .... because 5=3=¢. 


The comparison series diverges. The harmonic series, above it, must also diverge. 


To apply the comparison test, we need something to compare with. In Example 2, 
we thought of another series. It was convenient because of those }’s. But a different 
series will need a different comparison, and where will it come from? There is an 
automatic way to think of a comparison series. It comes from the integral test. 

Allow me to apply the integral test to the same example. To understand the integral 
test, look at the areas in Figure 10,2, The test compares rectangles with curved areas. 


yQ) = + yO} = + v(x) = s 


sum > © so 
integral > es 


[tice 


x? 
: I 
so >> 7: < 2 


p>) 


Fig. 10.2 dntegral test: Sums and integrals both diverge (p = L) and both converge (p > 1). 


EXAMPLE 2 (again) Compare 1+3+4+ ... with the area under the curve y = 1/x. 


Every term a, = 1/n is the area of a rectangle. We are comparing it with a curved 
area c,. Both areas are between x =n and x =n+ 1, and the rectangle is above the 
curve. SO a, > Cy! 

ati dx 


2 


rectangular area a, = exceeds curved area c, = | 


ag = oC 
Here is the point. Those c,’s look complicated, but we can add them up. The sum 
cy +... t¢, is the whole area, from 1! to 2+ 1. It equals In(a + 1)—we know the 
integral of 1/x. We also know that the logarithm goes to infinity. 

The rectangular area | + 1/2+ ... + 1/n is above the curved area. By comparison 


of areas, the harmonic series diverges to infinity—a little faster than In(n + 1). 


Remark The integral of 1/x has another advantage over the series with 4’s. First. 
the integral test was automatic. From 1/n in the serics, we went to |/x in the integral. 
Second, the comparison is closer. Instead of adding only $ when the number of terms 
is doubled, the true partial sums grow like Inn. To prove that, put rectangles under 
the curve. 

Rectangles below the curve give an area below the integral. Figure 10.2b omits the 
first rectangle, to get under the curve. Then we have the opposite to the first 
comparison ---the sum is now smaller than the integral: 

1 1 1 ("dx 

aris rc ee bet a (ae atinA0 EI LE 

2° 3 a I, x 
Adding ! to both sides, s, is below 1+ 1nn. From the previous test, s, is above 
In{n + 1). That is a narrow space--- we have an excellent estimate of s,. The sum of |/n 
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and the integral of 1/x diverge together. Problem 43 will show that the difference 
between s, and In n approaches “Euler’s constant,” which is ? = .577.... 


Main point: Rectangular area is s,. Curved area is close. We are using integrals to 
help with sums (it used to be the opposite). 


Question Ifa computer adds a million terms every second for a million years, how 
large is the partial sum of the harmonic series? 

Answer The number of terms is n = 607 + 24+ 365-10!? < 3.2-10'°. Therefore Inn 
is less than In 3.2 + 19 In 10 < 45. By the integral test s,< 1+ Inn, the partial sum 
after a million years has not reached 46. 


For other series, 1/x changes to a different function y(x). At x =n this function 
must equal a,. Also y(x) must be decreasing. Then a rectangle of height a, is above 
the graph to the right of x =n, and below the graph to the left of x =n. The series 
and the integral box each other in: left sum > integral > right sum. The reasoning is 
the same as it was for a, = 1/n and (x)= 1/x: There is finite area in the rectangles 
when there is finite area under the curve. 

When we can’t add the a’s, we integrate y(x) and compare areas: 


10C (Integral test) If y(x) is decreasing and y(n) agrees with a,, then 


a,+a,+a,;+-° and | y(x) dx both converge or both diverge. 
1 


, Pp I l I 
EXAMPLE 3 The “*p-series” = + = + pt converges if p> 1. Integrate y= —: 
a 8 x 


1 * oe oo : 
=| = so by addition =< | es 


a-1* gaan’ 1 x? 


In Figure 10.2c, the area is finite if p > 1. The integral equals [x’~?/(1 — p)]°, which 
is 1/(p — 1). Finite area means convergent series. If 1/1” is the first term, add 1 to the 


curved area: 
1 1 1 1 p 
tat as SE [EK = 

Le” QP 3? p= 1 p—1 

The borderline case p= 1 is the harmonic series (divergent). By the comparison 
test, every p < 1 also produces divergence. Thus ¥1/,/n diverges by comparison with 
fdx/./x (and also by comparison with £1/n). Section 7.5 on improper integrals runs 
parallel to this section on “improper sums” (infinite series). 

Notice the special cases p = 2 and p= 3. The series 1 + 4 +4 + -*: converges. Euler 
found 27/6 as its sum. The series 1 + 4+ 3+ -*: also converges. That is proved by 
comparing £1/n* with £1/n? or with [dx/x*. But the sum for p = 3 is unkaown. 


Extra credit problem The sum of the p-series leads to the most important problem 
in pure mathematics. The “zeta function” is Z(p) = Z1/n’, so Z(2) = 27/6 and Z(3) is 
unknown. Riemann studied the complex numbers p where Z(p)= 0 (there are infi- 
nitely many). He conjectured that the real part of those p is always }. That has been 
tested for the first billion zeros, but never proved. 


COMPARISON WITH THE GEOMETRIC SERIES 


We can compare any new series a, + a, ++" with 1+x+-:-. Remember that the 
first million terms have nothing to do with convergence. It is further out, as n> «, 
that the comparison stands or falls. We still assume that a, > 0. 
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10D (Ratio test) Hf a,+,/a, approaches a limit L < 1, the series converges. 


40E (Root test) If the nth root (a,)'" approaches L < 1, the series converges. 


Roughly speaking, these tests make a, comparable with £’—therefore convergent. 
The tests also establish divergence if L > 1. They give no decision when L = 1. Unfor- 
tunately L = | is the most important and the hardest case. 

On the other hand, you will now see that the ratio test is fairly easy. 


EXAMPLE 4 The geometric series x + x*+-- has ratio exactly x. The nth root is 
also exactly x. So L = x. There is convergence if x < 1 (known) and divergence if x > | 
(also known). The divergence of 1 + 1+ -+ is too delicate (!} for the ratio test and 
root test, because L= 1. 


EXAMPLE 5 The p-series has a, = L/n? and a, ,/a, = n?/(n + 1)’. The limit as n— co 
is L = 1, for every p. The ratio test does not feel the difference between p = 2 (conver- 
gence} and p= | (divergence) or even p= — I (extreme divergence). Neither does the 
root test. So the integral test is sharper. 


EXAMPLE 6 A combination of p-series and geometric series can now be decided: 


- ig a x" = ee yttl opp i a 
cilia Segpeeeeanaes ary — +--+ has catio = approachin =x. 
1p 3 n (n+ 1p? x" OPP 6 


a, 
It is |x] <1 that decides convergence, not p, The powers x" are stronger than any n°. 
The factorials n! will now prove stronger than any x". 

EXAMPLE 7 The exponential series e = 1+x+3x°+3x°— +> converges for ail x. 
The terms of this series are x"/n! The ratio between neighboring terms is 


tint I x 
x*int n+l 


. which approaches L=0 as n- oc. 


With x = 1, this ratio test gives convergence of } 1/n! The sum is e. With x = 4, the 
larger series ) 4"/n! also converges. We know this sum too—it is e*. Also the sum 
of x"n?/n! converges for any x and p. Again L = 0—the ratio test is not even close. 
The factorials take over, and give convergence. 


Here is the proof of convergence when the ratios approach L< 1, Choose x halfway 
from L to t, Then x < t. Eventually the ratios go below x and stay below: 


An+1/An <X On+2/On4) <x Axy+3/An42<X 
Multiply the first two inequalities. Then multiply ail three: 
dy, /Ay <x Gy +2/Ay <x? ay 43/8y <x? 
Therefore ay.,+@yo2+Gye3+-- is less than ay(x + x7 +x>+ -.-}, Since x< 1, 


comparison with the geometric series gives convergence. 


EXAMPLE 8 The series y l/n” is ideal for the root test. The mth root is lin. Its 
limit is L = 0. Convergence is even faster than for e= }' 1/n! The root test is easily 
explained, since (a,)\"" < x yields a, < x" and x is close to L < 1. So we compare with 
the geometric series. 
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SUMMARY FOR POSITIVE SERIES 


The convergence of geometric series and p-series and exponential series is settled. I 
will put these a,’s in a line, going from most divergent to most convergent. The 
crossover to convergence is after 1/n: 


Et 2 a a a | 
+1+... —_ - — —- - -—- - = 
sad | ae nP n n? pay 7 ant wa Pr 
410A 40B and 10C 40D and 410E 
(a, +0) (comparison and integral) (ratio and root) 


You should know that this crossover is not as sharp as it looks. On the convergent 
side, 1/n(In n)* comes before all those p-series. On the divergent side, 1/n(In n) and 
1/n(In n)(In In n) belong after 1/n. For any divergent (or convergent) series, there is 
another that diverges (or converges) more slowly. 

Thus there is no hope of an ultimate all-purpose comparison test. But comparison 
is the best method available. Every series in that line can be compared with its 
neighbors, and other series can be placed in between. It is a topic that is understood 
best by examples. 


_— Bes , 
EXAMPLE 9 > aa diverges because }) o diverges. The comparison uses In n <n. 


1 dx 1 dx 
EXAMPLE 40 —_ > |— rd = 
ain n)? lz x)? “= Lain n) lz x) * 


The indefinite integrals are —1/In x and In(In x). The first goes to zero as x + 00; the 
integral and series both converge. The second integral In(In x) goes to infinity—very 
slowly but it gets there. So the second series diverges. These examples squeeze new 
series into the line, closer to the crossover. 


1 Dy» wy a i: eee | 
+it—te+<—-+—+=—+ >" (convergence). 


EXAMPLE 44. ————<'— id 
P+. ww” 2° Ss” 10 i 4 5 


The constant 1 in this denominator has no effect—and again in the next example. 


1 1 L, 2h. I. 5 i 
>— so ~tatet>atitst. 
—1 2n 


EXAMPLE 412 = 
a a a & § 


¥. 1/2n is 1/2 times ¥ 1/n, so both series diverge. Two series behave in the same 
way if the ratios a,/b, approach L>(. Examples 11-12 have n?/(n? +1) 1 and 
2n/(2n — 1) > 1. That leads to our final test: 


Reason: a, is smaller than 2Lb, and larger than $Lb, , at least when n is large. So the 
two series behave in the same way. For example ¥ sin (7/n’) converges for p> 1, 
not for p< 1. It behaves like ‘3 1/n? (here L= 7). The tail end of a series (large n) 
controls convergence. The front end (small n) controls most of the sum. 

There are many more series to be investigated by comparison. 
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40.2 EXERCISES 


Read-through questions 


The convergence of a, + a, + ‘+: is decided by the partial sums 

S,=_4_. If the s, approach s, then La,=__» __. For the 
¢_ series 1+x4+--- the partial sums are s,=_d__. In 

that case s, + 1/(1 — x) if and only if __e@ _. In all cases the 

limit s, » s requires thata, > __1__, But the harmonic series 

a, = 1/n shows that we can havea, > __g@ and still the series 
h 


The comparison test says that fO<a,<5,then_ i. In 
case a decreasing (x) agrees with a, at x =n, we can apply 
the J _ test. The sum La, converges if and only if __k 
By this test the p-series L1/m’ converges if and only if p is 

|__. For the harmonic series (p= 1), s,= [+--+ + 1/n is 
close to the integral f(m) = 


The _n__ test applies when a,,,/a, + L. There is con- 
vergence if 0 , divergence if ep __, and no dectsion if 

q__. The same is true for the __¢ __ test, when (a,}!" > L. 
For a geometric-p-series combination a,=x"/n’, the ratio 
Gn+1/a, equals _$__. Its limit is L=__t__ so there is con- 
vergence if__u__. For the exponential e* = £ x"/n! the limit- 
ing ratio @,5,/a, is L=__¥__. This series always _ 
because n! grows faster than any x” or a’. 


There is no sharp line between _* and _y__. But if 
<b, converges and a,/b, + L, it follows from the 2 ___ test 
that La, also converges. 


1 Here is a quick proof that a finite sum 1+44+4+--= 
is impossible. Division by 2 would give $+4+%+°~ = 4s. 
Subtraction would leave 14+ 4+4+ + = 4s. Those last two 
series cannot both add to $s because 


2 Behind every decima! s=.abc... 18 a convergent series 
a/10 + 5/100 + +. By a comparison test prove 
convergence. 


3 From these partial sums s,, find a, and also s= ZY a,: 


{a} 21-2 (b) s,=4n 


4 Find the partial sums s, =a, +a,+°" +4,: 


(c} en 7 


(2) = 1/3" (b) =n" aan 
§ Suppose O<a,<b, and La, converges. What can be 
deduced about £5,? Give examples. 


6 {a} Suppose b, + ¢, <4, (all positive) and La, converges. 
What can you say about Db, and £c,? 
(b) Suppose a, <5, +, (all positive) and La, diverges. 
What can you say about Lb, and Ec,? 


Decide convergence or divergence in 7-10 {and give a reason). 


7 ho + rho t+ sot 8 rho trostiiot '” 


9 shrtitstiyt 10 sort+réetrht+ 


Establish convergence or divergence in 11-20 by a comparison 
test. 


May T0 516 


1 
2 TET 
ayy 


n?+4 
t 1 
1 — = 
6 La cos (5) 
18 }'sin? : 
n 
1 


1 
UP ra ere 


1 
13 
aay 
3 
An 
Blea 


1 
17 Dear 


For 21-28 find the limit L in the ratio test or root test. 


3° 1 
21 pa 22 ba 

n?2" n—1y 
3 2 ni ms 5( 1 ) 


5 6 Don 
27 (Sy 2B rs 


29 (4-4 +4-446-B is “telescoping” because 4 and 4 
cancel —4 and — 4. Add the infinite telescoping series 


-¥(; 4)" (as i): 


30 Compute the sum s for other “telescoping series”: 
(a) 1 1 + i I + 1 1 eee 
t a) \2 4/°\3 5 
{b) ind}4+in¢+1In}+-: 


31 In the integral test, what sum is larger than {} (x) dx and 
what sum is smaller? Draw a figure to illustrate. 


32 Comparing sums with integrals, find numbers larger and 
smaller than 
i i 1 1 
=1 ds = = 
Selt+st+ +5 — ands, Iter" cae 
33 Which integral test shows that }\7° 1/e" converges? What 
is the sum? 


34 Which intcgral test shows that }'? n/e” converges? What 
is the sum? 
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Decide for or against convergence in 35—42, based on | y{x) dx. 
1 


se Le +1 2 3n+5 

vy =] By — sh =” (is a * decreasing 
fs = ~ t 

st al i sg y n(in n)(In In n) 

4t Sen" 42 Snjer 


43 (a) Explain why D, = 1+5 


by using rectangles as in Figure 16,2. 
(b) Show that D,,, is less than D, by proving that 


1 ie ax 
—< —. 
n+l --. I% 
(c) (Calculator) The decreasing D,’s must approach a limit. 


Compute them until they go below .6 and below .58 
(when?}, The limit of the D, is Euler’s constant y = .STT.... 


1 : ar 
+ +1) — Inn is positive 
it 


44 In the harmonic series, use s, +.577+in- to show that 


I i 
S=l+eteo+ : needs more than 600 terms to reach s, > 7, 


2 
How many terms for s,, > 10? 


1 re 1 1 t 1 
45 hat 1— = = oe 
(a) Show that 1 at 374 on oa ia + 57 by 
. i 1 1 : 
adding x(3 + 4 Se 2 an x) to both sides, 


(b) Why is the right side close to In 2n—In n? Deduce that 
1~-~4+4-4+-- approaches In 2. 


46 Every second a computer adds a million terms of 
¥ 1/(n In»). By comparison with { dx/(x in x), estimate the 
partial sum after a million years (see Question in text). 


1000 4 


100 n? 


47 Estimate by companson with an integrai. 


48 If La, converges (all a, > 0) show that Za? converges. 


10.3 Convergence Tests: All Series 


49 If La, converges (all a, > 0) show that £ sin a, converges. 

How could © sin a, converge when 2 a, diverges? 

30 The ath sa number p, satisfies p,/n In n > 1. Prove that 
t t 


fad 
Yo 3 35 FT 


+++: diverges. 


Construct a series La, that converges faster than Lb, but 
slower than &c, (meaning a,/b, — 0, a,/c, + ©). 


St b, = 1/n?, c, = 1/n? 52 b, = nld)", cr = (5)" 
53 b, = ifnl, c, = 1jn® 54 6, =i/n*, c, = 1/e 
In Problem 53 use Stirling’s formula ./27n n"/e"n! — 1, 


55 For the series $+3+4+4+4+$+-" show that the 
ratio test fails. The roots (a,}!" do approach a limit L. Find 
L from the even terms a,, = 1/2". Does the series converge? 


56 (For instructors) If the ratios a,,,/a, approach a positive 
limit L show that the roots (a,)!" also approach L. 


Decide convergence in 57—66 and name your test. 


1 1 
57 Liner 58 year 

i 1 
9 ig 2 i (10") 
61 Yin=— aa 62 Fn 


1 Inn 
63 5° —__ & yY — 
Lin WF (test alt p) + 7 {test all p) 


3" nP 
65 5° ——_ 66 5 — 
>. roy y (nt? (test all p, q) 


67 Suppose a, /b, > 0 in the limit comparison test. Prove that 
La, converges if Lb, converges. 


68 Can you invent a series whose convergence you and your 
instructor cannot decide? 


This section finally allows the numbers a, to be negative. The geometric series 1 — 
$+4-a+ > =4 is certainly allowed. So is the series r= 4-$+4$-4$+---. If we 
change all signs to +, the geometric series would still converge (to the larger sum 2). 
This is the first test, to bring back a positive series by taking the absolute value |a,| 


of every term. 


DEFINITION The series La, is ‘absolutely convergent” if L |a,| is convergent. 
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Changing a negative number from a, to |a,| increases the sum. Main point: The 
smaller series 2 a, is sure to converge if £ |a,| converges. 


40G If X|Ja,| converges then 2 a, converges (absolutely). But © a, might con- 
verge, as in the series for z, even if X|a,| diverges to infinity. 


EXAMPLE 4 Start with the positive series } ++ 4+ + **-. Change any signs to minus. 
Then the new series converges (absolutely). The right choice of signs will make it 
converge to any number between —1 and 1. 


EXAMPLE 2. Start with the alternating series 1 -4+4-— 4+ --: which converges to 
In 2. Change to plus signs. The new series 1 +4+4+ -* diverges to infinity. The 
original alternating series was not absolutely convergent. It was only “conditionally 
convergent.’ A series can converge (conditionally) by a careful choice of signs—even 
if L\a,| = 00. 


If X \a,| converges then > a, converges. Here is a quick proof. The numbers a, + |a,| 
are either zero (if a, is negative) or 2\a,|. By comparison with & 2|a,|, which converges, 
x (a, + |a,|) must converge. Now subtract the convergent series £ |a,|. The difference 
La, also converges, completing the proof. All tests for positive series (integral, ratio, 
comparison, ...) apply immediately to absolute convergence, because we switch to 


|, - 


EXAMPLE 3 Start with the geometric series 4 + $+ 34+ --- which converges to }. 
Change any of those signs to minus. Then the new series must converge (absolutely). 
But the sign changes cannot achieve all sums between — 3 and 4. This time the sums 
belong to the famous (and very thin) Cantor set of Section 3.7. 


EXAMPLE 4 (looking ahead) Suppose 2 a,x" converges for a particular number x. 
Then for every x nearer to zero, it converges absolutely. This will be proved and used 
in Section 10.6 on power series, where it is the most important step in the theory. 


EXAMPLE 5 Since £1/n? converges, so does £ (cos n)/n?. That second series has 
irregular signs, but it converges absolutely by comparison with the first series (since 
|cos n| <1). Probably £(tan n)/n? does not converge, because the tangent does not 
stay bounded like the cosine. 


ALTERNATING SERIES 


The series 1 —-}+4—4+ -- converges to In 2. That was stated without proof. This 
is an example of an alternating series, in which the signs alternate between plus and 
minus. There is the additional property that the absolute values 1, , 4, 4, ... decrease 
to zero. Those two facts—decrease to zero with alternating signs—guarantee 
convergence. 


The best proof is in Figure 10.3. Look at a, — a, + a3. It is below a, , because a; (with 
plus sign) is smaller than a, (with minus sign). The sum of five terms is less than the 
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( 
i 
| , a, | 
) 

eo | 
1 +83 I f 
————_ ! 
\ { 1 a, I l 
\ | to 1 | 
1 | +45 I r : 
a 
0 S> Sq SS Sy 5 


Fig. 40.3 An alternating series converges when the absolute values decrease to zero. 


sum of three terms, because a, is smaller than a,. These partial sums s,, $3, Ss, ... 
with an odd number of terms are decreasing. 

Now look at two terms a, ~ te then four terms, then six terms. Adding on a4 — a, 
increases the sum (because a3 > a,). Similarly s, is greater than s, (hecause it includes 
as — dg which is positive). So the sums s2, S,, Sg, ... are increasing. 

The difference between s,_, and s, is the single number +a,. It is required by 10H 
to approach zero. Therefore the decreasing sequence s,, 53, ... approaches the same 
limit 5 as the increasing sequence s,, 54, .... The series converges to s, which always 
lies between s,_, and s,. 

This plus-minus pattern is special but important. The positive series La, may not 
converge. The alternating series is L(—1)"* 1a, 


EXAMPLE 6 The alternating series 4— $+%-—$--- is conditionally convergent (to 
x). The absolute values decrease to zero. Is this series absolutely convergent? No. 
With plus signs, 4(1 +4+ 4+ ---) diverges like the harmonic series. 


EXAMPLE 7 The alternating series 1 -1+1—1+--- is not convergent at all. Which 
requirement in 10H ts not met? The partial sums s,,53,55,... all equal ! and 
Sy, 54, Sg, --- all equal O—but they don’t approach the same limit s. 


MULTIPLYING AND REARRANGING SERIES 


In Section 10.1 we added and subtracted and multiplied series. Certainly addition 
and subtraction are safe. If one series has partial sums s, + s and the other has partial 
sums {, +¢, then addition gives partial sums s,+t,+s+t. But multiplication is 
more dangerous, because the order of the multiplication can make a difference. More 
exactly, the order of terms is important when the series are conditionally convergent. 
For absolutely convergent series, the order makes no difference. We can rearrange 
their terms and multiply them in any order, and the sum and product comes out 
right: 


: 49h: Suppose Lo, converges. absolutely, If. ‘Ais Ay, a ‘is ‘any reordering of the 
tthe Fy: ae rhe In the: new: ae? 2A alg pomvedars seeteay: : 


Rather than proving 10] and 10J, we show what happens when there is only condi- 
tional convergence. Our favorite is 1—$+4—4+ °°", converging conditionally to 
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In 2. By rearranging, it will converge conditionally to anything! Suppose the desired 
sum is 1000. Take positive terms | + 4+ - until they pass 1000. Then add negative 
terms ~4— 4 - + until the subtotal drops below 1000. Then new positive terms 
bring it above 1000, and so on. All terms are eventually used, since at least one new 
term is needed at each step. The limit is s = 1000. 

We also get strange products, when series fail to converge absolutely: 


! 1 1 1 - 1 1 I 1 1 
¢ aa lt te) ! ata)tlatata)” 
On the left the series converge (conditionally). The alternating terms go to zero. On 
the right the series diverges. Its terms in parentheses don’t even approach zero, and 
the product is completely wrong. 

I close by emphasizing that it is absolute convergence that matters. The most 
important series are power series La,x". Like the geometric series (with all a,= 1) 
there is absolute convergence over an interval of x’s. They give functions of x, which 
is what calculus needs and wants. 

We go next to the series for e*, which is absolutely convergent everywhere. From 
the viewpoint of conyergence tests it is too easy—the danger is gone. But from the 
view point of calculus and its applications, e* is unconditionally the best. 


40.3 EXERCISES 


Read-through questions 


The series La, is absolutely convergent if the series__4_ is 
convergent. Then the original series Xa, is also __b _. But 
the series Da, can converge without converging absolutely. 


That is called __¢ convergence, and the series __¢__is an 
example. 

For alternating series, the sign of each a,,,is__@ _ to the 
sign of a,. With the extra conditions that __f and og , 


the series converges {at least conditionally). The partial sums 
S},83,-.. are andthe partialsumss,,s,,...are__ | 
The difference between s, and s,-; is _/ _. Therefore the 
two series converge to the same number s. An alternating 
sertes that converges absolutely [conditionally] (not at all) is 
kk [_! ){_™_). With absolute [conditional] con- 
vergence a reordering {can or cannot?) change the sum. 


Do the series 1-12 converge absolutely or conditionally? 


Ler, 2 YE yn 3 
1 ¥ 
3 pH dager 4d Ler 
5 Y-irt'3 Jan + 1) 6 ¥(-1)"*! sin2n 
—ptytt 1 pyr sin?n 
7 Deny tin( 2) reyes 


9 FC nal + nt} 
WY (-1tt tal 


10 yet 1glin 
12 5-171 —n") 


13 Suppose La, converges absolutely. Explain why keeping 
the positive a’s gives another convergent series. 


14 Can La, converge absolutely if all a, are negative? 


15 Show that the alternating series 1~3+4—-44+4-44+-°- 
does not converge, by computing the partial sums sz, 54,.... 
Which requirement of 10H is not met? 


16 Show that ¢-#+4-—%+--- does not converge. Which 
requirement of 10H is not met? 


17 (a) For an alternating series with terms decreasing to zero, 
why does the sum s always lie between s,_, and s,? 


(b) Is s —s, positive or negative if s, stops at a positive a,? 
18 Use Problem !7 to give a bound on the difference between 
s5=1-44+4—-1+44 and the sum s=in2 of the infinite 
series. 
19 Find the sum 1-142 
ind the sum “yt RO aAt 
is (above s)(below s) by jess than 


~+ = s. The partial sum s,4 


20 Give a bound on the difference between sino = 


1 i 1 I 
TE gt 32 Tyge and SL in 
: Le sede od "See : 
21 Starting from pio ae with plus signs, 


show that the alternating series in Problem 20 has s = 27/12. 
22 Does the alternating series in 20 or the positive sertes in 
2t give x? more quickly? Compare 1/101? — 1/102? + --- with 
1/101? + 1/102? + ---. 


10.3 Convergence Tests: All Series 


23 If La, does not converge show that Ela,| does not 
converge. 


24 Find conditions which guarantee that a, +a,—a3;+ 
a4+a@s—a,+-*: will converge (negative term follows two 
positive terms). 


25 If the terms of In 2=1—4$+4~44 ++ are rearranged into 
1-—4-34+4-4-4+--, show that this series now adds to 
41n 2. (Combine each positive term with the following nega- 
tive term.) 


26 Show that the series 1+4-4+4+4-44+-- 
to #1n 2. 


27 What is the sum of 1+4—44+4-43+4-§+°7 


converges 


28 Combine tee 4)—tnnsy and 1—}+4—--->in2 


to prove i+4+4-4-4-34:°° =in2. 


29 (a) Prove that this alternating series converges: 


to PS ee fl pe I SS gai 
on ee eed Geek 


(b) Show that its sum is Euler's constant y. 


30 Prove that this series converges, Its sum is x/2. 


Kiet 2n ¢ 
ict [St ht sn ay 
G x 


1 
31 The cosine of 6=1 radian is 1 — rT += z 


cos | to five correct decimals (how sai terms?}. 


-, Compute 


Bs 


32 The sine of 6 =7n radians is x — a +5 


sin x to eight correct decimals {how many aan 


--, Compute 


33 If La? and Lb? are convergent show that La,5, is abso- 
lutely convergent. 


Hint: (a + 5)? > 0 yields 2|ab] < a? + 57. 
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: Verify the Schwarz inequality (£a,b,)? <(Za2}(L5?) if 
in = (4)" and b, = (3). 


35 Under what condition does E (ays —a,) converge and 
what is its sum? 


36 For a conditionally convergent series, explain how the 
terms could be rearranged so that the sum is + oo. All terms 
must eventually be included, even negative terms. 


37 Describe the terms in the product (1+4+4+4-J(t+44+ 
$+ °-) and find their sum. 


38 True or false: 


(a} Every alternating series converges. 

(b} Za, converges conditionally if Z|a,] diverges. 

(c} A convergent series with positive terms is absolutely 
convergent. 

(d) If Za, and 25, both converge, so does E(a, + 5,). 


39 Every number x between 0 and 2 equals 1+3$+34+°" 
with suitable terms deleted. Why? 


40 Every number s between —I and f equals +4+44+4+°° 
with a suitable choice of signs. (Add 1 =4+}+4+ : to get 
Problem 39,} Which signs give s= —t and s=Oand s= 4? 


41 Show that no choice of signs will make +4+$+y7+°~ 
equal to zero. 


42 The sums in Problem 41 form a Cantor set centered at 
zero. What is the smallest positive number in the set? Choose 
signs to show that } is in the set. 


*43 Show that the tangent of 6 = 4(x — t) is sin 1/(1 —cos 1). 
This is the imaginary part of s= —In(1—e'). From 
s=Ze'"/n deduce the remarkable sum E (sin n)}/n = }{x — 1}. 


44 Suppose Ea, converges and |x|<1. Show that La,x* 
converges absolutely. 


40.4 The Taylor Series for e*, sin x, and cos x |x 


This section goes back from numbers to functions. Instead of Za, =s it deals with 
La,x" = f(x). The sum is a function of x. The geometric series has all a, = 1 (including 
ap, the constant term) and its sum is f{x) = 1/(t — x). The derivatives of 1 + x + x?+-- 
match the derivatives of {. Now we choose the a, differently, to match a different 


function, 


The new function is e*. All its derivatives are e*. At x =0, this function and its 
derivatives equal 1. To match these 1’s, we move factorials into the denominators. 
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10 Infinite Series 


Term by term the series is 
eX oe 
x"/n! has the correct nth derivative (= 1). From the derivatives at x = 0, we have built 
back the function! At x = 1 the right side is 1+ 1+4+4+4+--- and the left side is e= 
2.71828.... At x = — f the series gives 1- 1+4—4+---, which is e7?. 
The same term-by-term idea works for differential equations, as follows. 


EXAMPLE 4 Solve dy/dx = — y starting from y= 1 at x= 0. 


Solution The zeroth derivative at x = 0 is the function itself: y= 1. Then the equation 
y=ur-y gives y=-1 and y"=—y'=+1. The alternating derivatives 


x. 


i, -i, 1, —1,... are matched by the alternating series for e~*: 


y= l—-xt+4x?-$x°+-- =e * (the correct solution to y = — y). 


EXAMPLE 2 Solve d?y/dx? = — y starting from y = 1 and y’ = 0 (the answer is cos x). 


Solution The equation gives y” = — | {again at x = 0). The derivative of the equation 
gives y” = — y'=0, Then y’”= — y’= + 1. The even derivatives are alternately +1 
and —1, the odd derivatives are zero. This is matched by a series of even powers, 
which constructs cos x: 


“*=COSs x. 


The first terms 1 — $x* came earlier in the book. Now we have the whole alternating 
series. It converges absolutely for all x, by comparison with the series for e* (odd 
powers are dropped). The partial sums in Figure 10.4 reach further and further before 
they lose touch with cos x. 


—(!0) 
Ag. 10.4 The partial sums 1 — x*/2+4x*/24—-- of the cosine series. 


If we wanted plus signs instead of plus-minus, we could average e* and e*. The 
differential equation for cosh x is d?y/dx? = + y, to give plus signs: 


i ial Dg Bee sot Gan oy 
ze te Jat 5x +47 + ae +--+ (which is cosh x). 


TAYLOR SERIES 
The idea of matching derivatives by powers is becoming central to this chapter. The 


derivatives are given at a basepoint (say x = 0). They are numbers (0), f’(0), .... The 
derivative f°0) will be the nth derivative of a,x", if we choose a, to be f(0)/n! 


40.4 The Taylor Series for e*, sin x, and cos x 
Then the series La,x" has the same derivatives at the basepoint as f(x): 


40K The Taylor series that matches f(x) and all its derivatives at x = 0 is 


1/79) 
a it 


HO)+SO+ 5f"OV? + EF"OP += 


The first terms give the linear and quadratic approximations that we know well. The 
x? term was mentioned earlier (but not used). Now we have all the terms—an “‘infinite 
approximation” that is intended to equal f(x). 

Two things are needed. First, the series must converge. Second, the function must 
do what the series predicts, away from x = 0. Those are true for e* and cos x and 
sin x; the series equals the function. We proceed on that basis. 

The Taylor series with special basepoint x = 0 is also called the “‘Maclaurin series.” 


EXAMPLE 3 Find the Taylor series for f(x) = sin x around x = 0. 


Solution The numbers f(0) are the values of f= sin x, f’=cos x, f” = —sin x, ... 
at x = 0. Those values are 0, 1,0, —1, 0, 1, .... All even derivatives are zero. To find 
the coefficients in the Taylor series, divide by the factorials: 


sin x= x— $x°> + zhyx°—**. (2) 


EXAMPLE 4 Find the Taylor series for f(x) =(1 + x)° around x =0. 


Solution This function starts at f(0) = 1. Its derivative is 5(1 + x)*, so f’(0) = 5. The 
second derivative is 5-4-(1+x)°, so f’(0)=5°4. The next three derivatives are 
5+4°3,5°4+3+2,5+4+3+2+1. After that all derivatives are zero. Therefore the Taylor 
series stops after the x° term: 


26 5 SS 5 SOS ge SASS 

1+ 5x4 GPx + 3! D die 4! x ce oe 

You may recognize 1, 5, 10, 10, 5, 1. They are the binomial coefficients, which appear 
in Pascal’s triangle (Section 2.2). By matching derivatives, we see why 0!, 1!, 2!, ... are 


needed in the denominators. 


(3) 


There is no doubt that x = 0 is the nicest basepoint. But Taylor series can be con- 
structed around other points x = a. The principle is the same—match derivatives by 
powers—but now the powers to use are (x — a)". The derivatives f(a) are computed 
at the new basepoint x = a. 

The Taylor series begins with f(a) +f'(a)(x — a). This is the tangent approximation 
at x =a. The whole “infinite approximation” is centered at a—at that point it has 
the same derivatives as f(x). 


40L The Taylor series for f(x) around the basepoint x = a is 


fa) 


n! 


1 20 
f(x) = f(a) +f (a)(x — a) + of "(a(x — a+ = 2 ——(x-a)". (4) 


EXAMPLE 5 Find the Taylor series for f(x) =(1 + x)° around x=a=1. 


Solution At x=1, the function is (1 + 1)° = 32. Its first derivative 5(1 + x)* is 
5-16 = 80. We compute the nth derivative, divide by n!, and multiply by (x — 1)": 


32 + 80(x — 1) + 80(x — 1)? + 40(x — 1)? + 10(x — 1)* +(x — 1)°. (5) 
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40 infinite Series 


That Taylor series (which stops at m = 5) should agree with (1 + x)*. It does. We could 
rewrite 1+ x as 2+ (x — 1), and take its fifth power directly. Then 32, 16, 8, 4, 2, I will 
multiply the usual coefficients 1,5, 10, 10,5,1 to give our Taylor coefficients 
32, 80, 80, 40, 10, 1. The series stops as it will stop for any polynomial—because the 
high derivatives are zero. 


EXAMPLE 6 Find the Taylor series for f(x) = e around the basepoint x = 1. 


Solution At x=1 the function and all its derivatives equal e. Therefore the 
Taylor series has that constant factor (note the powers of x — 1, not x): 


e € 
em etex— It ett ae IP t+. (6) 


DEFINING THE FUNCTION BY ITS SERIES 


Usually, we define sin x and cos x from the sides of a triangle. But we could start 
instead with the senes. Define sin x by equation (2). The logic goes backward, but it 
is still correct: 


First, prove that the series converges. 
Second, prove properties like (sin x = cos x. 
Third, connect the definitions by series to the sides of a triangle. 


We don’t pian to do all this. The usual definition was good enough. But note first: 
There is no problem with convergence. The series for sin x and cos x and e all have 
terms + x"/n!. The factorials make the series converge for all x. The general rule for 
e* times @ can be based on the series. Equation (6) is typical: e is multiplied by 
powers of {x — 1). Those powers add to e*~'. So the series proves that e* = ee*~!, 
That is just one example of the multiplication (e*)(e’) = e**”: 

2 


2 2 2 
x ¥ x y 
L+xt+—t+..Jilt+ytaet.jeltextytotxuytet.... 7 
( 5 )( yr } PEG PATS {7) 
Term by term, multiplication gives the series for e**”. Term by term, differentiating 
the series for e* gives e*. Term by term, the derivative of sin x is cos x: 


d xe x? x? xt 
Eyes 2 oleate tees. 
mG 31 5) ) a 4! ®) 


We don’t need the famous limit (sin x)/x + 1, by which geometry gave us the deriva- 
tive. The identities of trigonometry become identities of infinite series. We could even 
define x as the first positive x at which x —3x*+ --- equals zero. But it is certainly 
not obvious that this sine series returns to zeto—much less that the point of return 
is near 3.14, 


The function that will be defined by infinite series is e"°. This is the exponential of 
the imaginary number if (a multiple of i= J/-1 ). The result e” is a complex number, 
and our goal is to identify it. (We will be confirming Section 9.4.) The technique is to 
treat i@ like all other numbers, real or complex, and simply put it into the series: 


DEFINITION =e’? is the sum of 1 + (i8) + (io) + 5 (ia) foe, (9) 


Now use i7= — 1. The even powers are *=+4 1, i©=—1, i®=+1,.... We are 
just multiplying —1 by —1 to get 1. The odd powers are i> = — i, 5 = +, .... There- 
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fore e splits into a real part (with no i’s) and an imaginary part (multiplying i): 


a(1- f+ bo) 4i(o- 50+ 08), (10) 


You recognize those series. They are cos @ and sin @. Therefore: 
Euler’s formula is e = cos 0+ i sin 0. Note that ce?" = 1, 


The real part is x = cos @ and the imaginary part is y= sin 8. Those coordinates pick 
out the point e in the “complex plane.” Its distance from the origin (0, 0) is r= 1, 
because (cos 6)” + (sin 6)? = 1. Its angle is @, as shown in Figure 10.5. The number 
—-1 is e'", at the distance r= 1 and the angle x. It is on the real axis to the left of 
zero. If ¢* is multiplied by r = 2 or r= 4 or any r > 0, the result is a complex number 
at a distance r from the origin: 


e!8 = cos O+isin 8 


Complex numbers: re = r(cos 6+ i sin 6) =r cos 0 + ir sin @=x + iy. 


With ¢® a negative number has a logarithm. The logarithm of ~1 is imaginary 
(it is iz, since e'* = ~ 1). A negative number also has fractional powers. The fourth 
root of —1 is (-1)* = e'**, More important for calculus: The derivative of x** is 
$x/4, That sounds old and familiar, but at x= — 1 it was never allowed. 

Complex numbers tie up the loose ends left by the limitations of real numbers. 


The formula e = cos @+ isin @ has been called ‘one of the greatest mysteries of 
undergraduate mathematics.” Writers have used desperate methods to avoid infinite 
series. That proof in (10) may be the clearest (I remember sending it to a prisoner 
studying calculus) but here is a way to start from d/dx(e”) = ie”. 

A different proof of Euler’s formula Any complex number is e'* = r(cos 6 + i sin 6) 
for some r and @. Take the x derivative of both sides, and substitute for ie’: 


(cos 6+ i sin O™)dr/dx + r(—sin 6+ i cos A)dB/dx = ir(cos 6+ i sin 6). 


Comparing the real parts and also the imaginary parts, we need dr/dx =0 and 
d@/dx = t. The starting values r= 1 and 6 =0 are known from e’° = 1. Therefore r is 
always | and @ is x. Substituting into the first sentence of the proof, we have Euler’s 
formula e’* = I(cos @ + i sin 9). 


40.4 EXERCISES 


Read-through questions We define e7, sin x, cos x, and also e by their series, The 
derivative d/dx(l+x+4x7+-")=1+x+-" translates to 
t__. The derivative of |—$x7+--is_ 4. Using i? = 


—1 the series 1 + 16+ 4(i6)? + --- splits into e® =__v¥__. Its 
square gives e7"=  w__. Its reciprocal is e®=_ x, 
Multiplying by r givesre® = ¥ +i_ 2 , which connects 


the polar and rectangular forms of a __A__ number. The 
logarithm of e” is__ 8 


The __@ _ series is chosen to match f{x) and all its__b 
at the basepoint. Around x=0 the series begins with 
f()+_¢ x+_4_x?. The coefficient of x"is__@ . For 
S(x) =e* this series is __¢ . For f{x)=cos x the series is 
¢@__. For f(x) =sin x the series is __h__. If the signs were 
all positive in those series, the functions would be cosh x and 


|__. Addition gives cosh x+sinhx=__ 1 

In the Taylor series for {(x} around x = a, the coefficient of 
(x—a@fisb,=__k . Then b,{x—a)" has the same__' as 1 Write down the series for e?* and compute ali derivatives 
fat the basepoint. In the example f(x} = x?, the Taylor coeffi- at x= 0, Give a series of numbers that adds to e?. 
cients are bb =_ -M™_ , db; = 2 _, b,= 9 _.. The series 
bo +5;(x —@)+52{x—a)* agrees with the original p_. 2 Write down the series for sin 2x and check the third 
The series for e” around x=a has b,=__@_. Then the derivative at x=0. Give a series of numbers that adds to 


Taylor series reproduces the identity e7=(__' }(_ 8). sin 2x = 0. 
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In 3-8 find the derivatives of f(x) at x = 0 and the Taylor series 
(powers of x) with those derivatives. 


3 f(x) = e* 4 f(x) = 10+) 
5 f(x) = 1/(1 — 2x) 6 f(x) = cosh x 
7 fi) sln (L— x) 8 f(x)=In 0+ x) 


Problems 9-14 solve differential equations by series. 


9 From the equation dy/dx = y ~ 2 find all the derivatives 
of y at x=0 starting from (0)= 1. Construct the infinite 
series for y, identify it as a known function, and verify that 
the function satisfies y’ = y — 2. 


10 Differentiate the equation y'=cy+s (ce and s constant) 
to find all derivatives of y at x =O. If the starting value is 
Yo =0, construct the Taylor series for y and identify it with 
the solution of y’=cy +s in Section 6.3. 


11 Find the infinite series that solves y’ = — y starting from 
y=Oand y=?t atx=0. 


12 Find the infinite series that solves y = y starting from y = 
} at x=3 {use powers of x— 3). Identify y as a known 
function. 


13 Find the infinite series (powers of x) that solves y” = 
2y' — y starting from y=0 and y' = 1 at x=0. 

14 Solve p” = y by a series with y= 1 and y’ =O at x =0 and 
identify y as a known function. 


15 Find the Taylor series for f(x} =(1 +x)? around x =a= 
0 and around x =a=1! (powers of x — 1). Check that both 
series add to {I + x)’. 


16 Find all derivatives of f(x} = x3 at x = a and write out the 
Taylor series around that point, Verify that it adds to x’. 


17 What is the series for ({ — x)° with basepoint a = 1? 


18 Write down the Taylor series for f= cos x around x = 22 
and also for f= cos (x — 2m) around x = 90, 


In 19-24 compute the derivatives of { and its Taylor series 
around x = |, 


19 f(x) = 1/x 20 f{x) = 1(2-—x} 
21 f(x) =In x 22 f(x) = x* 
23 f(x) =e7* 24 f(x) =e* 


In 25—33 write down the first three nonzero terms of the Taylor 
series around x = 0, from the series for e*, cos x, and sin x. 


25 xe?* 26 cos \/x 27 (1 —cos x}/x? 
ag OA 29 | ex ax = sin x? 

x o «x 
31 e* 32 bt = ein? 33 e cos x 


*34 For x <0 the derivative of x" is still mx" !: 


a ui) _@ it iE) __ n= 1 ine 2/5 
det) g, (ere) = alte 
What is d|x|/dx? Rewrite this answer as mx}. 


35 Why doesn’t f(x) = Jx have a Taylor series around x = 
0? Find the first two terms around x = 1. 


36 Find the Taylor series for 2* around x = 0. 


In 37—44 find the first three terms of the Taylor series around 
x=0. 


37 f(x) =tan7!x 
39 f(x) =tan x 
41 f(x) =e""* 
43 f(x} = cos*x 


38 f(x) =sin7 1x 
40 f(x) = In{cos x) 
42 f(x) =tanh-'x 
44 f(x) =sec?x 
45 From e”=cos@+isin@ and e~=cos @—isin @, add 
and subtract to find cos @ and sin 0. 
46 Does (e'*)? equal cos76 + i sin?@ or cos 6? + i sin 67? 
47 Find the real and imaginary parts and the 99th power of 
e*, eit!2 eins and e7 iX/6. 
48 The three cube roots of 1 are 1, e7*#/9, e**!/3, 
{a) Find the real and imaginary parts of e?""7. 
(b) Explain why (e7*")> = 1, 
(c} Check this statement in rectangular coordinates. 


49 The cube roots of ~1=e'* are e**/? and and 
. Find their sum and their product, 


50 Find the squares of 2e3=14./3i and dei — 
2/2+ i2,/2 in both polar and rectangular coordinates. 


Sl Multiply e*=coss+isins times e*=cost+isin¢ to 
find formulas for cos(s + ¢) and sin(s + ¢). 


52 Multiply e* times e~" to find formulas for cos(s — t) and 
sin(s — t}. 


53 Find the logarithm of i. Then find another logarithm of i. 
(What can you add to the exponent of ¢”’ without changing 
the result?) 


54 (Proof that e is irrational) If e = p/q then 


! i 4 
=pl)- — — re 
y al ( Ti ae al 


would be an integer. (Why?) The number in brackets—the 
distance from the alternating series to its sum t/e—is less 
than the last term which is {/p! Deduce that |N| < 1 and reach 
a contradiction, which proves that e cannot equal p/g. 


55 Solve dy/dx = y by infinite series starting from y =2 at 
x=0. 
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This section studies the properties of a power series. When the basepoint is zero, the 
powers are x". The series is La,x". When the basepoint is x =a, the powers are 
(x — a)". We want to know when and where (and how quickly) the series converges 
to the underlying function. For e* and cos x and sin x there is convergence for all 
x—but that is certainly not true for 1/(1— x). The convergence is best when the 
function is smooth. 

First I emphasize that power series are not the only series. For many applications 
they are not the best choice. An alternative is a sum of sines, f(x) = 2b, sin nx. That 
is a ‘Fourier sine series”, which treats all x’s equally instead of picking on a basepoint. 
A Fourier series allows jumps and corners in the graph—it takes the rough with the 
smooth. By contrast a power series is terrific near its basepoint, and gets worse as 
you move away. The Taylor coefficients a, are totally determined at the base- 
point—where all derivatives are computed. Remember the rule for Taylor series: 


d,, = (nth derivative at the basepoint)/n! = f(a)/n! (1) 


A remarkable fact is the convergence in a symmetric interval around x = a. 


40M _ A power series La,x" either converges for all x, or it converges only at 
the basepoint x = 0, or else it has a radius of convergence r: 


La,x" converges absolutely if |x| <r and diverges if |x| > r. 


The series £x"/n! converges for all x (the sum is e*). The series Xn!x”" converges for 
no x (except x = 0). The geometric series 2x" converges absolutely for |x|< 1 and 
diverges for |x|> 1. Its radius of convergence is r= 1. Note that its sum 1/(1 — x) is 
perfectly good for |x| > 1—the function is all right but the series has given up. If 
something goes wrong at the distance r, a power series can’t get past that point. 

When the basepoint is x = a, the interval of convergence shifts over to |x — a| <r. 
The series converges for x between a — r and a+r (symmetric around a). We cannot 
say in advance whether the endpoints a +r give divergence or convergence (absolute 
or conditional). Inside the interval, an easy comparison test will now prove con- 
vergence. 


PROOF OF 10M Suppose a, X" converges at a particular point X. The proof will 
show that a,x" converges when |x| is less than the number |X|. Thus convergence 
at X gives convergence at all closer points x (I mean closer to the basepoint 0). Proof: 
Since = a, X" converges, its terms approach zero. Eventually |a, X"|< 1 and then 


|a,X"| = |a,X"| |x/X |" < |x/X|". 
Our series La,x" is absolutely convergent by comparison with the geometric series 
for |x/X|, which converges since |x/X| < 1. 
EXAMPLE 1 The series 2nx"/4" has radius of convergence r= 4. 


The ratio test and root test are best for power series. The ratios between terms 
approach x/4 (and so does the nth root of nx"/4”): 


(n+ oe xn 


approaches L = 


pl * 


qnti 4" 4 


The ratio test gives convergence if L < 1, which means |x| < 4. 
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3 x5 


EXAMPLE 2 The sine series x — = + a has r = 00 (it converges everywhere). 


The ratio of x"*?/(n + 2)! to x"/n! is x?/(n + 2)(n + 1). This approaches L = 0. 


EXAMPLE 3 The series £(x — 5)"/n? has radius r= 1 around its basepoint a= 5. 


The ratios between terms approach L = x — 5. (The fractions n?/(n + 1)? go toward 
1.) There is absolute convergence if |x — 5| < 1. This is the interval 4 < x < 6, symmet- 
ric around the basepoint. This series happens to converge at the endpoints 4 and 6, 
because of the factor 1/n?. That factor decides the delicate question—convergence at 
the endpoints—but all powers of n give the same interval of convergence 4 < x < 6. 


CONVERGENCE TO THE FUNCTION: REMAINDER TERM AND RADIUS r 


Remember that a Taylor series starts with a function f(x). The derivatives at the 
basepoint produce the series. Suppose the series converges: Does it converge to 
the function? This is a question about the remainder R,,(x) = f(x) — s,(x), which is the 
difference between f and the partial sum s, = dag + *** + a,(x — a)". The remainder R,, 
is the error if we stop the series, ending with the nth derivative term a,(x — a)". 


10N Suppose fhas an (n+ 1)st derivative from the basepoint a out to x. Then 
for some point c in between (position not known) the remainder at x equals 


R, (x) = f(x) — 5p(x) =f" Mc)(x — a)"**/(n + 1)! (2) 


The error in stopping at the nth derivative is controlled by the (n + 1)st derivative. 


You will guess, correctly, that the unknown point c comes from the Mean Value 
Theorem. For n= 1 the proof is at the end of Section 3.8. That was the error e(x) in 
linear approximation: 


Ry (x) = f(x) — fla) — f'(a)(x — a) = 3 f"(e)(x — a)’. 


For every n, the proof compares R,, with (x — a)"**. Their (n + 1)st derivatives are 
f"*” and (n+ 1)! The generalized Mean Value Theorem says that the ratio of R, to 
(x — a)"*' equals the ratio of those derivatives, at the right point c. That is equation 
(2). The details can stay in Section 3.8 and Problem 23, because the main point is 
what we want. The error is exactly like the next term a,, (x — a)"*', except that the 
(n + 1)st derivative is at c instead of the basepoint a. 


EXAMPLE 4 When fis e*, the (n + 1)st derivative is e*. Therefore the error is 


n n+1 

R=e—(14x+~ +2) oer 5 (3) 
At x=1 and n=2, the error is e—(1+1+4).218. The right side is e°/6. The 
unknown point is c = In (.218+6)=.27. Thus c lies between the basepoint a = 0 and 
the error point x= 1, as required. The series converges to the function, because 
R, > 0. 

In practice, n is the number of derivatives to be calculated. We may aim for an 
error |R,| below 10~°. Unfortunately, the high derivative in formula (2) is awkward 
to estimate (except for e*). And high derivatives in formula (1) are difficult to compute. 
Most real calculations use only a few terms of a Taylor series. For more accuracy we 
move the basepoint closer, or switch to another series. 
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There is a direct connection between the function and the convergence radius r, 
A hint came for f{x)= 1/(1 — x). The function blows up at x = 1—-which also ends 
the convergence interval for the series. Another hint comes for f= 1/x, if we expand 
around x= a= 1: 


1 1 
~ = ——___-=1+(f-x)+(l-xP te. 4 
a par (es nee me a a (4) 
This geometric series converges for |! — x|< 1. Convergence stops at the end point 
x = 0—exactly where 1/x blows up. The failure of the function stops the convergence 
of the series. But note that 1/(1 + x2), which never seems to fail, also has convergence 
radius r= |: 


1/1 + x7} = 1 — x? + x*—x®+ --- converges only for |x| <1. 


When you see the reason, you will know why r is a “radius.”’ There is a circle, and 
the function fails at the edge of the circle. The circle contains complex numbers as 
well as real numbers. The imaginary points i and —i are at the edge of the circle. 
The function fails at those points because 1/(1 + i”) = 00. 

Complex numbers are pulling the strings, out of sight. The circle of convergence 
reaches out to the nearest “singularity” of f(x), real or imaginary or complex. For 
1(1 + x”), the singularities at i and —i make r= 1. If we expand around a= 3, the 


distance to i and —i is r= ./10. If we change to In(1+ x), which blows up at 
x= —1, the radius of convergence of x — 4x?+4x?—->isr=1. 
11 + i2) = 00 


InO and 07 atx =-1 


a=0 


In(] +x) and (4 +4)” 


Ct +.x2} = 00 also at —i 


Ag. 10.6 Convergence radius r is distance from basepoint a to nearest singularity. 


THE BINOMIAL SERIES 


We close this chapter with one more series. It is the Taylor series for (1 + x)’, around 
the basepoint x = 0. A typical power is p= 4, where we want the terms in 


Jl+x=1lt4xtax?te. 


The slow way is to square both sides, which gives 1 + x + (2a, + 4)x” on the right. 
Since 1+ x is on the left, a, = — 4 is needed to remove the x* term. Eventually a, 
can be found. The fast way is to match the derivatives of f= {1 + x}!/?: 


Shey fP-OehUt yr? f= OCH D+ YP. 
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At x = 0 those derivatives are $, — 4, 3. Dividing by 1!, 2!, 3! gives 


ae eG NN IN2 ills: 6 


These are the binomial coefficients when the power is p= 4. 

Notice the difference from the binomials in Chapter 2. For those, the power p was 
a positive integer. The series (1 + x)? = 1+ 2x + x? stopped at x”. The coefficients for 
p= 2 were 1, 2, 1,0, 0,0, .... For fractional p or negative p those later coefficients are 
not zero, and we find them from the derivatives of (1 + x): 

(l+x¥ pilt+xyP > pip—tlt+xP7? f=p(p—1)--(p— nt YU +xP". 
Dividing by 0!, t!, 2!, ..., n! at x =0, the binomial coefficients are 
pp-1) £0) _ plp—1)-(p—n+1) 

2 n! n! : 
For p=n that last binomial coefficient is n!/n! = 1. It gives the final x” at the end of 
(1 + x)’. For other values of p, the binomial series never stops. It converges for |x| < 1: 


© Ap—1)(p—nth) , 
pm n! se 


i p (5) 


(14x = 14 prt P24 = (6) 
When p= 1, 2,3, ... the binomial coefficient p!/n'(n — p)! counts the number of ways 
to select a group of n friends out of a group of p friends. If you have 20 friends, you 
can choose 2 of them in (20)(19)/2 = 190 ways. 


Suppose p is not a positive integer. What goes wrong with (1+ x}, to stop the 
convergence at |x| = 1? The failure is at x = —1. If p is negative, (1+ x} blows up. 
If p is positive, as in ./1+ x, the higher derivatives blow up. Only for a positive 
integer p = n does the convergence radius move out to r= oo. In that case the series 
for (1 + x)" stops at x”, and f never fails. 


A power series is a function in a new form. It is not a simple form, but sometimes 
it is the only form. To compute f we have to sum the series. To square f we have to 
multiply series. But the operations of calculus—derivative and integral—are easier. 
That explains why power series help to solve differential equations, which are a rich 
source of new functions. (Numerically the series are not always so good.) I should 
have said that the derivative and integral are easy for each separate term a,x"—and 
fortunately the convergence radius of the whole series is not changed. 


If {(x) = La,x" has convergence radius r, so do its derivative and its integral: 
df/dx=Zna,x""* and f f(x)dx=Za,x"**/{n + 1) also converge for |x| <r. 


EXAMPLES The series for 1/(1— x) and its derivative 1/(1 — x)? and its integral 
— In{1 — x) ali have r = 1 (because they all have trouble at x = 1}. The series are 2x" 
and Lrx"~! and Dx***/(n + 1). 


EXAMPLE 4 We can integrate e*’ (previously impossible) by integrating every term 
in its series: 


2 1 x 1 x5 I x? 
= + y24 yt 4s = — + —|[ — | + —[ — ree, 
fe dx {(: x a =F ) ax xt 3 +3(5) ae + 


This always converges (r = 0). The derivative of e*’ was never a problem. 
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Read-through questions 


If |x|< |X} and Za,X" converges, then the series La,x" also 
a__. There is convergence ina __b ___ interval around the 
¢__. For £(2x)" the convergence radius isr=_d__. For 


Zx*/n! the radius is r=_e@ _. For £(x — 3)’ there is con- 
vergence for |x —3|<__f__. Then x is between _g and 
h 


Starting with f(x), its Taylor series 2a,x" has a,=__|__. 
With basepoint a, the coefficient of (x—a)" is _ ji. The 
error after the x" term is called the _k__&,(x}. It is equal to 

} where the unknown point c ts between _m__. Thus 


the error is controlled by the _n___ derivative. 


The circle of convergence reaches out to the first point 
where f(x) fails. For f=4(2—x), that point is x=_o_. 
Around the basepoint a = 5, the convergence radius would be 
y=_p_. For sin x and cos x the radius is r=__a 


The series for ./f+x is the _ Fr 


series with p=4. Its 
coefficients are a,=__ 8 __. Its convergence radius is _ ¢ 


Its square is the very short series 1 + x. 


In 1-6 find the Taylor series for /(x) around x = 0 and its radius 
of convergence r. At what point does f(x) blow up? 


1 f(x) = 1/1 — 4x) 2 f(x) = 1 ~ 4x?) 
3 f(x)=e'* 4 f(x)=tan x (through x°*) 
5 f(x) =In(e +x) 6 f(x) = 1/1 + 4x?) 


Find the interval of convergence and the function in 7-10. 


7 fx) 2s (? 1 ‘) 


8 f(x) = 5 mx —a"? 


0) i) 


1 
n+l 


(x -—ay*} 


x—2ay 
10 f(x) = (x — 2x) Rt + 
11 Write down the Taylor series for (e* — 1)/x, based on the 
sertes for e*. At x =0 the function is 0/0. Evaluate the series 
at x =0. Check by l’H6pital’s Rule on (e* — 1)/x. 


12 Write down the Taylor series for xe* around x = 0. Inte- 
grate and substitute x = 1 to find the sum of 1/n!{n + 2). 


13 If f(x) is an even function, so f(—x) =f(x), what can you 
say about tts Taylor coefficients in f= £a,x”? 


14 Puzzle out the sums of the following series: 


(a}xtx?—xt4xt4 xo xh 4h 
xt x8 
(lta tat 


(c} (x — 1) —H(x—1P + Xx - 18 — + 


15 From the series for (1 —cos x)/x? find the limit as x +0 
faster than ]’H6pital’s rule. 


16 Construct a power series that converges for 0< x < 2. 


17-24 are about remainders and 25—36 are about binomials. 


17 If the cosine series stops before x°/8! show from (2) that 
the remainder R; is less than x°/8t Does this also follow 
because the series 1s alternating? 


18 If the sine series around x = 2x stops after the terms in 
problem 10, estimate the remainder from equation (2). 


19 Estimate by (2) the remainder R, = x"*' + x74? 4-- in 
the geometric series, Then compute R,, exactly and find the 
unknown point c for n=2 and x =4. 


20 For —In(1 — x) =x +4x?44x74 Ra, use equation (2) to 
show that R,; <} atx =}. 

21 Find R, in Problem 20 and show that the sertes converges 
to the function at x = } (prove that R, — 0). 


22 By estimating R, prove that the Taylor series for e* around 
x = 1 converges to e* as m > 06. 


23 (Proof of the remainder formula when n = 2) 
(a) At x =a find R2, R2, Rz, Ry”. 
(b) At x =a evaluate g(x) = {x —a)* and gg", e”. 
Ri(x) — Rola) _ Rakes) 
ax)— gta} g'{ey) 
Ric.) — Ra) _ Riker) 
&(e)—g'(a}—g"(e2) 
i{c2) — Rua) _ Ry’) 
g'(e2)—9"(a} g"(c)} 
(e) Combine (a-b-c-d) into the remainder formnla (2). 


9 


(c) What rule gives 


(d) In and 


where are c, and ¢c, and c? 


24 All derivatives of f(x) =e” 1° are zero at x = 0, including 
f(0)=0. What is f(.1}? What is the Taylor series around 
x = 0? What is the radius of convergence? Where does the 
sertes converge to f(x)? For x=! and m=I1 what is the 
remainder estimate in (2)? 


25 (a) Find the first three terms in the binomial series for 
til —x?. 
(b) Integrate to find the first three terms in the Taylor 


series for sin™'x. 


26 Show that the binomial coefficients in 1/./1—x = Za,x" 
are a, = 1°3+°5-+-(2n— 1)/2"n! 

27 For p= —1 and p= — 2 find nice formulas for the bino- 
mnial coefficients. 


28 Change the dummy variable and add lower limits to make 
LP nx"! = L” (n+ 1)x", 
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29 In (1 ~x) | =I x" the coefficient of x" is the number of 
groups of n friends that can be formed from | friend (not 
binomia! - repetition is allowed!) The coefficient is 1 and 
there ts only one group—the same friend n times. 
(a) Describe all groups of n friends that can be formed 
from 2 friends. (There are n+ | groups.) 
(b) How many groups of 5 friends can be formed from 3 
friends? 


30 (a) What is the coefficient of x“ when 1+x+x7+4+-> 
multiplies 1 + x + x? 4+ °:-? Write the first three terms. 
(b) What is the coefficient of x? in (£x*)*? 


31 Show that the binomial series for ./1+4x has integer 


coefficients. (Note that x” changes to (4x)". These coefficients 
are important in counting trees, paths, parentheses...) 


32 In the series for 1/,/1 — 4x, show that the coefficient of x* 
is (2n)! divided by (a!)?. 

Use the binomial series to compute 33—36 with error less than 
1/1000. 
33 (15)!"* 


35 (1.4) 


44 (1001)'8 
46 el looo 
37 From sec x = 1/[{1 —(t —cos x)] find the Taylor series of 


sec x up to x®. What is the radius of convergence r (distance 
to blowup point)? 


38 From sec?x = 1/[1 —sin?x] find the Taylor series up to 
x*, Check hy squaring the secant series tn Problem 37, Check 
by differentiating the tangent serics in Problem 39. 


39 (Division of series) Find tan x by long division of sin x/ 
COS x: 


40 (Composition of series) If f=ay+a,x +az,x*4+--° and 
g=bxtb.x?+-~ find the t,x, x? coefficients of f(g{x)). 
Test on f= 1/0. + x), g=x/(1 ~ x), with f(g(x)) = 1 —~x. 


41 (Multiplication of series) From the series for sin x and 
1/1 — x) find the first four terms for f= sin x/(1 — x). 


42 (Inversion of series) If f= a,x + a)x* + -- find coefficients 
bi, b, in g=b;x+b,x72+°-- so that f(g(x)) =x. Compute 
b,,6,forf=e*—1,g=f '=In{1 +). 


43 From the multiplication (sin x)(sin x) or the derivatives of 
f(x) =sin?x find the first three terms of the series, Find the 
first four terms for cos*x by an easy trick. 


44 Somehow find the first six nonzero terms for f=(1 — x) 
(l—x3). 


45 Find four terms of the series for 1/./1 —x. Then square 
the series to reach a geometric series. 


46 Compute §} e-“'dx to 3 decimals by integrating the 
power series. 


47 Compute {j sin*s dz to 4 decimals by power series. 


48 Show that Ex"/n converges at x = —1, even though its 
derivative Dx"! diverges. How can they have the same 
convergence radius? 


49 Compute lim (sin x — tan x)/x> from the series, 
a ** 


50 If the nth root of a, approaches L > 0, explain why La,x" 
has convergence radius r = 1/L. 


51 Find the convergence radius r around basepoints a=0 
and a=! from the blowup points of (1 + tan x)/() + x”). 
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CHAPTER 14 


Vectors and Matrices 


This chapter opens up a new part of calculus. It is multidimensional calculus, because 
the subject moves into more dimensions, In the first ten chapters, all functions 
depended on time t or position x—but not both. We had f(¢) or (x). The graphs 
were curves in a plane. There was one independent variable (x or t) and one dependent 
variable (y or f). Now we meet functions f(x, t) that depend on both x and ¢. Their 
graphs are surfaces instead of curves. This brings us to the calculus of several variables. 

Start with the surface that represents the function f(x, f) or f(x, y) or f(x, y,.t). I 
emphasize functions, because that is what calculus is about, 


EXAMPLE 1 f(x, t}=cos (x — ¢} is a traveling wave (cosine curve in motion). 


At t=0 the curve is f=cosx. At a later time, the curve moves to the right 
(Figure 11.1). At each t we get a cross-section of the whole x-t surface. For a wave 
traveling along a string, the height depends on position as well as time. 

A similar function gives a wave going around a stadium. Each person stands up 
and sits down. Somehow the wave travels. 


EXAMPLE 2 f(x, y)}=3x+ y+ 1 is a sloping roof (fixed in time). 


The surface is two-dimensional—you can walk around on it. It is flat because 
3x+y+1 is a linear function. In the y direction the surface goes up at 45°. If y 
increases by 1, so does f. That slope is 1. In the x direction the roof is steeper (slope 3). 
There is a direction in betwcen where the roof is steepest (slope ./ 10). 


EXAMPLE 3 f(x, y, t)= cos(x — y — ¢) is an ocean surface with traveling waves. 


This surface moves. At each time t we have a new x-y surface. There are three 
vaniables, x and y for position and t for time. I can’t draw the function, it needs four 
dimensions! The base coordinates are x, y, t and the height is {| The alternative is a 
movie that shows the x-p surface changing with ¢. 

At time ¢=0 the ocean surface is given by cos (x — y). The waves are in straight 
lines. The line x — y = 0 follows a crest because cos 0 = 1. The top of the next wave 
is on the parailel line x — y= 2n, because cos 2x = 1. Figure 11.1 shows the ocean 
surface at a fixed time. 

The line x — y = ¢ gives the crest at time t. The water goes up and down (like people 
in a stadium). The wave goes to shore, but the water stays in the ocean. 
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Slope AN 


Slope 10 


Fig. 14.4 Moving cosine with a small optical illusion—the darker Fig. 44.2 Linear functions give planes. 
bands seem to go from top to bottom as you turn. 


Of course multidimensional calculus is not only for waves. In business, demand is 
a function of price and date. In engineering, the velocity and temperature depend on 
Position x and time ¢. Biology deals with many variables at once (and statistics is 
always looking for linear relations like z = x + 2y). A serious job lies ahead, to carry 
derivatives and integrals into more dimensions. 


Ee §=§=§=6§114.4 Vectors and Dot Products | 


In a plane, every point is described by two numbers. We measure across by x and 
up by y. Starting from the origin we reach the point with coordinates (x, y). I want 
to describe this movement by a vector—the straight line that starts at (0, 0) and cnds 
at (x, p). This vector v has a direction, which goes from (0, 0) to (x, y) and not the 
other way. 

In a picture, the vector is shown by an arrow. In algebra, ¥ is given by its two 
components. For a column vector, write x above y: 


Y= (x and } are the components of ¥). (1) 


Note that v is printed in boldface; its components x and y are in lightface.t The 
vector —¥ in the opposite direction changes signs. Adding ¥ to —v gives the zero 
vector (diflerent from the zero number and also in boldface): 


—x x—-X 0 


-y= and y-v= = =0. (2) 
ae roy 0 


Notice how vector addition or subtraction is done separately on the x’s and y's: 
3 —1 2 


yvtw= - =| |. 3 
1 2 3 " 


tAnother way to indicate a vector is © You will recognize vectors without needing arrows. 
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Ag. 14.3 Parallelogram for v + w, stretching for 2v, signs reversed for —v. 


The vector ¥ has components v, = 3 and v, = 1. {I write v, for the first component 
and v, for the second component. I also write x and y, which is fine for two com- 
ponents.) The vector w has w, = — | and w, =2. To add the vectors, add the com- 
ponents. Te draw this addition, place the start of w at the end of y. Figure 11.3 shows 
how w starts where v ends. 


VECTORS WITHOUT COORDINATES 


In that head-to-tail addition of v + w, we did something new. The vector w was maved 
away from the origin. Its length and direction were not changed! The new arrow is 
parallel to the old arrow—only the starting point is different. The vector is the same 
as before. 

A vector can be defined without an origin and without x and y axes. The purpose 
of axes is to give the components—the separate distances x and ». Those numbers 
are necessary for calculations. But x and y coordinates are not necessary for head- 
to-tail addition y+ w, or for stretching to 2v, or for linear combinations 2v + 3w. 
Some applications depend on coordinates, others don’t. 

Generally speaking, physics works without axes—it is “coordinate-free.” A velocity 
has direction and magnitude, but it is not tied to a paint. A force also has direction 
and magnitude, but it can act anywhere—not only at the origin. In contrast, a vector 
that gives the prices of five stocks is not floating in space. Each component has a 
meaning—there are five axes, and we know when prices are zero. After examples 
from geometry and physics (no axes), we return to vectors with coordinates. 


EXAMPLE 1 (Geometry) Take any four-sided figure in spacc. Connect the midpoints 
of the four straight sides. Remarkable fact: Those four midpoints lie in the same plane. 
More than that, they form a parallelogram. 


Frankly, this is amazing. Figure 11.4a cannot do justice to the problem, because it 
is printed on a flat page. Imagine the vectors A and D coming upward. B and C go 
down at different angles. Notice how easily we indicate the tour sides as vectors, not 
caring about axes or origin. 

I wiil prove that V= W. That shows that the midpoints form a parallelogram. 

What is ¥? It starts halfway along A and ends halfway along B. The smal! triangle 
at the bottom shows V = 4A + 4B. This is vector addition—the tail of 4B is at the 
head of $A. Together they equal the shortcut Y. For the same reason W=43C + 4D. 
The heart of the proof is to see these relationships. 

One step is left. Why is }A +4B equal to $C + 4D? In other words, why is A+ B 
equal to C + D? (I multiplied by 2.) When the right question is asked, the answer 
jumps out. A head-to-tail addition A + B brings us to the point R. Adso C + D brings 
us to R. The proof comes down to one line: 


A+B=PR=C+D. Then V=4A+ 4B equals W=$C+3D. 
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Fig. 44.4 Four midpoints form a parallelogram (¥ = W). Three medians mect at P. 


EXAMPLE 2 (Also geometry) In any triangle. draw lines from the corners to the 
midpoints of the opposite sides. To prove by vectors: Those three lines meet at a point. 
Problem 38 finds the meeting point in Figure 11.4c. Problem 37 says that the three 
vectors add to zero. 


EXAMPLE 3. (Mcdicine) An electrocardtogram shows the sum of many small vectors, 
the voltages in the wall of the heart. What happens to this sum—the Aeart vector 
YV—in two cases that a cardiologist is watching for? 


Case 1. Part of the heart is dead (infarction). 
Case 2. Part of the heart is abnormally thick (Aypertroepay). 


A heart attack kills part of the muscle. A defective valve. or hypertension, overworks 
it. In case 1 the cells die from the cutoff of blood (loss of oxygen), In case 2 the heart 
wail can triple in size, from excess pressure. The causes can be chemical or mechanical. 
The effect we see is electrical. 

The machine is adding small vectors and ‘‘projecting”’ them in twelve directions. The 
leads on the arms, left leg, and chest give twelve directions in the body. Each graph 
shows the component of V in one of those directions. Three of the projections 
two in the vertical plane, plus lead 2 for front-back — produce the ‘‘mean QRS vector” 
in Figure 11.5. That is the sum V when the ventricles start to contract. The left 
ventricle is larger. so the heart vcctor normally points down and to the left. 


Pacemy 
SA 


Fig. 44.5 ¥ is a sum of small voltage vectors, at the moment of depolarization. 
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Enlarged 
Left 
Ventricle 


Fig. 14.6 Changes in V show dead muscle and overworked muscle. 


We come soon to projections, but here the question is about ¥ itself. How does 
the ECG identify the problem? 


Case 1: Heart attack The dead cells make no contribution to the electri- 
cal potential. Some small vectors are missing. Therefore the sum V 
turns away from the infarcted part. 

Case 2: Hypertrophy The overwork increases the contribution to the 
potential. Some vectors are larger than normal. Therefore V turns 
toward the thickened part. 


When ¥ points in an abnormal direction, the ECG graphs locate the problem. The 
P, Q, R, S, T waves on separate graphs can ail indicate hypertrophy, in different 
regions of the heart. Infarctions generally occur in the left ventricle, which needs the 
greatest blood supply. When the supply of oxygen is cut back, that ventricle feels it 
first. The resuit can be a heart attack (= myocardial infarction = coronary occlusion). 
Section 11.2 shows how the projections on the ECG point to the location. 

First come the basic facts about vectors—components, lengths, and dot products. 


COORDINATE VECTORS AND LENGTH 
To compute with vectors we need axes and coordinates. The picture of the heart is 


“coordinate-free.”” but calculations require numbers. A vector is known by its compo- 
nents. The unit vectors along the axes are i and jin the plane and i, j, k in space: 


! 0 0 

I 0 
in 2D: i= j= in3D: i=|0],j=| 1 [.k=] 0 
0 0 1 


Notice how easily we moved into three dimensions! The only change is that vectors 
have three components. The combinations of i and j (or i, j, k) produce all vectors y 
in the plane (and all vectors V in space): 


1 

3 
reise] | Veit 4j-2k= 2 
=2 
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Those vectors are also written v = (3, 1)andV = (1, 2, — 2). The components of the vector 
are also the coordinates of a point. (The vector goes from the origin to the point.) This 
relation between point and vector is so close that we allow them the same notation: 
P =(x, y, z)and v = (x, y, z)= xi+ yj + zk. 

The sum v + V is totally meaningless. Those vectors live in different dimensions. 

From the components we find the /ength. The length of (3, 1) is \/3? + 17= /10. 
This comes directly from a right triangle. In three dimensions, V has a third com- 
ponent to be squared and added. The length of V = (x, y, z) is |V| = ./x? + y? + 2”. 

Vertical bars indicate length, which takes the place of absolute value. The length 
of y = 3i+j is the distance from the point (0, 0) to the point (3, 1): 


W=/o3+03= 10  [V|=./1?7 +2? +2)? =3. 


A unit vector is a vector of length one. Dividing v and V by their lengths produces 
unit vectors in the same directions: 


1/3 
3/,/10 
ae / d ae 2/3 are unit vectors. 
lvl} 1/./10 IV| 
— 2/3 


A unit vector in the plane is determined by its angle @ with the x axis: 
cos @ ; ; ; ; 
u=| . =(cos 6)i+(sin @)j is a unit vector: |u|? = cos?@ + sin?@ = 1. 
n 


In 3-space the components of a unit vector are its “direction cosines”: 
U = (cos «i+ (cos B)j + (cos y)k: a, B, y= angles with x, y, z axes. 


Then cos?a + cos?f + cos*y = 1. We are doing algebra with numbers while we are 
doing geometry with vectors. It was the great contribution of Descartes to see how 
to study algebra and geometry at the same time. 


Fig. 14.7 Coordinate vectors i, j, k. Perpendicular vectors v+ w = (6)(1) + (—2)(3) = 0. 


THE DOT PRODUCT OF TWO VECTORS 


There are two basic operations on vectors. First, vectors are added (v + w). Second, 
a vector is multiplied by a scalar (7v or — 2w). That leaves a natural question—how 
do you multiply two vectors? The main part of the answer is—you don’t. But there 


41.4 Vectors and Dot Products 


is an extremely important operation that begins with two vectors and produces a 
number. It is usually indicated by a dot between the vectors, as in v- w, so it is called 
the dot product. 

DEFINITION 4 The dot product multiplies the lengths \v\ times \w| times a cosine: 


v-w=|v||w| cos 0, @=angle between v and w. 
3 2 
EXAMPLE . has length 3, 5 has length J/8, the angle is 45°. 


The dot product is |v||w| cos 6 = (3)(,/8)(1/,/2), which simplifies to 6. The square 
roots in the lengths are “‘canceled”’ by square roots in the cosine. For computing v- w, 
a second and much simpler way involves no square roots in the first place. 


DEFINITION 2. The dot product y- w multiplies component by component and adds: 


3 2 
v°w=0,W, + v2W2 : = (3)(2) + (0)(2) = 6. 
0 2 


The first form |v||w| cos @ is coordinate-free. The second form v,w, + vw, computes 
with coordinates. Remark 4 explains why these two forms are equal. 


44B The dot product or scalar product or inner product of three-dimensional 
vectors is 


V+ W=|V||W| cos 0 = V, W, + V>W, + V3W3. (4) 
If the vectors are perpendicular then @ = 90° and cos 6= 0 and V- W=0. 


1 4 2 = 
2 |*| 5S | = 32 (not perpendicular) 2 {+} 2 | =0 (perpendicular). 
3 6 | 2 


These dot products 32 and 0 equal |V||W| cos @. In the second one, cos 6 must be 
zero. The angle is 2/2 or —2/2—.in either case a right angle. Fortunately the cosine 
is the same for @ and —8@, so we need not decide the sign of 0. 


Remark 1 When V = W the angle is zero but not the cosine! In this case cos 0 = 1 
and V-V=|V|?. The dot product of V with itself is the length squared: 


V-V=(V,, V2, V3)*(Vi, V2, V3) = Vi + V3 + VE=V/?. (5) 


Remark 2 The dot product of i=(1, 0, 0) with j= (0, 1, 0) is i-j =0. The axes are 
perpendicular. Similarly i-k=0 and j-k=0. Those are unit vectors: i-i=j-j= 
k-k=1. 


Remark 3 The dot product has three properties that keep the algebra simple: 
1. V-W=W:-V 2. (cV)» W=c(V°> W) 3. (U+V)>W=U-W+V-W 


When V is doubled (c = 2) the dot product is doubled. When V is split into i, j, k 
components, the dot product splits in three pieces. The same applies to W, since 
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(V,-W,)? 
Ww ae |v |? +| wl? 
(V- W) 
Pets -2(¥] |W] cos 8 
9 (V,-W3) 


Fig. 14.8 Length squared =(¥ — W)+(V — W), from coordinates and the cosine law. 


V-W=W-V. The nine dot products of i, j, k are zeros and ones, and a giant splitting 
of both ¥ and W gives back the correct V-> W: 


VY: W= Vis Wyit+ V3j° Wij+ Vik: Wk + six zeros = ViW,+ VW, + V3W3. 


Remark 4 The two forms of the dot product are equal, This comes from computing 
|¥ — W/* by coordinates and also by the “law of cosines”: 


with coordinates: |[¥ — W|? =(V, — W,)? +(V, — W.)? +(V; — W;) 
from cosine law: |¥V — W/|? = |¥|? + |W? — 2/¥| |W] cos 8. 


Compare those two lines. Line 1 contains V? and V3 and V3. Their sum matches 
|¥|? in the cosine Law. Also W? + W2 + W% matches |W|?. Therefore the terms contain- 
ing —2 are the same (you can mentally cancel the — 2). The definitions agree: 


— 2(V,W, + V,W, + V3W3) equals —2|¥||W| cos 6 equals —2V-W. 


The cosine law is coordinate-free. [t applies to all triangles (even in n dimensions). 
Its vector form in Figure 11.8 is |¥ — W|? = |¥|? — 2V: W + |W|’. This application to 
V-W is its brief moment of glory. 


Remark 5 The dot product is the best way to compute the cosine of @: 
y- 
cos 9 = ——.. (6) 


Here are examples of V and W with a range of angles from 0 to x: 


i and 3i have the same direction cos 6= 1 6=0 

i+ (i+ j}= 1 is positive cos 6= t//2 8=n/4 
i and j are perpendicular: i-j=90 cos 0=0 6= 2/2 
i-(-i+j)= — 1 is negative cos0=—1/,/2 0=3n/4 
i and — 3i have opposite directions cos @= —1 d=n 


Remark 6 The Cauchy-Schwarz inequality |V -W| < |V¥||W| comes from |cos 6| < 1. 


The left side is |¥||W||cos 4]. It never exceeds the right side |¥!|W|. This is a key 
inequality in mathematics, from which so many others follow: 

Geometric mean ./xy < arithmetic mean 4(x+ y) {true for any x20 and y20). 
Triangle inequality |\¥ + W\ <|¥|+ |WI (|¥I, WI, {¥ + W| are lengths of sides). 


These and other examples are in Problems 39 to 44. The Schwarz inequality 
|¥ > W| <|¥||W| becomes an equality when |cos &| = 1 and the vectors are 
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44.4 EXERCISES 


Read-through questions 


A vector has length and _a__. If ¥ has components 6 and 
—8, its length is |yj/=__b and its direction vector is u= 

¢__. The product of [¥| with uis__d___. This vector goes 
from (0, 0)to the pointx=_@ »y=_ ft . Acombination 
of the coordinate vectorsi=_g  andj=_h __ produces 
y=_ 1 i+_J)_j 


To add vectors we add their_k__. The sum of (6, —8) and 
(1, O}is_4__. To see ¥ +i geometrically, put the m__ ofi 
atthe © of y. The vectors form a_° with diagonal 
¥+i. (The other diagonal is __e _.) The vectors 2¥ and —¥ 
. Their lengths are__3 and _ 


are a ander 


In a space without axes and coordinates, the tail of ¥ can 
be placed _u__. Two vectors with the same_v___ are the 
same. If a triangle starts with V and continues with W, the 
third side is__w__. The vector connecting the midpoint of V 
to the midpoint of Wis __x__. That vectoris__y the third 


side. In this coordinate-free form the dot product is V- W= 
z 


Using components, V-W=_A and (1, 2, 1)- 
(2, —3, 7)=__8__. The vectors are perpendicular if _¢ . 
The vectors are parallel if_D .V-Visthesameas_ 
The dot product of U+ ¥ with W equals F__. The angle 
between V and W hascos?=_ @__. When V -W is negative 
then Otis _H__. The angle betweeni+jandi+kis_ | 
The Cauchy-Schwarz inequality is J 


, and for V=i+j 
and W=i+k it becomes 1<_K 


Ia 1-4 compute V + W and 2V — 3W and |¥|? and V-W and 
cos @. 


1 V=(1,1, 0, W=(-1, -1, —) 
2 V=i+j, W=j—-k 

3 V=i-+k, W=i+j—2k 
4 V=(1, t, 1, ), W=(, 2, 3, 4) 


§ (a} Find a vector that is perpendicular to (p,, v2). 
(b) Find two vectors that are perpendicular to (v,, v3, v3). 


6 Find two vectors that are perpendicular to (f, 1, 0) and to 
each other. 


7 What vector ts perpendicular to all 2-dimensional vectors? 
What vector ts parallel to all 3-dimensional vectors? 


8 In Problems 1-4 construct unit vectors in the same direc- 
tion as V. 


9 If v and w are unit vectors, what is the geometrical mean- 
ing of vw? What is the geometrical meaning of {¥ - w)v? Draw 
a figure with ¥ =i and w =(3/5)i + (4/5)}j. 


10 Write down all unit vectors that make an angle @ with the 
vector (1, 0). Write down aif vectors at that angle, 


11 True or faise in three dimensions: 
1. If both U and ¥ make a 30° angle with W, so does 
U+Y. 
2. If they make a 90° angle with W, so does U + ¥. 
3. If they make a 90° angle with W they are perpendicular: 
U-V¥=0. 
12 From W =(1, 2, 3} subtract a multiple of V =(1, 1, [) so 
that W—c¥ is perpendicular to ¥. Draw V and W and 
W—-cV. 


13 (a) What is the sum Y of the twelve vectors from the center 
of a clock to the hours? 
(b) If the 4 o’clock vector is removed, find ¥ for the other 
eleven vectors. 
(c) If the vectors to 1, 2,3 are cut in half, find V for the 
twelve vectors. 


14 (a) By removing one or more of the twelve clock vectors, 
make the iength |¥| as large as possible. 
(6) Suppose the vectors start from the top instead of the 
center (the origin is moved to 12 o'clock, so ¥,2 = 0). What 
is the new sum ¥*? 


15 Find the angle POQ by vector methods if P =(1, 1, 0), 
0 =(0, 0, 0,0 =(1, 2, —2). 


16 (a) Draw the unit vectors u, =(cos 8, sin @) and u,= 
(cos ¢, sind). By dot products find the formula for 
cos (6 — 9). 

(b) Draw the unit vector uy from a 90° rotation of u,. By 
dot products find the formula for sin (6 + @). 


17 Describe all points (x, y) such that v = xi+ yj satisfies 
{a) |¥| =2 (b) |v —i] =2 
(c) ¥*i=2 (d) ¥-i=|y| 
18 (Important) If A and B are non-parallel vectors from the 
origin, describe 
(a) the endpoints of ¢B for all numbers ¢ 
(b) the endpoints of A + 1B for ail ¢ 
(c) the endpoints of sA + /B for all s and t 
(d) the vectors v that satisfy y“A = ¥°B 


19 (a) If ¥+ 2w =i and 2¥ + 3w =j find ¥ and w, 
(b) If v=i+j and w=3i1+4j then i= v+ 
w. 
20 If P=(0, 0) and R =(0, 1) choose @ so the angle POR is 
90°. All possible Q’s lie in a : 


21 (a) Choose d so that A=2i+3j is perpendicular to 
B= 9i+ dj. 
(b) Find a vector C perpendicular to A=i+j+k and 
B=i-k. 
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22 if a boat has velocity V with respect to the water and the 
water has velocity W with respect to the land, then 
The speed of the boat is not |¥| + |W} hut 


23 Find the angle between the diagonal of cube and (a) an 
edge (b) the diagonal of a face (c} another diagonal of the 
cube. Choose lines that meet. 


24 Draw the triangle PQR in Example 1 (the four-sided figure 
in space). By geometry not vectors, show that PR is twice as 
long as ¥. Similarly |PR|=2|W|. Also V is parallel to W 
because both are parallel to . So V = W as before. 


28 (a) If A and B are unit vectors, show that they make equal 
angles with A + B. 
(b) If A, B, C are unit vectors with A+ B+C=0, they 
form a triangle and the angle between any two 
is 

26 {a) Find perpendicular unit vectors | and J in the plane 
that are different from i and j. 


{b) Find perpendicular unit vectors I, J, K different from 
i, j,k. 


27 If Land J are perpendicular, take their dot products with 
A =al + 6J to find a and 6, 


28 Suppose I = (i+j/,/2 and J =(i—§)/,/2, Check 1-3 =0 
and write A = 2i-+ 3j as a combination al + bJ. (Best method: 
use a and b from Problem 27. Alternative: Find i and j from 
I and J and substitute into A.) 


29 (a) Find the position yector OP and the velocity vector 
PQ when the point P moves around the unit circle (see figure) 
with speed 1. (b} Change to speed 2. 


3 The sum (A-i)? +(A-j)? +(A>k)? equals 


31 In the semicircle find € and D in terms of A and B. Prove 
that C-D=0 (they meet at right angles). 


AN 


32 The diagonal PR has (PR? ==(A+B)-(A+B)=A'A+ 
A‘B+B-A+B-°B. Add |QS|* from the other diagonal to 
prove the parallelogram law: |PR|? + |QS|? = sum of squares 
of the four side lengths. 


33 If (1, 2, 3), (3, 4, 7), and (2, 1, 2) are corners of a parallelo- 
gram, find all possible fourth corners. 


34 The diagonals of the parallelogram are A+B and 
. If they have the same length, prove that A‘ B=0 
and the region is a : 


35 The vector from the earth's center to Seattle is ai + bj + ck. 
(a) Along the circle at the latitude of Seattle, what two 
functions of a, b, c stay constant? k goes to the North Pole. 
(b}On the circle at the longitude of Seattle—the 
meridian—what two functions of a, b, c stay constant? 
(c) Extra credit: Estimate a, 5, c in your present position. 
The 0° meridian through Greenwich has 5 = 0. 


36 If|A +B]? =|Aj? + {B}’, prove that A is perpendicular to B. 


37 In Figure 11.4, the medians go from the corners to the 
midpoints of the opposite sides. Express M,, M2, M3 in terms 
of A,B,C. Prove that M, +M,+M,=0, What relation 
holds between A, B, C? 


38 The point ¢ of the way along is the same for all three 
medians. This means that A + $M; = 4M, = . Prove 
that those three vectors are equal. 


39 (a) Verify the Schwarz inequality |¥ > W| < |V||Wi for V = 
i+ 2j+ 2k and W = 21+ 2j)+k. 
(b) What does the inequality become when V = (/x, Jy) 
and W =(\/y, \/x)? 


40 By choosing the nght vector W in the Schwarz inequality, 
show that (V, + V, + V3)? < 3(V? + V3 + V3}. What is W? 


41 The Schwarz inequality for ai+ dj and ci+ dj says that 
(a? + b7)(c? + d?} > (ac + bd)’. Multiply out to show that the 
difference is >0. 


42 The vectors A, B, C form a triangle if A+B+C=0. The 

triangle inequality |A + Bj <|A| + |B] says that any one side 

length is less than . The proof comes from Schwarz: 
|A+B?=A-A+2A-B+B°B 


<A)? + + |B? = (Al + BI)’. 


43 True or false, with reason or example: 
(a) [¥ + W/? is never larger than |¥|? + |W]? 
(b) In a real triangle {V + W| never equals |¥| + |W 
(c) V- W equals W-V 
(d) The vectors perpendicular to i+ j+k lie along a line. 


44 If V=i+2k choose W so that V-W=|¥||W| and 
|¥ + W| =|¥| + |W. 
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45 A methane molecule has a carbon atom at (0, 0, 0} and 46 (a) Find a vector V at a 45° angle with i and j. 
hydrogen atoms at (4,1, —1) (1, -1,0, (1,1, 1), and (b) Find W that makes a 60° angle with i and j. 
(-1, —1, ~1). Find (c) Explain why no vector makes a 30” angle with i and j. 
(a) the distance between hydrogen atoms 
{b) the angle between vectors going out from the carbon 
atom to the hydrogen atoms. 


Re §=§86©§14.2 Planes and Projections 


The most important “curves” are straight lines. The most important functions are 
linear. Those sentences take us back to the beginning of the book—the graph of 
mx + b is a line. The goal now is to move into three dimensions, where graphs are 
surfaces. Eventually the surfaces will be curved. But calculus starts with the flat 
surfaces that correspond to straight lines: 


What are the most important surfaces? Planes. 
What are the most important functions? Still linear. 


The geometrical idea of a plane is turned into algebra, by finding the equation of a 
Plane. Not just a general formula, but the particular equation of a particular plane. 


A line is determined by one point (xp, yo) and the slope m. The point-slope equation 
iS ¥ — ¥o = m{x — Xp). That is a linear equation, it is satisfied when y = yo and x = xX, 
and dy/dx is m. For a plane, we start again with a particular point—which is now 
(Xo, Yo. Zo). But the slope of a plane is not so simple. Many planes climb at a 45° 
angle—with “slope 1” —and more information is needed. 

The direction of a plane is described by a vector N. The vector is not in the plane, 
but perpendicular to the plane. In the plane, there are many directions. Perpendicular 
to the plane, there is only one direction. A vector in that perpendicular direction is 
a normal vector. 

The normal vector N can point “up” or “down’”’. The length of N is not crucial (we 
often make it a unit vector and call it n). Knowing N and the point Po = (x9, Yo; Zo); 
we know the plane (Figure | 1.9). For its equation we switch to algebra and use the 
dot product—which is the key to perpendicularity. 

N is described by its components (a, b,c). In other words N is ai+ bj +ck. This 
vector is perpendicular to every direction in the plane. A typical direction goes from 


N=(1,-1,3 
N=aithj+ck ( ) VAx-ye3ce] 


normal vector 


N=(1, 1, 1) 


3x -3y +92 5-15 


Fig. 44.9 The normal vector to a plane. Parallel planes have the same N. 


408 


41 Vectors and Matrices 


P, to another point P = (x, y, z)in the plane. The vector from P, to P has components 
(x — X9, ¥ — Yo. Z— Zp). This vector lies in the plane, so its dor product with N is zero: 
44C The plane through Py perpendicular to N = (a, 6, c) has the equation 
{a, b,c)*(x— Xo, Y— Yo: 2— Zp) = 9 or 


a(x — Xo) + Hy— yo) + (2 — Zp) = 0. (1) 


The point P lies on the plane when its coordinates x, y, z satisfy this equation. 


EXAMPLE 4 The plane through P, = (1, 2, 3) perpendicular to N =(1, 1, 1) has the 
equation (x — 1}+(y— 2) +{z — 3) = 0. That can be rewritten as x + y+z= 6, 


Notice three things. First, Po lies on the plane because } + 2+ 3= 6. Second, N= 
(1, 1, 1) can be recognized from the x, y, z coefficients in x + y + z= 6, Third, we could 
change N to (2, 2, 2) and we could change Py, to (8, 2, — 4)——because N is still perpen- 
dicular and Po is still in the plane: 8+ 2—4=6. 

The new normal vector N =(2, 2, 2) produces 2(x ~ 1) + 2{y — 2) + 2(z — 3) =0. 
That can be rewritten as 2x + 2y + 2z = 12. Same normal direction, same plane. 

The new point Py =(8, 2, —4) produces (x — 8)+(y~- 2)+(z+4)=0. That is 
another form of x + y + z= 6. All we require is a perpendicular N and a point Py in 
the plane. 


EXAMPLE 2 The plane through (1, 2, 4) with the same N =({1, 1, 1) has a different 
equation: {x — 1)+(y—2)+(z—4)=9. This is x+ y+z=7 (instead of 6). These 
planes with 7 and 6 are parailel. 


Starting from a(x — x9) + b(4 — yo) + c(z — zo} = 0, we often move axy + byg + CZ 
to the right hand side—and call this constant d: 


41D With the P, terms on the right side, the equation of the plane is N° P = d: 


ax+ by +cz=axg t+ byy + ez =d. (2) 
A different d gives a parallel plane; d= gives a plane through the origin. 


EXAMPLE3 The plane x — » + 3z = 0 goes through the origin (0, 0, 0). The normal 
vector is read directly from the cquation: N =(!, —1I, 3). The equation is satisfied hy 
Py =(1, 1,0) and P=(1, 4, 1). Subtraction gives a vector V = (0, 3, 1) that is in the 
plane, and N- ¥ =0. 

The parallel planes x — y + 3z =d have the same N but different d’s. These planes 
miss the origin because d is not zero (x = 0, y=0, z=0 on the left side needs d=0 
on the right side). Note that 3x — 3y+9z= — 15 is parallel to both planes. N is 
changcd to 3N in Figure 11.9, but its direction is not changed. 


EXAMPLE 4 The angle between two planes is the angle between their normal vectors. 


The planes x— ++ 3z=0 and 3y+z=0 are perpendicular, because {1, —1, 3)- 
(0,3, 1)=0. The planes z=0 and y=0 are also perpendicular, because (0, 0, 1)> 
(0, 1,0}=0. (Those are the xy plane and the xz plane.) The planes x + y= 0 and 
x+z=0 make a 60° angie, because cos 60° = (1, 1, 0)-(t, 0, 1y/./2/2 =1. 

The cosine of the angle between two planes is IN, * N2I/JN,||Nz|- See Figure 11.10. 
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normal 
direction 


(—m, 1) 


parallel 
direction 
(1, m) 


Py=(1,1,.1) 
V =2i-j-k 


line 0 = (3,0, 0) 


y=mx+b 


Fig. 14.40 Angle between planes = angle between normals. Parallel and perpendicular to a 
line. A line in space through Po and Q. 


Remark 1 We gave the “point-slope” equation of a line (using m), and the “point- 
normal” equation of a plane (using N). What is the normal vector N to a line? 

The vector V =(1, m) is parallel to the line y= mx + b. The line goes across by 1 
and up by m. The perpendicular vector is N =(—™m, 1). The dot product N+ V is 
—m+m=0. Then the point-normal equation matches the point-slope equation: 


—m(x — Xo) + 1(y — Vo) = 0 is the same as y — yo = m(x — Xo). (3) 


Remark 2 What is the point-slope equation for a plane? The difficulty is that a 
plane has different slopes in the x and y directions. The function f(x, y)= 
m(x — Xo) + M(y — yo) has two derivatives m and M. 

This remark has to stop. In Chapter 13, “slopes” become “partial derivatives.” 


A LINE IN SPACE 


In three dimensions, a line is not as simple as a plane. A line in space needs two 
equations. Each equation gives a plane, and the line is the intersection of two planes. 


The equations x + y + z= 3 and 2x + 3y +2z=6 determine a line. 


Two points on that line are Py = (1, 1, 1) and Q = (3, 0, 0). They satisfy both equations 
so they lie on both planes. Therefore they are on the line of intersection. The direction 
of that line, subtracting coordinates of Py from Q, is along the vector V = 2i—j —k. 


The line goes through P,=(\. 1. 1) in the direction of V = 2i—j-— k. 
i ) J 


Starting from (Xo, Yo, Zo)=(1, 1, 1), add on any multiple tV. Then x= 1+ 2t and 
y=1-t and z=1-—t. Those are the components of the vector equation 
P = Py + tV—which produces the line. 


Here is the problem. The line needs two equations—or a vector equation with a 
parameter t. Neither form is as simple as ax + by + cz = d. Some books push ahead 
anyway, to give full details about both forms. After trying this approach, I believe 
that those details should wait. Equations with parameters are the subject of 
Chapter 12, and a line in space is the first example. Vectors and planes give plenty 
to do here—especially when a vector is projected onto another vector or a plane. 


PROJECTION OF A VECTOR 
What is the projection of a vector B onto another vector A? One part of B goes along 


A—that is the projection. The other part of B is perpendicular to A. We now compute 
these two parts, which are P and B-— P. 
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In geometry, projections involve cos @. In algebra, we use the dot product (which 
is closely tied to cos 8). In applications, the vector B might be a velocity V or a force 
F: 

An airplane flies northeast, and a 100-mile per hour wind blows due 
east. What is the projection of V = (100, 0) in the flight direction A? 


Gravity makes a ball roll down the surface 2x + 2y + z=0. What are 
the projections of F = (0,0, — mg) in the plane and perpendicular to 
the plane? 


The component of V along A is the push from the wind (tail wind). The other 
component of V pushes sideways (crosswind). Similarly the force parallel to the 
surface makes the ball move. Adding the two components brings back V or F. 


N=2i+2j+k 


downhill force: 


tailwind = pearson force 
projection of F 1 
of Von A a 
V= 100i 
crosswind 


force of gravity 
F =-—mgk 


Fig. 11.144. Projections along A of wind velocity V and force F and vector B. 


We now compute the projection of B onto A. Call this projection P. Since its 
direction is known—P is along A—we can describe P in two ways: 


1) Give the length of P along A 
2) Give the vector P as a multiple of A. 


Figure 11.11b shows the projection P and its length. The hypotenuse is |B|. The 
length is |P| = |B| cos 6. The perpendicular component B — P has length |B| sin 6. The 
cosine is positive for angles less than 90°. The cosine (and P!) are zero when A and 
B are perpendicular. |B| cos 6 is negative for angles greater than 90°, and the pro- 
jection points along —A (the length is |B||cos 6|). Unless the angle is 0° or 30° or 45° 
or 60° or 90°, we don’t want to compute cosines—and we don’t have to. The dot 
product does it automatically: 

B 


A . 
|A||B| cos = A-B so the length of P along A is \B\ cos 0 = TAT” 


(4) 
Notice that the length of A cancels out at the end of (4). If A is doubled, P is 
unchanged. But if B is doubled, the projection is doubled. 

What is the vector P? Its length along A is A+ B/|A|. If A is a unit vector, then 
|A| = 1 and the projection is A- B times A. Generally A is not a unit vector, until we 
divide by |A|. Here is the projection P of B along A: 


A-B\/A A-B 
P = (length of P)(unit vector) = ( \( )- — A. 5 
sited jal /\iAi) IAP ©) 


\ 


41.2 Planes and Projections 
EXAMPLE 5 For the wind velocity V = (100, 0) and flying direction A = (1, 1), find P. 
Here V points east, A points northeast. The projection of V onto A is P: 


“Vv “V 100 
length |P| = ar = 7 vector P= car = > A 1) = (50, 50). 


EXAMPLE 6 Project F = (0, 0, — mg) onto the plane with normal N = (2, 2, 1). 
The projection of F along N is not the answer. But compute that first: 


F-N__ meg F-N 


IN) 3 ~ INP 


Na— ze 2, 1). 


P is the component of F perpendicular to the plane. It does not move the ball. The 
in-plane component is the difference F — P. Any vector B has two projections, along 
A and perpendicular: 


AB 
The projection P = ——. A is perpendicular to the remaining component B— P. 


|Al? 


EXAMPLE 7 Express B=i-—j as the sum of a vector P parallel to A= 3i+j and a 
vector B— P perpendicular to A. Note A- B= 2. 


A-B 2 6 2 4 12 
luti = a= A= i+ Fj. -P=—j-—j 
Solution P Ti 10 10'* to! Then B—-P 10 to! 
Check: P+(B—P)=(45)(5) — (443) = 0. These projections of B are perpendicular. 
Pythagoras: |P|* + |B— P|? equals |B|*. Check that too: 0.4 + 1.6 = 2.0. 


Question When is P=0? Answer When A and B are perpendicular. 


EXAMPLE 8 Find the nearest point to the origin on the plane x + 2y + 2z=5S. 


The shortest distance from the origin is along the normal vector N. The vector P to 
the nearest point (Figure 11.12) is t times N, for some unknown number t. We find t 
by requiring P = tN to lie on the plane. 

The plane x + 2y+2z=5 has normal vector N =(1, 2, 2). Therefore P= tN = 
(t, 2t, 2t). To lie on the plane, this must satisfy x + 2y + 2z = 5: 


t+2(2t)+2(2)=5 or 9t=5 or t=. (6) 
Then P = §N = (8, 42, 42). That locates the nearest point. The distance is $|N| = #. 


This example is important enough to memorize, with letters not numbers: 


44E On the plane ax + by + cz =d, the nearest point to (0, 0, 0) is 
_ (da, db, de) Id| 


p=... The distance is ————————. 
a*>+b?+ 0c? ./a? +b? + c? 


The steps are the same. N has components a, b, c. The nearest point on the plane is 
a multiple (ta, tb, tc). It lies on the plane if a(ta) + b(tb) + c(tc) = d. 

Thus t = d/(a? + b? + c?). The point (ta, tb, tc) = tN is in equation (7). The distance 
to the plane is |tN| = |d|/|N|. 
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N=i+2j+2k 
4 


Q=i+3j+2k 


in| = SEU -> 
V1+44+4 


P=Q+ IN 


P=iN |p| = —2—=3 


plane x+2y+2z=5 


Fig. 11.42 Vector to the nearest point P is a multiple tN. The distance is in (7) and (9). 


Question How far is the plane from an arbitrary point Q =(x,, y;, 2;)? 
Answer The vector from Q to P is our multiple tN. In vector form P= Q + tN. This 
reaches the plane if P+ N = d, and again we find t: 


(Q+tN):N=d yields t=(d—Q:N)/|N/’. (8) 
This new term Q<N enters the distance from Q to the plane: 
distance = |tN| = |d — Q-> N|/|N| = |d — ax, — by, — ¢2,|/y ‘a? + b? + c?. (9) 


When the point is on the plane, that distance is zero—because ax, + by, + cz, =d. 
When Q is i + 3j + 2k, the figure shows Q- N = 11 and distance = 2. 


PROJECTIONS OF THE HEART VECTOR 


An electrocardiogram has leads to your right arm—left arm-—left leg. You produce the 
voltage. The machine amplifies and records the readings. There are also six chest 
leads, to add a front-back dimension that is monitored across the heart. We will 
concentrate on the big “Einthoven triangle,” named after the inventor of the ECG. 

The graphs show voltage variations plotted against time. The first graph plots the 
voltage difference between the arms. Lead II connects the left leg to the right arm. 
Lead III completes the triangle, which has roughly equal sides (especially if you are 
a little lopsided). So the projections are based on 60° and 120° angles. 


The heart vector V is the sum of many small vectors—all moved to the same 
origin. V is the net effect of action potentials from the cells—small dipoles adding to 
a single dipole. The pacemaker (S—A node) starts the impulse. The atria depolarize 
to give the P wave on the graphs. This is actually a P loop of the heart vector—the 


LEAD Il LEAD aVp LEAD Ill 


Fig.A The graphs show the component of the moving heart vector along each lead. These 
figures are reproduced with permission from the CIBA Collection of Medical Illus- 
trations by Frank H. Netter, M.D. Copyright 1978 CIBA-GEIGY, all rights reserved. 


14.2 Pianes and Projections 


graphs only show its projections. The impulse reaches the AV node, pauses, and 
moves quickly through the ventricles. This produces the QRS complex—the large 
sharp movement on the graph. 

The total QRS interval should not exceed 1/10 second (25 spaces on the printout), 
¥ points first toward the right shoulder. This direction is opposite to the leads, so 
the tracings go slightly down. That is the Q wave, small and negative. Then the heart 
yector sweeps toward the left leg. In positions 3 and 4, its projection on lead I 
{between the arms) is strongly positive. The R wave is this first upward deflection in 
each lead. Closing the loop, the S$ wave is negative (best seen in leads J and aVR). 


Question 1 How many graphs from the arms and leg are really independent? 
Answer Only two! In a plane, the heart vector V has two components. If we know 
two projections, we can compute the others. (The ECG does that for us.) Different 
vectors show better in diflerent projections. A mathematician would use 90° angles, 
with an electrode at your throat. 


Question 2 How are the voltages related? What is the aVR lead? 
Answer Project the heart vector V onto the sides of the triangle: 
The lead vectors have L, — Ly + Ly, = 0—they form a triangle. 
The projections have ¥;— ¥y + Vin = V°L,— ¥°Ly t+ ¥°Ly =9. 
The aVR lead is — 4L, — $L,. It is pure algebra (no wire). By vector addition it points 
toward the electrode on the right arm. Its length is ./3 if the other lengths are 2. 


Including aVL and aYF to the left arm and foot, there are six leads intersecting at 
equal angles. Visualize them going out from a single point (the origin in the chest). 


aVR 


Fig.B Heart vector goes around the QRS loop. Projections are spikes on the ECG. 


Question 3 If the heart vector is V = 2i—j, what voltage differences are recorded? 
Answer The leads around the triangle have length 2. The machine projects Y: 


Lead I is the horizontal vector 2%. So ¥ -L, = 4. 
Lead IT is the — 60° vector i— \/3j. So V-Ly =2+/3. 
Lead III is the —120° vector —i—- J 3. So VeLy = —2+ /3. 


The first and third add to the second. The largest R waves are in leads I and II. In 
aVR the projection of V will be negative (Problem 46), and will be labeled an S wave. 


aVL 
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Question 4 What about the potential (not just its differences). Is it zero at the center? 
Answer It is zero if we say so. The potential contains an arbitrary constant C. (It is 
like an indefinite integral. Its differences are like definite integrals.) Cardiologists 
define a “central terminal” where the potential is zero. 


The average of V over a loop is the mean heart vector H. This average requires 
J Vdt, by Chapter 5. With no time to integrate, the doctor looks for a lead where the 
area under the QRS complex is zero. Then the direction of H (the axis) is perpendicu- 
jar to that lead. There is so much to say about calculus in medicine. 


14.2 EXERCISES 


Read-through questions 


A plane in space is determined by a point Py = (xo, Yo. Zo) 
and a__4a___vector N with components (a, b,c). The point 
P =(x, y, z} is on the plane if the dot product of N with» 

is zero. ne answer was not hy The equation of this plane 
isa{_¢ )+b_d_)+c{_¢@ _}=0. The equation is also 
written as ax + by +cz=d, mie dequals__¢ _. A parallel 
plane has the same _@_ and a different oh. A plane 
through the origin hasd=__| 


The equation of the plane through P, = (2, 1, 0) perpendic- 
ular to N = (3, 4, 5) is__!__. A second point in the plane is 
P=(0,0,__« ). The vector from P, to P is , and it ts 

m__ to N. (Check by dot product.) The plane through Py = 
(2, 1, 0) perpendicular to the z axishas N=__ 9 __ and equa- 
tion ° 


The component of B in the direction of Ais__P__, where 
8 is the angle between the vectors. This is A‘ B divided by 

a__. The projection vector P is |B|cos@ times a__° 
vector in the direction of A. Then P =( |B] cos 6)(A/|Al) sim- 
plifies to__$ _. When B is doubled, P is__t__. When A is 
doubled, Pis_ vu. If Breverses direction then P__¥ _. If 
A reverses direction then P__# 


When B is a velocity vector, P represents the __* _. When 
B is a force vector, P is__y__. The component of B perpen- 
dicular to A equals . The shortest distance from (0, 0, 0) 
to the plane ax + by +cz=d is along the _A__ vector. The 
distance is__8 and the closest point on the plane is P= 
C__. The distance from Q=(x,, ¥1,2;) to the plane is 
D 


Find two points P and Py on the planes 1-6 and a normal 
vector N. Verify that N -(P — P.) = 


1x+2y+3z2=0 2x+2y+3z=6 3 the yz plane 
4 the plane through (0, 0, 0) perpendicular toi+j—k 
5 the plane through (1, I, 1) perpendicular toi+j—k 


6 the plane through (0, 0, 0) and (1, 0, 0) and (0, 1, 1). 


Find an x — y — z equation for planes 7-10. 

7 The plane through Py =(1, 2, —1) perpendicular to N= 
i+j 

8 The plane through Py =(I, 2, 
i+2-k 


—1) perpendicular to N = 


9 The plane through (1, 0, 1) parallel to x + 2y+2=0 
10 The plane through (x9, yo, Zo) parallel to x + y+z=1. 


11 When is a plane with normal! vector N parallel to the 
vector ¥? When is it perpendicular to ¥? 


12 (a) If two planes are perpendicular (front wall and side 
wall), is every line in one plane perpendicular to every line 
in the other? 

(b} If a third plane is perpendicular to the first, it might 
be (parallel) (perpendicular) (at a 45° angle) to the second. 


13 Explain why a plane cannot 
(a} contain (1, 2, 3) and (2, 3, 4) and be perpendicular to 
N=i+j 
(b) be perpendicular to N =i+j and parallel to V=i+k 
(c} contain (1, 0, 0}, (0, 1, 0), (0, 0, 1), and (1, 1, 1) 
(d) contain (1, 1, —f}if it has N=i+j—k (maybe it can) 
(e} go through the ongin and have the equation 
ax+by+ez=t. 
14 The equation 3x + 4y + 7z —1 =0 yields a hyperplane in 
four dimensions. Find its norma] vector N and two points P, 
@ on the hyperplane. Check (P — Q}-N =0. 


15 The plane through (x, y, z) perpendicular to ai + bj + ck 
goes through (0, 0, ie if . The plane goes through 


(Xp, Jos Zo) if 

16 A curve in three dimensions is the intersection of 

surfaces. A line in four dimensions is the intersection of 
hyperplanes, 

17 (angle between planes} Find the cosine of the angle 

between x+2y+2z=0 and (a} x+2z=0 (b) x+2z=5 

(c)x =0. 
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18 N is perpendicular to a plane and Y is along a line. Draw 
the angle 9 between the plane and the line, and explain why 
¥-N/|¥I|N] is sin 6 not cos 6. Find the angle between the xy 
plane and ¥ =i+j+./2k. 


In 19-26 find the projection P of B along A. Also find |P|. 
19 A=(4, 2, 4), B=(1, —1, 0) 

26 A=(1, —1, 0), B=(4, 2, 4) 

21 B=unit vector at 60° angle with A 

22 B=vector of length 2 at 60° angle with A 

23 B=—A 244 A=itj, B=i+k 

25 A is perpendicular to x— y+ z=0, B=i+j. 

26 A is perpendicular to x — y+z=5, B=i+jt 5k. 


27 The force F = 3i — 4k acts at the point (1, 2, 2). How much 
force pulls toward the origin? How much force pulls vertically 
down? Which direction does a mass move under the force F? 


28 The projection of B along Ais P = . The projec- 
tion of B perpendicular to A is . Check the dot 
product of the two projections. 


29 P=(x, y,z) is on the plane ax+by+cz=5 if P*-N= 
|PiiN| cos @ = 5. Since the largest value of cos @ is 1, the small- 
est value of |P| is . This is the distance between 


W If the air speed of a jet is 500 and the wind speed is 50, 
what information do you need to compute the jet’s speed over 
land? What is that speed? 


31 How far is the plane x + y—z=1 from (0, 0, 0) and also 
from (1, 1, —1)? Find the nearest points. 


32 Describe all points at a distance 1 from the plane 
x + 2y + 2z=3. 


33 The shortest distance from @=(2, 1,1) to the plane 
x+y+z=0 is along the vector . The point P= 
Q+iN=(24+4,1+¢, 1+) lies on the plane if t= . 
Then P= and the shortest distance is 

(This distance is not |P|.) 


34 The plane through (1,1,1) perpendicular to N= 
i+ 2j + 2k is a distance from (0, 0, 0). 


35 (Distance between planes) 2x—2y+z=1 is parallel 
to 2x —2y + z=3 because . Choose a vector Q on 
the first plane and find ¢ so that Q+¢N lies on the second 
plane. The distance is |rN] = 


3% The distance between the planes x+y+5z=7 and 
3x+2y+z=1 is zero because 


In Problems 37-41 all points and vectors are in the xy plane. 


37 The line 3x + 4y = 10 is perpendicular to the vector N= 
. On the line, the closest point to the origin is P = 
tN. Find ¢ and P and {P|. 


38 Draw the line x+2y=4 and the vector N=i+ 2j. The 
closest point to Q =(3, 3) is P=Q+¢N. Find ¢. Find P. 


39 A new way to find P in Problem 37: minimize x? + y? = 
x* + A2—4x)*. By calculus find the best x and y. 


40 To catch a drug runner going from (0, 0) to (4,0) at 8 
meters per second, you must travel from (0, 3) to (4, 0) at 

meters per second. The projection of your velocity 
vector onto his velocity vector will have length 


41 Show by vectors that the distance from (x,, ¥,) to the line 
ax + by =d is |d — ax, — dby,|/,/a? + 6%. 


42 It takes three points to determine a plane. So why does 
ax +by+cz=d contain four numbers a, b, c,d? When does 
ex+fy +gz=1 represent the same plane? 


43 (projections by calculus) The dot product of B— tA with 
itself is |B? —2¢A-B+e7|Al?. (a) This has a minimum at 
t= . (0) Then tA is the projection of _A 
figure showing B, ¢A, and B— fA is worth 1000 words. 


44 From their equations, how can you tell if two planes are 
(a) parallel (b) perpendicular ({c) at a 45° angle? 


Problems 45—48 are about the ECG and heart vector. 


45 The aVR lead is —$1,—4L,. Find the aVL and aVF 
leads toward the left arm and foot. Show that 
aVR +aVL+aVF =0, They go out from the center at 120° 
angles. 


46 Find the projection on the aVR lead of V =2i~j in 
Question 3. 


47 If the potentials are gp, = t (right arm) and ¢,, = 2 and 
Or. = —3, find the heart vector Y, The differences in potential 
are the projections of ¥. 


48 If V is perpendicular to a lead L, the reading on that lead 
is . If [ ¥(t}dt is perpendicular to lead L, why is the 
area under the reading zero? 
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es = 11.3 Cross Products and Determinants ‘x 


After saying that vectors are not multiplied, we offered the dot product. Now we 
contradict ourselves further, by defining the cross product. Where A - B was a number, 
the cross product A x B is a vector. It has length and direction: 


The length is |A||B| |sin 0|. The direction ts perpendicular to A and B. 


The cross product {also called vector product) is defined in three dimensions only. 
A and B lie on a plane through the origin. A x B is along the normal vector N, 
perpendicular to that plane. We still have to say whether it points ‘‘up” or “down” 
along N. 

The iength of A x B depends on sin 6, where A> B involved cos 8. The dot product 
rewards vectors for being parailel (cos 0 = 1). The cross product is largest when A is 
perpendicular to B (sin 2/2 = 1). At every angle 


|A-B)? + |A x BI? = |A/?|Bl? cos?6 + [A|?|B/? sin26 = |Al?|BI?. (1) 


That will be a bridge from geometry to algebra. This section goes from definition to 
formula to volume to determinant. Equations (6) and (14) are the key formulas for 
Ax B. 

Notice that A x A= 0. (This is the zero vector, not the zero number.) When B is 
parallel to A, the angle is zero and the sine is zero. Parallel vectors have A x B= 0. 
Perpendicular vectors have sin@=1 and |A x B| =|A||B| = area of rectangle with 
sides A and B. 

Here are four examples that lead to the cross product A x B. 


EXAMPLE 4 (From geometry) Find the area of a parallelogram and a triangle. 


Vectors A and B, going out from the origin, form two sides of a triangle. They produce 
‘the parallelogram in Figure 11.13, which is twice as large as the triangle. 

The area of a parallelogram is base times height (perpendicular height not sloping 
height). The base is |A|. The height is |B||sin 6]. We take absolute values because 
height and area are not negative. Then the area is the length of the cross product: 


area of parallelogram = |A| |B||sin | = |A x BI. (2) 


tumimg Paxis 


height 
|B| {sin 6 


I 
| 
| 
| 
[ 
t 


base | Al 
area [A|[B[|sin @]=|AxB| moment |R|{F| sin @ 


Rg. 44.43 Area |A x B| and moment |R x F|, Cross products are perpendicular to the page. 


EXAMPLE 2 (From physics) The torque vector T = R x F produces rotation. 


The force F acts at the point (x, y, z). When F is parallel to the position vector R = 
xi+ yj + zk, the force pushes outward (no turning). When F is perpendicular to R, 
the force creates rotation. For in-between angles there is an outward force |F| cos @ 
and a turning force |F} sin @. The turning force times the distance {R| is the moment 
|R| {F| sin 6. 
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The moment gives the magnitude and sign of the torque vector T= R x F. The 
direction of T is along the axis of rotation, at right angles to R and F. 


EXAMPLE 3 Does the cross product go up or down? Use the right-hand rule. 


Forces and torques are probably just fine for physicists. Those who are not natural 
physicists want to see something turn.t We can visualize a record or compact disc 
rotating around its axis—which comes up through the center. 

At a point on the disc, you give a push. When the push is outward (hard to do), 
nothing turns. Rotation comes from force “‘around” the axis. The disc can turn either 
way—depending on the angle between force and position. A sign convention is 
necessary, and it is the right-hand rule: 


A x B points along your right thumb when the fingers curl from A toward B. 


This rule is simplest for the vectors i, j, k in Figure 11.14—-which is all we need. 

Suppose the fingers curl from i to j. The thumb points along k. The x-y-z axes 
form a “right-handed triple.” Since |i| = 1 and |j| = 1 and sin z/2 = 1, the length of i x j 
is 1. The cross product is ix j=k. The disc turns counterclockwise—its angular 
velocity is up—when the force acts at i in the direction j. 


Figure 11.14b reverses i and j. The force acts at j and its direction is i. The disc 
turns clockwise (the way records and compact discs actually turn). When the fingers 
curl from j to i, the thumb points down. Thus j x i= — k. This is a special case of an 
amazing rule: 

The cross product is anticommutative: B x A= —(A x B). (3) 


That is quite remarkable. Its discovery by Hamilton produced an intellectual revolu- 
tion in 19th century algebra, which had been totally accustomed to AB= BA. This 
commutative law is old and boring for numbers (it is new and boring for dot pro- 
ducts). Here we see its opposite for vector products A x B. Neither law holds for 
matrix products. 


ixj=k 


screw going in screw coming out 


Fig. 11.44 ixj=k=—(jxi) ixk=—j=—(kxi) jxk=i=—(k~x/j). 


EXAMPLE 4 A screw goes into a wall or out, following the right-hand rule. 


The disc was in the xy plane. So was the force. (We are not breaking records here.) 
The axis was up and down. To see the cross product more completely we need to 
turn a screw into a wall. 

Figure 11.14b shows the xz plane as the wall. The screw is in the y direction. By 
turning from x toward z we drive the screw into the wall—which is the negative y 
direction. In other words i x k equals minus j. We turn the screw clockwise to make 
it go in. To take out the screw, twist from k toward i. Then k x i equals plus j. 


+Everybody is a natural mathematician. That is the axiom behind this book. 
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To summarize: k x i=j and j x k=i have plus signs because kij and jki are in the 
same “cyclic order’ as ijk. (Anticyclic is minus.) The z—x—y and y—z—x axes form right- 
handed triples like x—y—z. 


THE FORMULA FOR THE CROSS PRODUCT 


We begin the algebra of A x B. It is essential for computation, and it comes out 
beautifully. The square roots in |A| |B| |sin 6| will disappear in formula (6) for A x B. 
(The square roots also disappeared in A~B, which is |A||B| cos 0. But |A||B| tan 6 
would be terrible.) Since A x B is a vector we need to find three components. 

Start with the two-dimensional case. The vectors a,i+ a,j and b,i+ b,j are in the 
xy plane. Their cross product must go in the z direction. Therefore A x B= __?_k 
and there is only one nonzero component. It must be |A||B| sin 6 (with the correct 
sign), but we want a better formula. There are two clean ways to compute A x B, 
either by algebra (a) or by a bridge (4) to the dot product and geometry: 


(a) (a,i+ aaj) x (b)i + b2j) =a, by ixita,b,ixj+anb,jxita,b,jxj. (4) 
On the right are 0, a,b,k, —a,b,k, and 0. The cross product is (a,b, — ab, )k. 


(6) Rotate B= b,i+ b,j clockwise through 90° into B* = b,i— b,j. Its length is 
unchanged (and B- B* = 0). Then |A||B| sin 6 equals |A||B*| cos 6, which is A - B*: 


a b 
| ial “|b. aa (5) 
ay —b, 


EXAMPLES For A=i+ 2j and B= 4i + 5j the cross product is (1-5 — 2+4)k = — 3k. 
Area of parallelogram = 3, area of triangle = 3/2. The minus sign in A x B= — 3k is 
absent in the areas. 


Note Splitting A x B into four separate cross products is correct, but it does not 
follow easily from |A||B| sin 8. Method (a) is not justified until Remark 1 below. An 
algebraist would change the definition of A x B to start with the distributive law 
(splitting rule) and the anticommutative law: 


Ax(B+C)=(Ax B)+(AxC) and Ax B= -—(Bx A). 
THE CROSS PRODUCT FORMULA (3 COMPONENTS) 


We move to three dimensions. The goal is to compute all three components of A x B 
(not just the length). Method (a) splits each vector into its i, j, k components, making 
nine separate cross products: 


(a,i+ a,j +a 3k) x (b,i+ b,j + b3k) = a,b,(i x i) + a,b,{i x j) + seven more terms. 


Remember i x i=j x j=k x k=0. Those three terms disappear. The other six terms 
come in pairs, and please notice the cyclic pattern: 


FORMULA A x B= (a,b; — a3b3)i + (a3b, — a,b3)j + (a,b. — ayb,)k. (6) 


The k component is the 2 x 2 answer, when a; = b; = 0. The i component involves 
indices 2 and 3, j involves 3 and 1, k involves 1 and 2. The cross product formula is 
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written as a “determinant” in equation (14) below—many people use that form to 
compute A x B. 


EXAMPLE 6 (i+ 2j+3k) x (41+ 5j+ 6k) = (2°6—3-5)i+(3°4—1°6)j+(1°-5—2°4)k. 
The i, j, k components give A x B= — 3i + 6j — 3k. Never add the —3, 6, and —3. 
Remark 1 The three-dimensional formula (6) is still to be matched with A x B from 
geometry. One way is to rotate B into B* as before, staying in the plane of A and B. 
Fortunately there is an easier test. The vector in equation (6) satisfies all four geo- 
metric requirements on A x B: perpendicular to A, perpendicular to B, correct length, 
right-hand rule. The length is checked in Problem 16—here is the zero dot product 
with A: 


A+(A x B)= a,(a2b3 — a3b2) + a2(a3b, — a,b3) + a3(a,b2 — arb,) = 0. (7) 
Remark 2 (Optional) There is a wonderful extension of the Pythagoras formula 
a? + b? =’. Instead of sides of a triangle, we go to areas of projections on the yz, xz, 
and xy planes. 3? + 6? + 3? is the square of the parallelogram area in Example 6. 

For triangles these areas are cut in half. Figure 11.15a shows three projected trian- 
gles of area 4. Its Pythagoras formula is (3)? + (3)? + (3)? = (area of PQR)?. 
EXAMPLE7 P=(1, 0,0), Q =(0, 1,0), R= (0,0, 1) lie in a plane. Find its equation. 
Idea for any P,Q, R: Find vectors A and B in the plane. Compute the normal N= A x B. 


Solution The vector from P to Q has components —1, 1, 0. It is A=j—i (subtract 
to go from P to Q). Similarly the vector from P to R is B= k —i. Since A and B are 
in the plane of Figure 11.15, N= A x B is perpendicular: 


(j-—i) x (k—i)=(j x k)-—(@ix k)-—(j xi) + x )=i+jt+k. (8) 
The normal vector is N =i+j+k. The equation of the plane is \x + ly + 1z=d. 


With the right choice d = 1, this plane contains P, Q, R. The equation is x + y+z=1. 


EXAMPLE 8 What is the area of this same triangle PQR? 
Solution The area is half of the cross-product length |A x B| = |i+j+k|= Jt 


R= (0,0, 1) planex+y+z=1 
normal N=i+j+k 
B=k-i Q =(0, 1, 0) |A|cos 8 
A=j-i 

P=(1,0,0) 


Fig. 14.45 Area of PQR is \/3/2. N is PQ x PR. Volume of box is |A-(B x C)|. 


DETERMINANTS AND VOLUMES 


We are close to good algebra. The two plane vectors a,i+ a,j and b,i+ bj are the 
sides of a parallelogram. Its area is a,b, — a,b,, possibly with a sign change. There 
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is a special way to write these four numbers—in a “square matrix.” There is also a 
name for the combination that leads to area. It is the “determinant of the matrix”: 


a 1 a 2 
, its determinant is 
b, by; 


ay ay 


The matrix is = a,b, — azb,. 


b, bp 
This is a 2 by 2 matrix (notice brackets) and a 2 by 2 determinant (notice vertical 
bars). The matrix is an array of four numbers and the determinant is one number: 


2 l 


Examples of determinants: 
l 


1 
=6-4=2, 
3 


The second has no area because A = B. The third is a unit square (A =i, B=)j). 

Now move to three dimensions, where determinants are most useful. The parallelo- 
gram becomes a parallelepiped. The word “box” is much shorter, and we will use it, 
but remember that the box is squashed. (Like a rectangle squashed to a parallelogram, 
the angles are generally not 90°.) The three edges from the origin are A = (a;, a2, a3), 
B= (b,, bz, b3), C =(c,, 2, ¢3). Those edges are at right angles only when A: B= 
A:C=B-C=0. 


Question: What is the volume of the box? The right-angle case is easy—it is length 
times width times height. The volume is |A| times |B| times |C|, when the angles are 
90°. For a squashed box (Figure 11.15) we need the perpendicular height, not the 
sloping height. 

There is a beautiful formula for volume. B and C give a parallelogram in the base, 
and |B x C| is the base area. This cross product points straight up. The third vector 
A points up at an angle—its perpendicular height is |A| cos 8. Thus the volume is 
area |B x C| times |A| times cos 0. The volume is the dot product of A with B x C. 


Important: A-(B x C) is a number, not a vector. This volume is zero when A is in 
the same plane as B and C (the box is totally flattened). Then B x C is perpendicular 
to A and their dot product is zero. 


Useful facts: A+(Bx C)=(A x B)-C=C:(A x B)=B>-(C x A). 


All those come from the same box, with different sides chosen as base—but no change 
in volume. Figure 11.15 has B and C in the base but it can be A and B or A and C. 
The triple product A*+(C x B) has opposite sign, since C x B= —(B x C). This order 
ACB is not cyclic like ABC and CAB and BCA. 


To compute this triple product A -(B x C), we take B x C from equation (6): 
A _ (B x C) = a,(b3c3 = b3¢2) i Ue a,(b3c, a byc3) 3 a3(b,c> —™ bc). (9) 


The numbers a,, a3, a3; multiply 2 by 2 determinants to give a 3 by 3 determinant! 
There are three terms with plus signs (like a,b,c;). The other three have minus signs 
(like —a,b3c,). The plus terms have indices 123, 231, 312 in cyclic order. The minus 
terms have anticyclic indices 132, 213, 321. Again there is a special way to write the 
nine components of A, B, C—as a “3 by 3 matrix.” The combination in (9), which 
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gives volume, is a “3 by 3 determinant:” 
@; @ a3 a 4) dy 
matrix=| 5b, b, »b; |, determinant=A-(BxC)=|b, 6, 53]. 
€; €2 Cy €; Cz 3 
A single number is produced out of nine numbers, by formula (9). The nine numbers 


are multiplied three at a time, as in a,b,c.— except this product is not allowed. Eack 
row and column must be represented once. This gives the six terms in the determinant: 


4;b2¢4 + a2b3¢, + a3b,c 
by b, b,| = i%2%3 2°31 aU iv2Z (10) 


= A;b3C2 a a,b,c a a3b2¢, 


The trick is in the + signs. Products down to the right are ‘“‘plus”: 


2 
2°2*24+1°1+14+1-+1-1 8+14+1 


= = 4, 
—2-1e1-1-1+2-1+2-1 -2-2-2 


2 1 
11 2 
With practice the six products like 2+2+2 are done in your head. Write down only 


8+1+1—2—2-—2=4, This is the determinant and the volume. 
Note the special case when the vectors are i, j, k. The box is a unit cube: 


10 0 
1+0+0 
volume of cube=/0 1 O|= =1. 
—-0-0-0 
00 1 


if A, B, C lie in the same plane, the volume is zero. A zero determinant is the test 
to see whether three vectors lie in a plane. Here row A = row B— row C: 


O1- 
Or1e1+1+0-C1)+1)-E 1) 

ay ae “gs nee Sa 2 a aa (11) 

bath =0°0-°0-1°(-1)11-( 1-11) 


Zeros in the matrix simplify the calculation. All three products with plus signs— 
down to the right—are zero. The only two nonzero products cancel each other. 

If the three — 1's are changed to +1’s, the determinant is — 2. The determinant can 
be negative when all nine entries are positive! A negative determinant only means 
that the rows A, B, C form a “left-handed triple.” This extra information from the 
sign—right-handed vs. left-handed—is free and useful, but the volume is the absolute 
value, 

The determinant yields the volume also in higher dimensions. In physics, four 
dimensions give space-time. Ten dimensions give superstrings. Mathematics uses all 
dimensions. The 64 numbers in an 8 by 8 matrix give the volume of an eight- 
dimensional box—with 8! = 40,320 terms instead of 3! = 6. Under pressure from my 
class I omit the formula. 
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Question When is the point {x, y, z) on the plane through the origin containing B 
and C? For the yector A= xi+ yj+ zk to lie in that plane, the volume A+(B x C) 
must be zero, The equation of the plane is determinant = zero. 


Follow this example for B = j — iand C = k — ito find the plane parallel to B and C: 


x yz 
x-1-L+y-O-(-1)+2-0-(-1) 

ear heer fossa): (2) 
ene ee ova eee 

oe ae | : 


This equation is x + y+2z=0. The normal vector N = B x C has components 1, 1, 1. 
THE CROSS PRODUCT AS A DETERMINANT 


There is a connection between 3 by 3 and 2 by 2 determinants that you have to see. 
The numbers in the top row multiply determinants from the other rows: 


By 9.43 


b, b 
b, by b3| =a, ° 


b, by 


C, €2 


— az + ay « (13) 


Cy C3 
Cy 2 Ly 


The highlighted product a,(b,c, — b3c,) gives two of the six terms. All six products 
contain an a and b and c from different columns. There are 3! = 6 different orderings 
of columns 1, 2, 3. Note how a, multiplies a determinant from columns 1 and 2. 

Equation (13) is identical with equations (9) and (10). We are meeting the same six 
terms in different ways. The new feature is the minus sign in front of @2,—and the 
common mistake is to forget that sign. In a 4 by 4 determinant, a,, ~ az, a3, — 44 
would multiply 3 by 3 determinants. 

Now comes a key step. We write A x B as a determinant. The vectors i, j, k go in 
the top row, the components of A and B go in the other rows. The “determinant” is 
exactly A x B: 


ij k 
a, @3 @, 43 a, @2 
AXB=/4@ @ 43/=i ~j +k ; (14) 
b, 5; b, by, b, by 
b, by by ieee 


This time we highlighted the j} component with its minus sign. There is no great 
mathematics in formula (14)—it is probably illegal to mix i, j, k with six numbers but 
it works. This is the good way to remember and compute A x B. In the example 
{j — i) x (k— 3) from equation (8), those two vectors go into the last two rows: 


ijk 
1 0 —1 0 coz a 
—1 1 Ol=i -j + =itjtk 
>= 0 1 -~it 1] “]-1 0 - 
-101 fi 


The k component is highlighted, to see a,b, — a,b, again. Note the change from 
equation (11), which had 0, 1, —1 in the top row. That triple product was a number 
(zero). This cross product is a vector i+ j+ k. 
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Review question 1 With the i, j, k row changed to 3, 4, 5, what is the determinant? 
Answer 3:1+4-1+5+1= 12. That triple product is the volume of a box. 


Review question 2) When is A x B=0 and when is A-(B x C) = 0? Zero vector, 


zero number. 


Answer When A and B are on the same line. When A, B, C are in the same plane. 


Review question 3 Does the parallelogram area |A x BI equal a 2 by 2 determinant? 
Answer If A and B lie in the xy plane, yes. Generally no. 


Review question 4 What are the vector triple products (Ax B)xC and 


Ax (Bx C)? 


Answer Not computed yet. These are two new vectors in Problem 47, 


Review question 5 Find the plane through the ongin containing A = i+ j+ 2k and 
B=i+k. Find the cross product of those same vectors A and B. 
Answer The position vector P = xi+ yj + zk is perpendicular to N= A x B: 


yz 
P-(AxB)=|1 1 2[=x+y-z=0. 
0 


ij k 
AxB=/1 1 2b=i+j-k. 
101 


41.3 EXERCISES 


Read-through questions 


The cross product Ax Bisa__ © whose length is __b 
Its direction is__¢__ to A and B. That length is the area of 


a__d__, whose base is |Aj and whose height is__ ¢ . When 


A=a,i+ a,jand B= 5,i+ 5,j, theareais__ _. This equals 


a2by2_9 In general |A-BI?+/A x B)?=__h 


The rules for cross products are Ax A=__|_ and 
AxB=~(_J/_) and Ax(B+C)=AxB+_k_. In 
particular Ax B needs the __! -hand rule to decide its 


direction. If the fingers curl from A towards B (not more than 
180°), then = m_ points n_. By this ruleixj= © 


andixk= _ ep andjxk=_ 4 


The vectors a,§+ @,j + ask and 5,1+ b,j + 53k have cross 
product t_i+_s j+_t_k The vectors A= 
i+j+kand B=i+jhaveAxB=_ u_. (This is also the 
3 by 3 determinant __¥__.) Perpendicular to the plane con- 
taining (0, 0, 0), (1, 1, 1), (1, 1,6) is the normal vector N= 

w__. The area of the triangle with those three vertices is 

x__, which is half the area of the parallelogram with fourth 
vertex at ¥ 


Vectors A, B, C from the origin determine a __2_. Its vol- 
ume j|A-({__A_)| comes froma 3 by 3__8 _. There are six 
terms, ©  withaplussignand 0 with minus. In every 
term each row and _ E _ is represented once. The rows 
(1, 0, 0), (0, 0, 1), and (0, 1, 0) have determinant =__ F__. That 
boxisa __G@ _, but its sides form a __H__-handed triple in 
the order given. 


If A, B, C lic in the same plane then A-(Bx Cjis_ l. 
For A = xi+ yj+ zk the first row contains the letters J 
So the plane containing B and C has the equation. _K = 
0. When B=i+jand C =k that equationis__ | .Bx Cis 
Ms. 


A 3 by 3 determinant splits into__N__ 2 by 2 determinants. 
They come from rows 2 and 3, and are multiplied bythe 
entries in row 1. With i, j, k in row 1, this ‘determinant equals 


the © product. Its j component is P__, including the 


@_ sign which is easy to forget. 


Compute the cross preducts 1-8 from formula (6) or the deter- 
minant (14). Do one example both ways. 


1(ixj) xk 

3 (i+ 3j) x G+k) 

5 (21+ 37-+k) x (i-j—k) 
7 (i+ 2j + 3k) x (41-9) 
8 (i cos 0 +-j sin 0) x (isin 0 —j cos 8) 

9 When are [A x B| =|A!|B] and |A -(B x ©) = [AI |BIIC|? 


10 True or false: 
(a) A x B never equals A-B, 
(b) If A x B=0 and A-B=9, then either A=0 or B=0. 
(c) If Ax B=AxC and A#0D, then B=C. 


2(ixjxi 
4 (21+ 3) +k) x 21+ 3j)-k) 
6 (i+j—k)x (i-j+h) 
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In 11-16 find |A x B| by equation (1} and then by computing 
A « B and its length. 


1) A=i+j+k, B=i 
13 A=-B 


12 A=i+j, B=i-j 
14 A=i+j, B=j+k 
15 A=2a,i+ a,j, B= 5,i+ b2j 

16 A= (a), 43, a3), B=(b,, b2, 53) 


In Problem 16 (the general case), equation (1) proves that the 
length from equation (6} is correct. 
17 True or false, by testing on A =i, B=j, C =k: 
(a) Ax (A x B)=0 (b} A-(B x C}=(A x B)-C 
(c) A*(Bx C)=C-(B x A) 
(d) (A — B) x (A + B) = 2(A x B). 
18 (a) From A x B = — (B x A) deduce that Ax A=0. 


(b) Split (A + B} x (A + B) into four terms, to deduce that 
(A x B) = —(Bx A). 


What are the normal vectors to the planes 19-22? 


19 (2, 1, O-(x, y, 2) =4 20 3x+42=5 
x yp 2 x y 2 

21 ji 1 Oj =2 22 |t 1 1)=0 
011i { 1 2 


Find N and the equation of the plane described in 23-29. 
23 Contains the points (2, t, 1}, (1, 2, 1), (1, 1, 2) 

24 Contains the points (0, 1, 2}, (1, 2, 3), (2, 3, 4) 

25 Through (0, 0, 0}, (1, 1, 1), (a, 5, c) [What if a= b =¢?] 
26 Parallel to i+jandk 

27 N makes a 45° angle with i and j 

28 N makes a 60° angle with i and j 

29 N makes a 90” angle with i and j 


30 The triangle with sides i and } is as large as the 
parallelogram with those sides. The tetrahedron with edges 
i, j, k is as large as the box with those edges. Extra 
credit: In four dimensions the “simplex” with edges i, j, k, 4 
has volume = 


31 If the points (x, y, 2), (1, 1, 0), and (1, 2, 1) lie on a plane 
through the origin, what determinant is zero? What equation 
does this give for the plane? 


32 Give an example of a right-hand triple and left-hand triple. 
Use vectors other than just i, j, k. 


33 When B= 3i+j is rotated 90° clockwise in the xy plane 
it becomes B* = . When rotated 90° counterclock- 
wise It is . When rotated 180° it is 


34 From formula (6) verify that B-(A x B) = 0. 
35 Compute 
1 2 3] {2 1 OJ Jt O 2 
2 3 4/,J/1 2 1),)/0 3 OF. 
3 4 6] |O 1 2[ |2 0 ! 
3%6 Which of the following are equal to A x B? 


(A+ B)x B, (~B)x(—A), |AI/B||sin |, (A+C) x (B— CO), 
3A — B) x (A+B). 


37 Compare the six terms on both sides to prove that 


The matrix is “transposed” —same determinant. 


38 Compare the six terms to prove that 

G@, @2 @3 
a, a3 
by by b3}=—hy 


a, G2 
+ bz 


GQ, ay 


Cy C3 
Cy C2 C3 


This is an “expansion on row 2.”” Note minus signs. 
39 Choose the signs and 2 by 2 determinants in 
a; dz dy 


by by b; =ite; 


€; €2 €3 


40 Show that (A x B)+(B x C)+(C x A) is perpendicular to 
B-—A and C—Band A-C. 


Problems 41-44 compute the areas of triangles. 


41 The triangle PQR in Example? has squared area 
(/3/2)7 = (3)? + (4)? + 9)’, from the 3D version of Pythagoras 
in Remark 2. Find the area of POR when P = (a, 0,0}, 0 = 
(0, b, 0), and R =(0, 0, c), Check with 3|A x BI. 


42 A triangle in the xp plane has corners at (a,, 6), (a2, 3) 
and (a3, 63). Its area A ts half the area of a parallelogram. 
Find two sides of the parallelogram and explain why 


A = 4\(a; — a, ){b3 — by) — (a3 — ay (by — By}. 


43 By Problem 42 find the area A of the triangle with corners 
(2, 1) and (4, 2) and (1, 2). Where is a fourth corner to make a 
parallelogram? 
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44 Lifting the triangle of Problem 42 up to the plane z= 1 
gives comers (a), b,, 1), (az, 52, 1), (a3, by, 1). The area of the 
triangle times 4 is the volume of the upside-down pyramid 
from (0, 0, 0) to these corners. This pyramid volume is 4 the 
box volume, so 4 (area of triangle) = } (volume of box): 


a, b 1 
area of triangle = ; a, b, If. 
a, 56; 1 
Find the area A in Problem 43 from this determinant. 


45 (1) The projections of A=a,i+a,j+a,k and B= 
b,i+ b,j + 53k onto the xy plane are 
(2} The parallelogram with sides A and B projects toa 
parallelogram with area 
(3) General fact: The projection onto the plane normal to 
the unit vector n has area (A x B)}+n. Verify for n=k. 


46 (a} For A=i+j—4k and B= —i +j, compute (A x B)+i 
and (A x B}-j and (A x B)-k. By Problem 45 those are 
the areas of projections onto the yz and xz and xy pianes. 
(b} Square and add those areas to find |A x B|?. This is 
the Pythagoras formula in space (Remark 2). 
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47 {a) The triple cross product (A x B) x C is in the plane of 
A and B, because it is perpendicular to the cross product 


(b) Compute (A x B} x C when A=a,i+a2j+a3;k, B= 
bt+ baj + b3k, C=i 

{c) Compute (A+ C)B—(B-C)A when C =i. The answers 
in (b) and (c) should agree. This is also true if C =jor C= 
k or C=cyit+c.j+c,k. That proves the tricky formula 


{A x B) x C=(A' OB— (BOA. (*) 
48 Take the dot product of equation (*) with D to prove 
(A x B}-(C x D)= (A+ C)(B- D} — (B+ C)(A« D). 


49 The plane containing P = (0, 1, 1) and Q=(I, 0, t) and 
R=(1,1,0) is perpendicular to the cross product N= 

. Find the equation of the plane and the area of 
triangle POR. 


50 Let P=(1,0, —1) @=(, 1, 1), R =(2, 2, 1). Choose S$ so 
that PQRS is a parallelogram and compute its area. Choose 
T, U, V so that OPQRSTUV is a box (parallelepiped) and 
compute its volume. 


We are moving from geometry to algebra. Eventually we get back to calculus, where 
functions are nonlinear—but linear equations come first. In Chapter 1, p=mx+b 
produced a line. Two equations produce two lines. If they cross, the intersection point 
solyes both equations—and we want to find it. 

Three equations in three variables x, y,z produce three planes. Again they go 
through one point (usually). Again the problem is to find that intersection point 


—which solves the three equations. 


The ultimate problem is to solve n equations in n unknowns. There are n hyper- 
planes in n-dimensional space, which meet at the solution. We need a test to be sure 
they meet. We also want the solution. These are the objectives of near algebra, which 
joins with calculus at the center of pure and applied mathematics.t 

Like every subject, linear algebra requires a good notation. To state the equations 
and solve them, we introduce a “matrix.” The problem will be Au=d. The solution 


will be u= A- 


‘d. It remains to understand where the equations come from, where 


the answer comes from, and what the matrices A and A~! stand for. 


TWO EQUATIONS IN TWO UNKNOWNS 


Linear algebra has no reason to choose one variable as special. The equation p — yo = 
m{x — xX) separates y from x. A better equation for a line is ax + by =. (A vertical 


tLinear algebra dominates some applications while calculus governs others. Both are essential. 
A fuller treatment is presented in the author’s book Linear Algebra and Its Applications 
(Harcourt Brace Jovanovich, 3rd edition 1988}, and in many other texts. 
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line like x = 5 appears when b= 0. The first form did not allow slope m= co.) This 
section studies two lines: 

a,xtby=d, 

(t) 

a,xt+byy= dy. 
By solving both equations at once, we are asking (x, y) to lie on both lines. The 
practical question is: Where do the lines cross? The mathematician’s question is: Does 
a solution exist and is it unique? 

To understand everything is not posstble. There are parts of life where you never 
know what is going on (until too Jate), But two equations in two unknowns can have 
no mysteries. There are three ways to write the system—by rows, by columns, and 
by matrices. Please look at ail three, since setting up a problem is generally harder 
and more important than solving it. After that comes the concession to the real world: 
we compute x and y. 


EXAMPLE 4 How do you invest $5000 to earn $400 a year interest, if a money market 
account pays 5% and a deposit account pays 10%? 


Set up equations by rows: With x dollars at 5% the interest is .05x. With y dollars at 


10% the interest is .10y. One row for principal, another row for interest: 
x+ y= 5000 
(2) 
05x +10) = 400. 


Same equations by columns: The left side of (2) contains x times one vector plus y 
times another vector. The right side is a third vector. The equation by columns is 


fo Lio a 
x +} = : (3) 
05 L0 400 


Same equations by matrices: Look again at the left side. There are two unknowns x 
and y, which go into a vector u. They are multiplied by the four numbers 1, .05, 1, 
and .10, which go into a two by two matrix A. The left side becomes @ matrix times 


a vector: 
1 1 ][x] [sooo 
An= = ; (4) 
Fenees 


Now you see where the “rows” and “‘columns” came from. They are the rows and 
columns of a matrix. The rows entered the separate equations (2). The columns 
entered the vector equation (3). The matrix-vector multiplication Au is defined so 
that all these equations are the same: 


Au by rows: a |e | ieee by (cach row is 
: a, by || y a,x + boy a dot product} 
i a | te | b, | (combination of 
pe eng oeneg ii | a - | * fr column vectors) 


A is the coefficient matrix, The unknown vector is u. The known vector on the right 
side, with components 5000 and 400, is d. The matrix equation is 4u=d. 
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7000 x +y=5000 05x +.10y = 400 vatyb=d 


Fig. 11.16 Each row of Au=d gives a line. Each column gives a vector. 


This notation Au=d continues to apply when there are more equations and more 
unknowns. The matrix A has a row for each equation (usually m rows). It has a column 
for each unknown (usually n columns). For 2 equations in 3 unknowns it is a 2 by 3 
matrix (therefore rectangular). For 6 equations in 6 unknowns the matrix is 6 by 6 
(therefore square). The best way to get familiar with matrices is to work with them. 
Note also the pronunciation: “‘matrisees” and never “‘matrixes.” 


Answer to the practical question The solution is x = 2000, y= 3000. That is the 
intersection point in the row picture (Figure 11.16). It is also the correct combination 
in the column picture. The matrix equation checks both at once, because matrices 
are multiplied by rows or by columns. The product either way is d: 


1 1 |} 2000 2000 + 3000 $000 F 
OS 10 || 3000 (.05)2000 + (.10)3000 400 
Singular case In the row picture, the lines cross at the solution. But there is a case 


that gives trouble. When the lines are parallel, they never cross and there is no solution. 
When the lines are the same, there is an infinity of solutions: 


2x+y=0 2x+ y=0 
parallel lines same line (5} 
2xty=1 4x + 2y=0 


This trouble also appears in the column picture. The columns are vectors a and b. 
The equation Au = d is the same as xa + yb = d. We are asked to find the combination 
of a and b (with coefficients x and y) that produces d. In the singular case a and b lie 
along the same line (Figure 11.17). No combination can produce d, unless it happens 
to lie on this line. 


parallel > lines cross at solution 
lines a=| | vel, y= 
{ 2 
0 
d [ | 4 _ | 
] b > 1 
xatyb 7 
2v+y=0 2x4+y=1 misses d 


Fig. 11.17 Row and column pictures: singular (no solution) and nonsingular (x = y = 1). 


428 


44 Vectors and Matrices 


The investment problem is nonsingular, and 2000 a + 3000 b equals d. We also drew 
Example 2: The matrix A multiplies u=(1, 1) to solve x + 2y = 3 and x — y=0: 


af AEECSFE) = -GFE) 


The crossing point is (1, 1) in the row picture. The solution is x= 1, y=1 in the 
column picture (Figure 11.17b). Then 1 times a plus | times b equals the right side d. 


SOLUTION BY DETERMINANTS 
Up to now we just wrote down the answer. The real problem is to find x and y when 
they are unknown. We solve two equations with letters not numbers: 
a,x+b,y=d, 
a,x+b,y=d. 


The key is to eliminate x. Multiply the first equation by a, and the second equation 
by a,. Subtract the first from the second and the x’s disappear: 


(a,b, — a,b, )y = (a, dz — apd). (6) 
To eliminate y, subtract b, times the second equation from b, times the first: 
(ba, — by a2)x =(b2d, — by dy). (7) 


What you see in those parentheses are 2 by 2 determinants! Remember from 
Section 11.3: 

a, b a, »b; 
= a,b, —ayb,. 


a> b, ay 22 


. ~ 1 « 
The determinant of is the number 


This number appears on the left side of (6) and (7). The right side of (7) is also a 
determinant—but it has d’s in place of a’s. The right side of (6) has d’s in place of 
b’s. So x and y are ratios of determinants, given by Cramer’s Rule: 


44H Cramer's Rule The solution is x = 


The investment example is solved by three determinants from the three columns: 
l 1 5000 1 5000 
05 .10 400 .10 05 400 


= 150. 


Cramer’s Rule has x = 100/.05 = 2000 and y = 150/.05 = 3000. This is the solution. 
The singular case is when the determinant of A is zero—and we can’t divide by it. 


441 Cramer’s Rule breaks down when det A = 0—which is the singular case. 


Then the lines in the row picture are parallel, and one column is a multiple of 
the other column. 
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EXAMPLE 3 The lines 2x + y=0, 2x + y= 1 are parallel. The determinant is zero: 


EEC 


The lines in Figure 11.17a don’t meet. Notice the columns: [3} is a multiple of [}]. 

One final comment on 2 by 2 systems. They are small enough so that all solution 
mcthods apply. Cramer’s Rule uses determinants. Larger systems use elimination 
(3 by 3 matrices are on the borderline). A third solution {the same solution!) comes 
from the inverse matrix A~', to be described next. But the inverse is more a symbol 
for the answer than a new way of computing it, because to find A~! we still use 
determinants or elimination. 


Il 
S 


THE INVERSE OF A MATRIX 


The symbol A~' is pronounced “A inverse.” It stands for a matrix—the one that 
solves Au=d. I think of A as a matrix that takes u to d. Then A~! is a matrix that 
takes d back to u. If Au=d then u= A~ ‘d (provided the inverse exists). This is exactly 
like functions and inverse functions: g(x) = y and x = g~ *(y). Our goal is to find A! 
when we know A. 

The first approach will be very direct. Cramer’s Rule gave formulas for x and y, 
the components of u. From that rule we can read off A’, assuming that D= 
a,b, — ab, is not zero. D is det A and we divide by it: 


bad; — bid eee 
Cramer, a=t) °F 7?) quits Atd= |” | fies (8) 
D —a,d,t+a,d, D — a, a; d, 


The matrix on the right (including 1/D in all four entries) is 4~!. Notice the sign 
pattern and the subscript pattern. The inverse exists if D is not zero—this is impor- 
tant. Then the solution comes from a matrix-vector multiplication, A~* times d. We 
repeat the rules for that multiplication: 


DEFINITION A matrix M times a vector v equals a vector of dot products: 


ie al he | 
My= y|= : (9) 
row 2 (row 2)+y¥ 


Equation (8) follows this rule with Af = A~* and y= d. Look at Example 1: 


1 1 1f 10 -1 2 —20 
A= , detA=05, At=— = 
05.10 05}-05 1} |[-1 20 


There stands the inverse matrix. It multiplies d to give the solution u: 
Pera 2 = A Be 7 (2}(5000) — via _ pea 
=} 20 |} 400 (— 1)(5000) + (20)(400) 3000 
The formulas work perfectly, but you have to see a direct way to rcach A~ 1d. Multiply 


both sides of Au=d by A~!. The multiplication “cancels” A on the left side, and 
leaves u= A ‘d. This approach comes next. 
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MATRIX MULTIPLICATION 


To understand the power of matrices, we must multiply them. The product of A~! 
with Au is a matrix times a vector. But that multiplication can be done another way. 
First A~' multiplies A, a matrix times a matrix. The product A~ ‘A is another matrix 
(a very special matrix). Then this new matrix multiplies u. 

The matrix-matrix rule comes directly from the matrix-vector rule. Effectively, a 
vector v is a matrix V with only one column. When there are more columns, M times 
V splits into separate matrix-vector multiplications, side by side: 


DEFINITION A matrix M times a matrix V equals a matrix of dot products: 


row | (row l)-¥, (row 1)*y¥, 
V= y V2/= i (10) 
row 2 (row 2)*v, (row 2)+¥> 
2s 6 eS+2Z°7 FO F2"8 19 22 
EXAMPLE 4 = = ‘ 
3 4]/7 8 3°5+4°7 3°64+4°8 43 50 


1 0 
EXAMPLE5 Multiplying A~' times A produces the “identity matrix” ; | 


A'A= 
D D 


b, —b; ina" 0 
— a2 4; }la, by 0 — ab, + a,b, i 0 
a. et et OO 


a, by, 


This identity matrix is denoted by /. It has 1’s on the diagonal and 0’s off the diagonal. 
It acts like the number |. Every vector satisfies lu =u. 


445 (Inverse matrix and identity matrix) AA~' =I and A~'A=I and lu=u: 


ee) el] EH) 


Note the placement of a, b, c,d. With these letters D is ad — bc. 


The next section moves to three equations. The algebra gets more complicated (and 
4 by 4 is worse). It is not easy to write out A~'. So we stay longer with the 2 by 2 
formulas, where each step can be checked. Multiplying Au = d by the inverse matrix 
gives A-'Au= A~'d—and the left side is Ju=u. 


-l4_ | sin 6 
. a= [sin @] 


Fig. 14.48 Rotate vy forward into Av. Rotate d backward into A~'d. 
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cos? —sin@ 


EXAMPLES 4A= 
sin@ cosé@ 


rotates every v to Ay, through the angle @. 


1 
Question 1 Where is the vector ¥ = 5 rotated to? 
Question 2 What is A+? 


0 
Question 3 Which vector u is rotated into d= Hi 


cos# —sin@}} 1 cos @ 
Solution | v rotates into Av = = . 
sin @ cos @ {1 0 sin @ 


cos@ sin@ 


Solution 2 det A=1s0 Aq! -| = rotation through —8@. 


—sin@ cos@ 


cos@ sind || 0 sin 8 
Solution3 If Au=d then u=A~'d= = ; 
—sin@ cos@]] 1 cos 6 


Historical note | was amazed to learn that it was Leibniz (again!) who proposed the 
notation we use for matrices. The entry in row i and column j is a,;. The identity 
matrix has a,, = a3, = | and a,, = a,, = 0. This is ina linear algebra book by Charles 
Dodgson—better known to the world as Lewis Carroll, the author of Alice in 
Wonderland. I regret to say that he preferred his own notation ili instead of a;;. 
‘Y have turned the symbol toward the left, to avoid all chance of confusion with {.” 
It drove his typesetter mad. 


PROJECTION ONTO A PLANE = LEAST SQUARES FITTING BY A LINE 


We close with a genuine application. It starts with three-dimensijonal vectors a, b, d 

and leads to a 2 by 2 system. One good feature: a, b, d can be n-dimensional with no 

change in the algebra. In practice that happens. Second good feature: There is a 

calculus problem in the background. The example is to fit points by a straight line. 
There are three ways to state the problem, and they look different: 


1. Solve xa + yb=d as well as possible (three equations, two unknowns x and y). 
2. Project the vector d onto the plane of the vectors a and b. 
3. Find the closest straight line (“‘deast squares’) to three given points. 


Figure 11.19 shows a three-dimensional vector d above the plane of a and b. Its 
projection onto the plane is p= xa + yb. The numbers x and y are unknown, and 
our goal is to find them. The calculation will use the dot product, which is always 
the key to right angles. 

The difference d — pis the “error.” There has to be an error, because no combination 
of a and b can produce d exactly. (Otherwise d is in the plane.) The projection p is 
the closest point to d, and it is governed by one fundamental law: The error is 
perpendicular to the plane. That makes the error perpendicular to both vectors a 
and b: 

a:(xa+ yb—d)=0 and b+ {xa + yb— d)=0. (13) 
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Rewrite those as two equations for the two unknown numbers x and y: 
(ava)x + (a~b)y =a-d 
(b-a)x + (b ~b)y = b wd. 


These are the famous normal equations in statistics, to compute x and y and p. 


(14) 


EXAMPLE? For a=(l, t, 1) and b=(1, 2, 3) and d=(0, 5, 4}, solve equation (14): 
see Gps 9. ead : 
ves sO p= —a+2b=(1, 3, 5)= projection. 
6x + 14y = 22 Bt ye 2 B { ) = projection 


Notice the three equations that we are not solving {we can’t): xa + yh =d is 


x+ p=0 1 1 
x+2y=5 with the 3 by 2 matrix A=j;]1 2 (15} 
x+3y=4 1 3 


For d= (0, 5, 4) there is no solution; d is not in the plane of a and b, For p=(l, 3, 5) 
there is a solution, x= — 1 and y=2. The vector p is in the plane. The error d—p 
is (—1, 2, —1). This error is perpendicular to the columns {1, 1, {) and (1, 2, 3), so it is 
perpendicular to their plane. 


SAME EXAMPLE (written as a line-fitting problem) Fit the points (1, 0) and (2, 5) and 
(3, 4) as closely as possible (“least squares”’) by a straight line. 


Two points determine a line. The example asks the line f= x + yt to go through three 
points. That gives the three equations in (15), which can’t be solved with two un- 
knowns. We have to settle for the closest line, drawn in Figure 11.19b, This line is 
computed again below, by calculus. 

Notice that the closest line has heights 1, 3,5 where the data points have heights 
0, 5, 4. Those are the numbers in p and d! The heights 1, 3, 5 fit onto a line; the heights 
0, 5, 4 do not. In the first figure, p=(1, 3, 5) is in the plane and d = (0, 5, 4) is not. 
Vectors in the plane lead to heights that lie on a line. 

Notice another coincidence. The coefficients x = — 1 and y = 2 give the projection 
—a+ 2b. They also give the closest line f= — 1 + 2t. All numbers appear in both 
figures, 


projection 1 
p=-a+t2b= 3] 


5 


Fig. 11.49 Projection onto plane is (1, 3, 5) with coefficients —1, 2, Closest line has heights 
1, 3, 5 with coefficients —1, 2, Error in both pictures is —1, 2, —1. 


Read-through questions 


The equations 3x + y= 8 and x + y= 6 combine into the vec- 


ao +y_b “=_c¢_ =d. The left side is A _ x 3 
Au, with coefficient matrix A=__d__ and unknown vector B a|= , {a b MD ial on io , 
. The determinant of A is ___t _, so this problem a y 
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Remark Finding the closest line is a calculus problem: Minimize a sum of squares. 
The numbers x and y that minimize E give the least squares solution: 


E(x, y)=(x + y—0)? + (x + 2y — 5)? + (x + 3y — 4)”. (16) 


Those are the three errors in equation (15), squared and added. They are also the 
three errors in the straight line fit, between the line and the data points. The projection 
minimizes the error (by geometry), the normal equations (14) minimize the error (by 
algebra), and now calculus minimizes the error by setting the derivatives of E to zero. 

The new feature is this: E depends on two variables x and y. Therefore E has two 
derivatives. They both have to be zero at the minimum. That gives two equations for 
x and y: 


x derivative of E is zero: 2(x + y)+2(x+2y—5) +2(x+3y—-4) =0 
y derivative of E is zero: 2(x + y) + 2(x + 2y — 5)(2) + 2(x + 3y — 4)(3) = 0. 


When we divide by 2, those are the normal equations 3x + 6y=9 and 6x + l4y= 
22. The minimizing x and y from calculus are the same numbers —1 and 2. 

The x derivative treats y as a constant. The y derivative treats x as a constant. 
These are partial derivatives. This calculus approach to least squares is in Chapter 13, 
as an important application of partial derivatives. 


We now summarize the least squares problem—to find the closest line to n data 
points. In practice n may be 1000 instead of 3. The points have horizontal coordinates 
b,, b>, ...,b,. The vertical coordinates are d,,d,,...,d,. These vectors b and d, 
together with a=(1, 1, ..., 1), determine a projection—the combination p= xa + yb 
that is closest to d. This problem is the same in n dimensions—the error d— p is 
perpendicular to a and b. That is still tested by dot products, p-a=d-a and p-b= 
d-b, which give the normal equations for x and y: 


(asa)x +(a*b)y=a-d (n) x + (Zb,;)y = Xd; 


or (17) 
(b-a)x +(b+b)y=b-d (Xb;)x + (Zb?)y = Lb, d;. 


41K The least squares problem projects d onto the plane of a and b. The 


projection is p= xa + yb, in n dimensions. The closest line is f= x + yt, in two 
dimensions. The normal equations (17) give the best x and y. 


41.4 EXERCISES 


te. r_/D. 
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A matrix-vector multiplication produces a vector of dot 


m_ from the rows, and also a combination of the __n 


| 


isnot __g__. The row picture shows two intersecting __h If the entries are a, b, c, d, the determinant is D = a* 
The column picture shows xa + yb=d, wherea=__|__ and is [__p__] divided by D. Cramer’s Rule shows components 
. The inverse matrix is A~'=__k _. The solution of u= A” 'd as ratios of determinants: x = /D and y= 
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A matrix-matrix multiplication MV yields a matrix of dot 
products, from the rows of __% and the columns of _ + 


ea a Pa 
a oe ee 


The last line contains the _u__s matrix, denoted by /, Jt has 


the property that /A=Azl=__v___ for every matrix A, and 


lu=__ __ for every vector un. The inverse matrix satisfies 
A-'A=__x_. Then Au=d is solyed by multiplying both 
sides by_¥__,togiveu=__2__. There is no inverse matrix 
when A 


The combination xa + yb is the projection of d when the 
error _8 _ is perpendicular to __¢ and _D_. If a= 
(1, 1, 1), b= (1, 2, 3), and d = (0, 8, 4), the equations for x and 
yare__€ __. Solving them also gives the closest__F_ to the 
data points (1, 0), © _, and (3, 4}. The solution is x = 0, y= 
2, which means the best line is _4 . The projection is 
Oa+2b=__!_. The three error components are _J_. 
Check perpendicularity:_K =Oand _t =O. Applying 
calculus to this problem, x and » minimize the sum of squares 
E= M 


In 1-8 find the point (x, y) where the two lines intersect (if they 
do). Also show how the right side is a combination of the 
columns on the left side (if it is). Also find the determinant D. 


lx+y=7 Z2x+y=ll 
x—y=3 x+y=6 
3 3x-— y=8 4 x+2y=3 
x—3y=0 2x+4y=7 
§ 2x—4y=0 6 10x+y=1 
x—2y=0 x+ye=l 
7 ax+ by=0 8 ax+by=1 
2ax + 2by =2 extdy=1 


9 Solve Problem 3 by Cramer’s Rule. 
19 Try to solve Problem 4 by Cramet’s Rule. 
11 What are the ratios for Cramer’s Rule in Problem 5S? 
12 If A=/ show how Cramer's Rule solves Au=d. 
13 Draw the row picture and column picture for Probiem [. 
14 Draw the row and column pictures for Problem 6, 
15 Find A~’ in Problem 1. 
16 Find A~! in Problem 8 if ad — be = 1. 


17 A 2 by 2 system ts singular when the two lines in the row 
picture . This system is still solvable if one equation 
isa of the other equation. In that case the two fines 
are and the number of solutions is 


18 Try Cramer’s Rule when there is no solution or infinitely 
many; 


3x+ y=0 3x+ y=l 


6x +2y=2 6x + 2y=2. 


19 Au=d is singular when the columns of A are : 
A solution exists if the right side d is . In this solvable 
case the number of solutions is 


20 The equations x — y= d, and 9x —9y=d, can be solved 
if ; 


21 Suppose x=} billion people live in the U.S. and y=5 
billion live outside. If 4 per cent of those inside move out and 
2 per cent of those outside move in, find the populations d, 
inside and d, outside after the move. Express this as a matrix 
multiplication Au =d (and find the matrix). 


22 In Problem 21 what is special about @, + a, and 6, + by 
{the sums down the columns of A)? Explain why d, + dz equ- 
als x + y. 


23 With the same percentages moving, suppose d, = 0.58 bil- 
lion are inside and d, = 4,92 billion are outside at rhe end. Set 
up and solve two equations for the original populations x 
and y, 

24 What is the determinant of A in Probiems 21-23? What 
is A~'? Check that A7'A =J. 


25 The equations ax + y=0, x+ay=0 have the solution 
x = y=0. For which two values of a are there other solutions 
(and what are the other solutions)? 


26 The equations ax + by = 0, cx + dy = 0 have the solution 
x=y=0. There are other solutions if the two lines are 
. This happens if a, b, c, d satisfy P 


27 Find the determinant and inverse of A=(2 74). Do the 
same for 2A, A~', — A, and f. 


28 Show that the determinant of A~? is 1/det A: 
| ai(ad—be) = =—d/{ad — | 


—ciad—be) af(ad — be) 
29 Compute AB and BA and also BC and CB; 


“E ] Py] hy 


Verify the associative law: AB times C equals A times BC. 


30 (a) Find the determinants of A, B, AB, and BA above. 
(b) Propose a law for the determinant of BC and test it. 


ab e 
31 For a-| | a-| 
cf ld g 


factor its determinant into (ad —bc)(eh —fg). Therefore 
det(AB) = (det A)(det B). 


4 er 
‘. write out AB and 
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32 Usually det(A +5) does not equal det A+det B. Find 
examples of inequality and equality. 


33 Find the inverses, and check A~14 =I and BB”! =<], for 


14 2 2 
A= and B= . 
0 2 0 1 


34 In Problem 33 compute AB and the inverse of AB. Check 
that this inverse equals B~! times A~!. 


35 The matrix product ABB~*A~?! equals the mat- 
rix, Therefore the inverse of AB is - Important: The 
associative law in Problem 29 allows you to multiply BB™' 
first. 


36 The matrix multiplication C-'B~*A™~'ABC yields the 
matrix. Therefore the inverse of ABC is 


37 The equations x + 2y+3z and 4x+5y+cz=0 always 
have a nonzero solution. The vector u= (x, y, z) is required 
to be to y=(1, 2, 3} and w =(4, 5, c). So choose u= 


38 Find the combination p=xa+ yb of the vectors a= 
(1, 1, 1) and b=({—1, 0, 1) that comes closest to d =(2, 6, 4). 
(a) Solye the normal equations (14) for x and y. (b) Check that 
the error d —p is perpendicular to a and b. 
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39 Plot the three data points (—1, 2), (0, 6), (1, 4) in a plane. 
Draw the straight line x+ yt with the same x and y as in 
Problem 38. Locate the three errors up or down from the data 
points and compare with Problem 38. 


40 Solve equation (14) to find the combination xa + yb of 
a=(l[, 1, 1) and b=(—1, 1, 2) that is closest to d=(1, 1, 3). 
Draw the corresponding straight line for the data points 
(—1, 1), (1, 1), and (2, 3). What is the vector of three errors and 
what is it perpendicular to? 


41 Under what condition on @,,d,,¢3 do the three points 
(0, @;)}, (1, 42), (2, 23) lie on a line? 


42 Find the matrices that reverse x and y and project: 
x y x x 
M ts and P = ‘ 
y x y 0 


> eee) 
43 Multiplying by P -| , q projects u onto the 45° line. 


(a) Find the projection Pu of u= [$]. 
(b) Why does P times P equal P? 
{c) Docs P~? exist? What vectors give Pu = 0? 


44 Suppose u is not the zero vector but Au=9, Then A™’ 
can’t exist: It would multiply and produce u, 


This section moves from two to three dimensions. There are three unknowns x, y, z 
and also three equations. This is at the crossover point between formulas and 
algorithms—it is real linear algebra. The formulas give a direct solution using det- 
erminants. The algorithms use elimination and the numbers x, y,z appear at the 
end. In practice that end result comes quickly. Computers solve linear equations by 
elimination. 

The situation for a nonlinear equation is similar. Quadratic equations 
ax? + bx +¢=0 are solved by a formula. Cubic equations are solved by Newton’s 
method (even though a formula exists). For equations involving x* or x}°, algorithms 
take over completely. 

Since we are at the crossover point, we look both ways. This section has a lot to 
do, in mixing geometry, determinants, and 3 by 3 matrices: 


1. The row picture: three planes intersect at the solution 

2. The column picture: a vector equation combines the columns 
3. The formulas: determinants and Cramer’s Rule 

4. Matrix multiplication and A~? 

5. The algorithm: Gaussian elimination. 


Part of our goal is three-dimensional calculus. Another part is n-dimensional algebra, 
And a third possibility is that you may not take mathematics next year. If that 
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happens, I hope you will use mathematics. Linear equations are so basic and impor- 
tant, in such a variety of applications, that the effort in this section is worth making. 

An example is needed. It is convenient and realistic if the matrix contains zeros. 
Most equations in practice are fairly simple—a thousand equations each with 990 
zeros would be very reasonable. Here are three equations in three unknowns: 


x+ y = | 
x +2z= 0 (1) 
—2y+2z= -4. 


In matrix-vector form, the unknown u has components x, y, z. The right sides 1,0, —4 
go into d. The nine coefficients, including three zeros, enter the matrix A: 


1 24 Oz 1 
1 0 2//y]= 0} or Au=d. (2) 
;=—2 Zils = 


The goal is to understand that system geometrically, and then solve it. 
THE ROW PICTURE: INTERSECTING PLANES 


Start with the first equation x + y= 1. In the xy plane that produces a line. In three 
dimensions it is a plane. It has the usual form ax + by + cz = d, except that c happens 
to be zero. The plane is easy to visualize (Figure 11.20a), because it cuts straight down 
through the line. The equation x + y= 1 allows z to have any value, so the graph 
includes all points above and below the line. 

The second equation x + 2z=0 gives a second plane, which goes through the 
origin. When the right side is zero, the point (0, 0, 0) satisfies the equation. This time y 
is absent from the equation, so the plane contains the whole y axis. All points (0, y, 0) 
meet the requirement x + 2z=0. The normal vector to the plane is N=i+ 2k. The 
plane cuts across, rather than down, in 11.20b. 

Before the third equation we combine the first two. The intersection of two planes 
is a line. In three-dimensional space, two equations (not one) describe a line. The 
points on the line have to satisfy x + y= 1 and also x + 2z=0. A convenient point 
is P=(0, 1, 0). Another point is Q@ = (—1, 2, 4). The line through P and Q extends out 
in both directions. 

The solution is on that line. The third plane decides where. 


intersection 
line of first 
two planes 


third plane —2y + 2z =-4 


x 


Fig. 14.20 First plane, second plane, intersection line meets third plane at solution. 
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The third equation —2y + 2z = — 4 gives the third plane—which misses the origin 
because the right side is not zero. What is important is the point where the three 
planes meet. The intersection line of the first two planes crosses the third plane. 
We used determinants (but elimination is better) to find x= —2, y=3, z= 1. This 
solution satisfies the three equations and lies on the three planes. 

A brief comment on 4 by 4 systems. The first equation might be x + y+ z—1=0. 
It represents a three-dimensional “hyperplane” in four-dimensional space. (In physics 
this is space-time.) The second equation gives a second hyperplane, and its intersection 
with the first one is two-dimensional. The third equation (third hyperplane) reduces 
the intersection to a line. The fourth hyperplane meets that line at a point, which is 
the solution. It satisfies the four equations and lies on the four hyperplanes. In this 
course three dimensions are enough. 


COLUMN PICTURE: COMBINATION OF COLUMN VECTORS 


There is an extremely important way to rewrite our three equations. In (1) they were 
separate, in (2} they went into a matrix. Now they become a vector equation: 


1 1 0 1 
x{ilty] of+z}/2]=] of. (3) 
0 ~2 2 -4 


The columns of the matrix are multiplied by x, ), z. That is a special way to see matrix- 
vector multiplication: Au is @ combination of the columns of A. We ate looking for 
the numbers x, y, z so that the combination produces the right side d. 

The column vectors a,b, ¢ are shown in Figure 11.2la. The vector equation is 
xa + yh+ zc =d. The combination that solves this equation must again be x = — 2, 
y =3, z=1. That agrees with the intersection point of the three planes in the row 
picture. 


f : 
4, 8,b,cin 


newe 4, same plane 
é 


4 ¥ e d not in that plane: 
‘ No solution 


WY 0=1c+2b-2a 


Aig. 14.24 Columns combine to give d. Columns combine to give zero (singular case). 


THE DETERMINANT AND THE INVERSE MATRIX 


For a 3 by 3 determinant, the section on cross products gave two formulas. One was 
the triple product a+(b x e). The other wrote out the six terms: 


det A=a “(b x e)= a,(b2¢3 = b3¢2) + a,(b3¢, = byc3) + a3(b,C2 ra bz¢,). 
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Geometrically this is the volume of a box. The columns a, b, ¢ are the edges going out 
from the origin. In our example the determinant and volume are 2: 


a) b, Cy 1 


(DYO2)— OE YQ) + YE 2)O) 


a2 by C2) >= 1 0 = = 2. 


7 1}(2) — (O)(0 
dy bs |. 10-2 2 (1)(1)(2) + (O)(1)(2) — (0)(0)(0) 


A slight dishonesty is present in that calculation, and will be admitted now. In 
Section 11.3 the vectors A, B, C were rows. In this section a, b, ¢ are columns. It doesn’t 
matter, because the determinant is the same either way. Any matrix can be 
“transposed” —exchanging rows for columns— without altering the determinant. The 
six terms (a,b,c, is the first) may come in a different order, but they are the same six 
terms. Here four of those terms are zero, because of the zeros in the matnx. The sum 
of all six terms is D = det A = 2. 

Since D is not zero, the equations can be solved. The three planes meet at a point. 
The column vectors a, b, e produce a genuine box, and are not flattened into the same 
plane (with zero volume), The solution involves dividing by D—which is only possible 
if D = det A is not zero. 


The 3 by 3 identity matrix / is at the end of equation (5). Always /u=u. 
We now compute A~', first with letters and then with numbers. The neatest 
formula uses cross products of the columns of A—it is special for 3 by 3 matrices. 


bxe 
exa |. (4) 


axb 


Every entry is divided by D: The inverse matrix is A~' = 


Sl 


To test this formula, multiply by A. Matrix multiplication produces a matrix of dot 
products—from the rows of the first matrix and the columns of the second, A~'A =I: 


bxe a'(bxe) be(bxc) c‘(bxc) 10 0 
‘ cxa a be =< ac(exa) be(exa) ec(exa) | =]/0 1 OF]. (5) 
axhb a-{axb) b+(a xb) ¢:(a xb) 0 0 1 


On the right side, six of the triple products are zero. They are the off-diagonals like 
b-(b x c), which contain the same vector twice. Since b x c is perpendicular to b, this 
triple product is zero. The same is true of the others, like a+(a x b) = 0. That is the 
volume of a box with two identical sides. The six off-diagonal zeros are the volumes 
of completely flattened boxes. 

On the main diagonal the triple products equal D. The order of vectors can be abe 
or bea or cab, and the volume of the box stays the same. Dividing by this number D, 
which is placed outside for that purpose, gives the 1’s in the identity matrix J. 

Now we change to numbers. The goal is to find A~' and to test it. 


1 {t O 4 -2 2 
EXAMPLE 4 The inverseofA=|1 0 2]is A -'==-| -2 2 -2 
0-2 2 —2 2-1 


i ot 
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That comes from the formula, and it absolutely has to be checked. Do not fail to 
multiply A~* times A (or A times A~?), Matrix multiplication is much easier than 
the formula for A~?. We highlight row 3 times column 1, with dot product zero: 


4-2 2])1 1 =O 4-2 4-4 -4+4 1 0 0 
; —-2 2 -2//1 0 2 =; —-2+2 -2+4 4-4]=|0 i 90 
—2 2 -111;0-2 2 22 m2b2> a=2 00 1 


Remark on A~' Inverting a matrix requires D #0. We divide by D = det A. The 
cross products b x c and c x a and a xb give A~* in a neat form, but errors are 
easy. We prefer to avoid writing i,j,k. There are nine 2 by 2 determinants to be 
calculated, and here is A~! in full—containing the nine “cofactors” divided by D: 


bac, — b3¢, b3cy —byeq yc. — bc, 
A-*=—| ¢9€3;—¢3@) €38,—0,43 €,4,—c2a, |. (6) 
Qb,—@3b, a3b,—a,b, a,b,— ab, 
Important: The first row of A~‘ does not use the first column of A, except in 1/D. 


In other words, b x ¢ does not involve a. Here are the 2 by 2 determinants that 
produce 4, —2, 2—which is divided by D = 2 in the top row of A~?: 


1 t Oj]; 1 F OFFt 1 9 5 io 
1 0 2/)//1 0 27/1 0 2 See cere (7) 
0-2 2/{/0-2 2|/0-2 2 aS 


The second highlighted determinant looks like +2 not —2. But the sign matrix on 
the right assigns a minus to that position in A~*. We reverse the sign of b,c, — 63¢1, 
to find the cofactor b3c, — b,c, in the top row of (6). 


To repeat: For a row of A~', cross out the corresponding column of A. Find the three 
2 by 2 determinants, use the sign matrix, and divide by D. 


1 1 f 1 -l 0 
EXAMPLE2 B=/0 1 1} has D=1 and B-'=|0 L. =1 I: (8) 
001 0 oO 1 


The multiplication BB~' = IJ checks the arithmetic. Notice how | } in B leads to a 
zero in the top row of B”'. To find row 1, column 3 of B~* we ignore column 1 and 
row 3 of B. (Also: the inverse of a triangular matrix is triangular.) The minus signs 
come from the sign matrix. 


THE SOLUTION u= A ‘d 


The purpose of A~' is to solve the equation Au=d. Multiplying by A~' produces 
Iu=A~‘d. The matrix becomes the identity, Ju equals u, and the solution is 
immediate: 


bxc d-(b x c) 
cXa d =5 d-(c x a) }. (9) 
axhb d-(a x b) 


u=A d= 


vie 


A39 
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By writing those components x, y, z as ratios of determinants, we have Cramer's Rule: 


44M (Cramer’s Rule) 


ld be} jade bd 
labcl > jabel| ~ label’ 


The solution is x = (10) 


The right side d replaces, in turn, columns a and b and c. All denominators are D = 
a*(b xc). The numerator of x is the determinant d-{(b x ¢c) in (9), The second numera- 
tor agrees with the second component d-(c x a), because the cyclic order is correct. 
The third determinant with cotumns abd equals the triple product d-(a x b)in A~ ‘u. 
Thus (10) is the same as (9). 


EXAMPLE A: Multiply by 4 ' to find the known solution x = — 2, y=3,z=1: 


4 -2 2 ! 4-8 2 
1 ] 
= “la |] — _ = — —?+ = 
u=A_‘d 5 2 2 2 0 5 24+8 3 
ap, 22 aie |e 244 1 


EXAMPLE B: Multiply by B~' to solve Bu=d when d is the column (6, 5, 4): 
) ee | 0 6 1 111 1 
u=B'd=| 0 iS Sl St=| 1]. Check Bu=|0 1 1 1 |= 
0 I 4 4 00 1 4 


bk A oO 


EXAMPLE C: Put d= (6, 5, 4) in each column of B. Cramer’s Rule gives u = (1, 1, 4): 
6 1 1 I 
5 1 It=1 
4 0 1 


6 1 1 1 6 rt 1 1 
5 1j=t [0 1 5) =4 all divided by D=/0 1 1]=1, 
4 


0 
0 1 00 4 00 1 

This rule fills the page with determinants. Those are good ones to check by eye, 
without writing down the six terms (three + and three —). 

The formulas for A~! are honored chiefly in their absence. They are nat used by 
the computer, even though the algebra is in some ways beautiful. In big calculations, 
the computer never finds A~'—just the solution. 

We now took at the singular case PD = 0. Geometry—algebra—algorithm must ail 
break down. After that is the algorithm: Gaussian elimination. 


THE SINGULAR CASE 


Changing one entry of a matrix can make the determinant zero. The triple product 
a‘(b xc), which is also the volume, becomes D=0. The box is flattened and the 
matrix is singular. That happens in our example when the lower right entry is changed 
from 2 to 4: 

1 ft 0 


S=]1 0 2 | has determinant D= 0. 
O-2 4 
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This does more than change the inverse. It destroys the inverse. We can no longer 
divide by D. There is no S~'. 

What happens to the row picture and column picture? For 2 by 2 systems, the 
singular case had two parallel lines. Now the row picture has three planes, which 
need not be parailel. Here the planes are not parallel. Their normal vectors are the 
rows of S, which go in different directions. But somehow the planes fail to go through 
a common point. 

What happens is more subtle. The intersection line from two planes misses the 
third plane. The line is parallel to the plane and stays above it (Figure 11.22a). When 
all three planes are drawn, they form an open tunnel. The picture tells more than the 
numbers, about how three planes can fail to meet. The third figure shows an end 
view, where the planes go directly into the page. Each pair meets in a line, but those 
lines don’t meet in a point. 


x 
7 lane 


Pp 
\ &y . 
oe : ey 


< 


tunnel between 
planes end view 


Ag. 14.22 The row picture in the singular case: no intersection point, no solution. 


When two planes are parallel, the determinant is again zero. One row of the matrix 
is a multiple of another row. The extreme case has ail three planes parallel—as in a 
matrix with nine 1’s. 

The column picture must also break down. In the 2 by 2 failure (previous section), 
the columns were on the same line. Now the three columns are in the same plane. The 
combinations of those columns produce d only if it happens to lie in that particular 
plane. Most vectors d will be outside the plane, so most singular systems have no 
solution. 

When the determinant is zero, Au=d kas no solution or infinitely many. 


THE ELIMINATION ALGORITHM 
Go back to the 3 by 3 example Au = d. If you were given those equations, you would 


never think of determinants. You would— quite correctiy—start with the first equa- 
tion. It gives x = 1— y, which goes into the next equation to eliminate x: 


xt+ y = 1 

x=l-y 
x +2z= 0 ——— I1- yr2z= 0 
—2y+2z= —-4 —2y+2z= —-4., 


Stop there for a minute. On the right is a 2 by 2 system for y and z. The first equation 
and first unknown are eliminated—-exactly what we want. But that step was not 
organized in the best way, because a ‘‘1”° ended up on the left side. Constants should 
stay on the right side—the pattern should be preserved. It is better to take the same 
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step by subtracting the first equation from the second: 


x+y = 1 
x +2z= 0 —- yt2z=-1 (11) 
-—2y+2z= —-4 —2y+2z= -4, 


Same equations, better organization. Now look at the corner term — y. Its coefficient 
—1 is the second pivot. (The first pivot was +1, the coefficient of x in the first corner.) 
We are ready for the next elimination step: 


Plan: Subtract a multiple of the “pivot equation” from the equation below it. 
Goal: To produce a zero below the pivot, so y is eliminated. 
Method: Subtract 2 times the pivot equation to cancel — 2y. 


~ yt2z=-1 


= (12) 
—2y+2z=—-4 ~2z=—2. 


The answer comes by back substitution. Equation (12) gives z = 1. Then equation (11) 
gives y= 3. Then the first equation gives x = — 2. This is much quicker than determi- 
nants. You may ask: Why use Cramer’s Rule? Good question. 

With numbers elimination is better. It is faster and also safer. (To check against 
error, substitute —2, 3,1 into the original equations.) The algorithm reaches the 
answer without the determinant and without the inverse. Calculations with letters use 
det A and A~}, 

Here are the steps in a definite order (top to bottom): 


Subtract a multiple of equation 1 to produce 0x in equation 2 
Subtract a multiple of equation 1 to produce Ox in equation 3 
Subtract a multiple of equation 2 (new) to produce Oy in equation 3. 


EXAMPLE (notice the zeros appearing under the pivots): 
xt yt z= 1 x+ y+ z=1 x+ yptz=1 
2x+5yt+3z= 7 - 3y+ z=5 3 3y+z=5 
4x+ 7y+6z= 11 3y+22=7 z= 2, 


Elimination leads to a triangular system. The coefficients below the diagonal are zero. 
First z = 2, then y= 1, then x = — 2. Back substitution solves triangular systems (fast). 


As a final example, try the singular case Su = d when the corner entry is changed 
from 2 to 4. With D =0, there is no inverse matrix §~‘. Elimination also fails, by 
reaching an impossible equation 0 = — 2: 


xt y = | x+ y = | xty = | 
x +22= O4 — pt2?z=-14 —yt2z=-1 
=2y+d2= —4 —2y+4z=—4 O=-2 


The three planes do not meet at a point-—-a fact that was not obvious at the start. 
Algebra discovers this fact from D = 0. Elimination discovers it from 0 = —2. The 
chapter is ending at the point where my linear algebra book begins. 


11.5 Linear Algebra 443 


One final comment. In actual computing, you will use a code written by profession- 
als. The steps will be the same as above. A multiple of equation 1 is subtracted from 
each equation below it, to eliminate the first unknown x. With one fewer unknown 
and equation, elimination starts again. (A parallel computer executes many steps at 
once.) Extra instructions are included to reduce roundoff error. You only see the 
result! But it is more satisfying to know what the computer is doing. 

In the end, solving linear equations is the key step in solving nonlinear equations. 
The central idea of differential calculus is to linearize near a point. 


44.5 EXERCISES 


Read-through questions 


Three equations in three unknowns can be written as Au= 
d. The a uwhascomponents x,y,zand Aisa_b__. The 
row picture has a _¢ ___ for each equation. The first two 
planes intersect ina__4_, and all three pianes intersect in 
a_@_, which is__t . The column picture starts with 
vectors a, b, ¢ from the columns of __g _ and combines them 
to produce _h__. The vector equationis i =d. 


The determinant of A is the triple product _J___. This is 
the volume of a box, whose edges from the origin are __k 
Ifdet A=__| then the system is _m__. Otherwise there 
isan matrixsuchthatA~'d= 0 (the __p__miat- 


rix). In this case the solution to Au=disu=_ 4 


The rows of A~* are the cross products bxe, 
s__, divided by D. The entries of A~' are 2 by 2_t 


divided by D. The upper left entry equals ou. The 2 by 2 
determinants needed for a row of A~! do not use the corre- 


sponding _y¥__ of A. 


The solution is u= A” ‘d. Its first component x is a ratio 
of determinants, |dbe| divided by _w . Cramer’s Rule 
breaks down when det A=__x__. Then the columns a, b,c 
lte in the same __y__. There is no solution to xa + pb + ze = 
d, if dis not on that 2. In a singular row picture, the 
intersection of planes 1 and 2is A ___ to the third plane. 


In practice u is computed by _ 8. The algorithm starts 
by subtracting a multiple of row 1 to eliminate x from __¢ 
If the first two equations are x -—y=1 and 3x+2z=7, this 
elimination step leaves __® __. Similarly x is eliminated from 
the third equation, and then _ © _ is elimtnated. The equ- 
ations are solved by back _F__. When the system has no 
solution, we reach an impossible equation like _@__. The 
example x — » = i, 3x +z =7 has no solution if the third equ- 
ation is __H 


Rewrite 1-4 as matrix equations Au = d (do not solve). 


1 d=(G, 0, 8)isacombination ofa = (1, 2, 0)and b = (2, 3, 2) 
and ec = (2, 5, 2). 


2 The planes x + y=0,x+ y+z=1,and y+ 2 =O intersect 
at u=(x, y, 2). 


3 The point u=(x,»,2) is on the planes x=y, y=z, 
x-z=l. 


4 A combination of a=(!, 0,0) and b=(0, 2,0) and c= 
(0, 0, 3) equals d =(5, 2, 0). 


§ Show that Problem 3 has no solution in two ways: find 
the determinant of A, and combine the equations to produce 
0=1. 


6 Solve Problem 2 in two ways: by inspiration and Cramer's 
Rule. 


7 Solve Problem 4 in two ways: by inspection and by com- 
puting the determinant and inverse of the diagonal matrix 


1 0 0 
A=|0 2 OQ}. 
0 0 3 


8 Solve the three equations of Problem | by elimination. 


9 The vectors b and ¢ lie in a plane which is perpendicular 
to the vector . In case the vector a also lies in that 
plane, it is also perpendicular and a> =0. The 

of the matrix with columns in a plane is 


10 The plane a,x+b6,y+¢,z=4, is perpendicular to tts 
normal vector N, = . The plane a,x +5,y +¢,2= 
d, is perpendicular to Nj = . The planes meet in a 
line that ts perpendicular to both vectors, so the line is parallel 
to their product. If this line is also parallel to the 
third plane and perpendicular to N;, the system is : 
The matrix has no , which happens when 
(N, x N3)}-N,=0. 


Problems 11-24 use the matrices A, B, C. 


1 4 0 00 tf {1 -t — 
A=/0 2 6) B= 1 0) C=|-1 2 0 
0 0 3 6 4 0 ed | 3 
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11 Find the determinants | Al], |B], [C]. Since A is triangular, 
its determinant is the product 


12 Compute the cross products of each pair of columns in B 
{three cross products). 


13 Compute the inverses of A and B above. Check that 
A-'A=ifand BU'B=1. 


1 1 
14 Solve Au=| 0] and 8u =| 0 |. With this right side d, why 
0 0 


is u the first column of the inverse? 


15 Suppose all three columns of a matrix add to zero, as im 
C above. The dot product of each column with v=(1, 1, 1) is 

. All three columns lie in the same . The 
determinant of C must be 


16 Find a nonzcro solution to Cu=9. Find all solutions to 
cu=0. 


17 Choose any right side d that is perpendicular to y= 
(1, 1, 1}and solve Cu=d. Then find a second solution. 


18 Choose any right side d that is not perpendicular to ¥ = 
(1, 1, 1). Show by elimination (reach an impossible equation) 
that Cu=d has no solution. 


19 Compute the matrix product 4B and then its determinant. 
How is det AB related to det A and det B? 


20 Compute the matrix products BC and CB. All columns of 
CB add to , and its determinant is 


21 Add A and C by adding each entry of A to the corre- 
sponding entry of C. Check whether the determinant of 4 + C 
equals det A + det C. 


22 Compute 24 by multiplying each entry of A by 2. The 
determinant of 24 equals times the determinant of 
A. 


23. Which four entries of A give the upper left corner entry p 
of A7~', after dividing by D =det A? Which four entries of A 
give the entry ¢ in row 1, column 2 of A~ 19 Find p and q. 


24 The 2 by 2 determinants from the first two rows of B are 
—1 (from columns 2, 3) and —2 (from columns !, 3) and 

(from columns !, 2). These numbers go into the 
third of B~', after dividing by and chang- 
ing the sign of 


25 Why does cvery inverse matrix A~' have un inverse? 


26 From the multiplication ABB™'A™' = J it follows that 
the inverse of AB is . The separate inverses come in 

order. If you put on socks and then shoes, the 
inverse begins by taking off 


27 Find the determinants of these four permutation matrices: 


0 1 0 001 010 
P=|1 0 oO] g=/0 1 O| PQ=|0o 0 1 
001 10 0 100 


and QP = . Multiply u={x, y, z) by cach permuta- 
tion to find Pu, Qu, PQu, and QPu. 


28 Find all six of the 3 by 3 permutation matrices {including 
I), with a single | in each row and column. Which of them 
are “even” (determinant 1} and which are ‘‘odd” (determinant 
—1)? 


29 How many 2 by 2 permutation matrices are there, includ- 
ing 7? How many 4 by 4? 


30 Multiply any matrix A by the permutation matrix P and 
explain how PA is related to A. In the opposite order explain 
how AP is related to A. 


31 Eliminate x from the last two equations by subtracting 
the first equation. Then eliminate from the new third equa- 
tion by using the new second equation: 


x+ p+ 2=2 x+y =| 
(a) x+3y+3z=0 {b) x+ s=3 
x4+3y4+7z=2 y+z=5 


After elimination solve for 2, y, x (back substitution). 


32 By elimination and back substitution solve 


x+2y+22=0 x-y = 
(a) 2x+3;¢+5z=0 {b) x —z=4 
2y4+22=8 yrz=7. 


33 Ehtminate x from equation 2 by using equation 1: 
x+2y+2:=0 
2x + 4y + 5z=0 
2¥4+22=8. 
Why can’t the new second equation eliminate y from the third 
equation? Is there a solution or is the system singular? 


Note: If elimination creates a zero in the “pivot position,” 
Iry to exchange that ptvot equation with an equation below 
it. Elimination succeeds when there is a full set of pivots. 


34 The pivots in Problem 32a are 1, —1, and 4. Circle those 
as they appear along the diagonal in elimination. Check that 
the product of the pivots equals the determinant. (This is how 
determinants are computed.) 


35 Find the pivots and determinants in Problem 31. 
Eee ee 


36 Find the inverse of 4=|0 1 1 | and also of B= A?. 
00 ft 
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37 The symbol a,, stands for the entry in row i, column j, 39 Compute these determinants. The 2 by 2 matrix is invert- 
Find a,2 and a2, in Problem 36. The formula La,,5, gives ible if . The 3 by 3 matrix (is)(is not) invertible. 

the entry in which row and column of the matnx product 
AB? 


38 Write down a 3 by 3 singular matrix $ in which no two 
rows are parallel. Find a combination of rows 1 and 2 that is 
parallel to row 3. Find a combination of columns [ and 2 that 
is parallel to column 3, Find a nonzero solution to Su=0. 
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CHAPTER 42 


Motion Along a Curve 


Eee = 12.1 The Position Vector [Ey 


This chapter is about “vector functions.” The vector 2i+4j+ 8k is constant. The 
vector R(t) = ti + ¢?j + t*k is moving. It is a function of the parameter t, which often 
represents time. At each time ¢, the position vector R(t) locates the moving body: 


position vector = R(t) = x(t)i+ y(Oj+ z(Ok. ql) 


Our example has x =t, y=t?, z= 1°, As ¢ varies, these points trace out a curve in 
space. The parameter ¢ tells when the body passes each point on the curve. The 
constant vector 21+ 4j + 8k is the position vector R(2) at the instant t = 2. 


What are the questions to be asked? Every student of calculus knows the first 
question: Find the derivative. If something moves, the Navy salutes it and we differen- 
tiate it. At each instant, the body moving along the curve has a speed and a direction. 
This information is contained in another vector function—the velocity vector v(t) 
which is the derivative of R(t): 
dR dx, a dz 
W)= = Sit Fit Tk (2) 
Since i, j, k are fixed vectors, their derivatives are zero, In polar coordinates i and j 
are replaced by moving vectors. Then the velocity y has more terms from the product 
rule (Section 12.4). : 

Two important cases are uniform motion along a line and around a circle. We study 
those motions in detail (v= constant on line, ¥= tangent to circle). This section also 
finds the speed and distance and acceleration for any motion R({¢). 

Equation (2) is the computing rule.for the velocity dR/dt. It is not the definition of 
dR/dt, which goes back to basics and does not depend on coordinates: 


aR. =O ARS i, R(t + At) — RG) 
any ae aoe At 
We repeat: R is a vector so AR is a vector so dR/dt is a vector. All three vectors are 


in Figure 12.1 (t is not a vector!). This figure reveals the key fact about the geometry: 
446 The velocity v= dR/dt is tangent to the curve. 


12.1 The Position Vector 


The vector AR goes from one point on the curve to a nearby point. Dividing by 
At changes its length, not its direction. That direction lines up with the tangent to 
the curve, as the points come closer. 


EXAMPLE’ R()=tit+ 77 +k v(t)=i+ 2] + 327k 


This curve swings upward as t increases. When ¢ = 0 the velocity is y =i. The tangent 
is along the x axis, since the j and k components are zero. When t = | the velocity is 
i+ 2j+ 3k, and the curve is climbing. 

For the shadow on the xy plane, drop the k component. Posilion on the shadow 
is ti+ £7j. Velocity along the shadow is i+ 2tj. The sbadow is a plane curve. 


v¥sitj+2k 


x=f,y=2+t,z=1+2t 


x 
x=y-2= a 
Fig. 12.4 Position vector R, change AR, Ag. 12.2 Equations of a line, with and 
velocity dR/dt. without the parameter 7. 


EXAMPLE 2 Uniform motion in a straight line: the velocity vector y is constant. 
The speed and direction don’t change. The position vector moves with dR/dt =v: 
R(t}=Ro+tv (Rog fixed, v fixed, ¢ varying) (3) 


That is the equation of a line in vector form. Certainly dR/dt = v. The starting point 
Ro = Xoi + Yoj + Zok is given. The velocity v= v, i+ v,j + v3k is also given. Separating 
the x, y and z components, equation (3) for a line is 


fine with parameter: x=Xo+t¥;, Y=Vottvz, Z=Zo + lds. (4) 


The speed along the line is |v| = ./v? + »3 + v3. The direction of the line is the unit 
vector v/|v¥j. We have three equations for x, y, z, and eliminating t leaves two equations. 
The parameter ¢ equals (x — x9)/v, from equation (4). It also equals {y — yo)/v2 and 
(2 — Zo)/v3. So these ratios equal each other, and ¢ is gone: 


line without parameter: 


=2— 770. (5) 


An example is x = y/2 = z/3. In this case (x9, Yo, 29) = (0, 0, 0)the line goes through 
the origin. Another point on the line is (x, y, z) = (2, 4, 6). Because t is gone, we cannot 
say when we reach that point and how fast we are going. The equations x/4 = y/8 = 
z/12 give the same line. Without t we can't know the velocity y = dR/dt. 


EXAMPLE 3 Find an equation for the line through P = (0, 2, 1) and @ =(I, 3, 3). 


Solution We have choices! Ry can go to any point on the line. The velocity v can 
be any multiple of the vector from P to Q. The decision on Ry controls where we 
start, and vy controls our speed. 

The vector from P to Q is i+j+ 2k. Those numbers |, 1, 2 come from subtracting 
0, 2, 1 from 1, 3,3. We choose this vector i+j+ 2k as a first y, and double it for a 
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second v. We choose the vector Ry = P as a first start and Ry = Q as a second stuart. 
Here are two different expressions for the same Itne—they are P + tv and Q + i(2y): 


R(t) = (2j + k) + (i +j + 2k) R*(t) =(i + 3j + 3k) + (21 + 2j + 4k). 


The vector R(t) gives x =t, p= 24+1t,2= 1+ 2t. The vector R* is at a diflerent point 
on the same line at the same time: x* = 1 + 21, y* =342t, 2*=34 41. 

If I pick t= 1 in R and ¢ = 0 in R*, the point is (1, 3, 3). We arrive there at different 
times. You are seeing how parameters work, to tell “where” and also “when.” If ¢ 
goes from —x to +, all points on one line are also on the other line. The path ts 
the same, but the “twins” are going at different speeds. 


Question 14. When do these twins meet? When does R(t) = R*{t)? 
Answer They meet at f= — 1, when R= R* = —i+j—k. 


Question 2. What is an equation for the segment between P and Q (not beyond)? 
Answer In the equation for R(0), let 2 go from 0 to | (not beyond): 


x=t yp=24+r z=14+22 {[0<t<1 for segment]. (6) 
At t=0 we start from P =(0, 2, 1), Att=1 we reach Q =(1. 3, 3). 


Question 3. What is an equation for the line without the parameter £? 
Answer Solve equations (6) for ¢ or use (5): x/1 = (y — 2)/1 =(z — 1/2. 


Question 4. Which point on the line - eloeest to the origin? 
Answer The derivative of x? + y* + 27 =27+(2 48)? +{1 + 22)? is 8 + 82. This deriv- 
ative is zero at t= — 1, So the closest point is (—1, 1, —1). 


Question 5 Where does the line meet the plane x + y+2= 11? 
Answer Equation (6) gives x + y+2=344t=11. So7=2. The meeting point is 
xX=f=2,y=t+2=4,2=142t=5. 


Question 6 What line goes through (3, 1, 1) perpendicular to the plane x — ; — 2 = 1? 
Answer The normal vector to the plane is N=i—j—k. Thar is vy. The position 
vector to (3,1, 1) is Ro = 3i+j +k. Then R= Ry +¢v. 


COMPARING LINES AND PLANES 


A line has one parameter or two equations. We give the starting point and velocity: 
{x, 5 2) = (Xo, ¥o. Zo) + (by. #2,U3). That tells direetly which points are on the line. 
Or we eliminate : to find the two equations in (5). 

A plane has one equation or two parameters! The equation is ax + by +cer=d. 
That tells us indirectly which points are on the plane. (Instead of knowing x, 3, 2, we 
know the equation they satisfy. Instead of directions y and w in the plane, we are 
told the perpendicular direetion N =(a, b, c).) With parameters, the line contains 
R, +1¥ and the plane contains Ry + tv ~ sw. A plane looks worse with parameters 
(t and s), a line looks better. 

Questions 5 and 6 connected lines to planes. Here ure two more. See Problems 
41. 44: 


Question 7 When is the line Ry + ¢v parallel to the plane? When is it perpendicutar? 
Answer The test is ¥*° N =0. The test is ¥x N=0. 


EXAMPLE 4 Find the plane containing Py=(1,2,1) and the line of points 
(1, 0, 0) + (2,0, —1}. That vector ¥ will be in the plane. 
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Solution The vector v = 2i—k goes along the line. The vector w = 2j + k goes from 
(1, 0, 0) to (1, 2, 1}. Their cross product is 


ij k 
N=yvxw=|2 0 -1]=2i-2j+4k. 
0 2 1 


The plane 2x — 2y + 4z =2 has this normal N and contains the point (1, 2, 1). 


SPEED, DIRECTION, DISTANCE, ACCELERATION 


We go back to the curve traced out by R(t). The derivative ¥(‘) = dR/dt is the velocity 
vector along that curve. The speed is the magnitude of v: 


speed = |¥| = ./(dx/dt)? + (dy/dt)? + (dz/at)*. (7) 
The direction of the velocity vector is ¥/¥|. This is a unit vector, since ¥ is divided by 
its length. The unit tangent vector v/|v| is denoted by T. 

The tangent vector is constant for lines. It changes direction for curves. 
EXAMPLE 5 (important) Find v and [yj and T for steady motion around a circle: 
x=rcosa@f, y=rsinawt, z=. 

Solution The position vector is R =r cos wti+r sin wt j. The velocity is 
v=dR/dt = —wr sin wti+ wr cos wtj (tangent, not unit tangent) 
The speed is the radius r times the angular velocity «: 
lv = /(—or sin wt)? + (wr cos wt)? = wr. 
The unit tangent vector is v divided by |v: 
T= —sin wtitcos wt j (length 1 since sin?wt + cos*wt = 1). 


Think next about the distance traveled. Distance along a curve is always denoted 
by s (called arc length). 1 don’t know why we use s—certainly not as the initial for 
speed. In fact speed is distance divided by time. The ratio s/t gives average spced; 
ds/dt is instantaneous speed, We are back to Chapter | and Section 8.3, the relation 
of speed to distanec: 


speed |v|=ds/dt distance s = f (ds/dt) dt = j |¥(2}| de. 


Notice that |¥| and s and ¢ are scalars. The direction vector is T: 


vy dR/dt dR F 
T= iv = aside ar unit tangent vector. {8) 


In Figure 12.3, the chord length (straight) is |AR|. The arc length (curved) is As. As 
AR and As approach zero, the ratio |AR/As| approaches |[T| = 1. 


Think finally about the acceleration vector a(t). It is the rate of change of velocity 
(not the rate of change of speed): 
dv d*R d*x, d*y, dz 


= et ae ae ee 0) 
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Wl = wr A (-1,0, ®) 
T=+ “7 2t 

Wl ( ivi = V2 
In. f 
I ~~ 

§= Wi { 

wn Vx =r cos Wt 1 

y=rsinor Se x=cost,y=sint,z=f 

I 


o* R 
(1, 0,0) Shadow 


Fig. 12.3. Steady motion around a circle, Half turn up a helix. 


For steady motion along a line, as inx =¢, p=2+£, z= 1+ 21, there is no accelera- 
tion, The second derivatives are all zero. For steady motion around a circle, there is 
acceleration, In driving a car, you accelerate with the gas pedal or the brake. You 
also accelerate by turning the wheel. It is the velocity vector that changes, net the 
speed. 


EXAMPLE 6 Find the distance s(t) and acceleration a(t) for circular motion. 


Solution The speed in Example 5 is ds/dt = wr. After integrating, the distance is s = 
wort. At time f we have gone through an angle of wt. The radius is r, so the distance 
traveled agrees with wt times r. Note that the dimension of o is 1/time. (Angles are 
dimensionless.) At time t = 22/@ we have gone once around the circle—to s=2ar 
not back to s=0. 

The acceleration is a = d*R/dt?. Remember R =r cos wti+rsin wt]: 


a(t) = —w?r cos wt i— wr sin wt j. (10) 


That direction is opposite to R. This is a special motion, with no action on the gas 
pedal or the brake. All the acceleration is from turning. The magnitude is |al = ?r, 
with the correct dimension of distance/(time}’. 


EXAMPLE 7 Find vy and s and a around the helix R=cosri+sintj+tk. 
Solution The velocity is ¥= —sin ti+costj+k. The speed is 


ds/dt =|y| = ./sin2t + cos*t + 1 = ,/2 (constant). 


Then distance is s=./2¢. At time tf =7, a half turn is complete. The distance along 
the shadow is a (a half circle). The distance along the helix is oe x, because of its 
45° slope. 

The unit tangent vector is velocity/speed, and the acceleration is dv/dt: 


T=(sin tit+cos tj+k)/,/2 = —costi-sin tj. 


EXAMPLE 8 Find v and s and a around the ellipse x =cos 1, y=2 sin t, z=0. 


Solution Take derivatives: ¥= —sin ti+2cos tj and |v| = ./sin?t + 4 cos*t. This is 
the speed ds/dt. For the distance s, something bad happens {or something normal). 
The speed is not simplified by sin? + cos?t = 1. We cannot integrate ds/dt to find a 
formula for s. The square root defeats us. 

The acceleration — cos ti— 2 sin tj still points to the center, This is not the Earth 
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going around the sun. The path is an ellipse but the speed is wrong. See Section 12.4 
(the pound note) for a terrible error in the position of the sun. 


42A_ The basic formulas for motion along a curve are 


Suppose we know the acceleration a(t) and the initial velocity vo and position Ro. 
Then v(t) and R(t) are also known. We integrate each component: 


a(t) = constant => v(t) = Vo + at => R(t) = Ro + Vot + fat? 


a(t)=costk =v(t)=Vo+sin t k => R(t)=Ro + Vot —cos t k. 
THE CURVE OF A BASEBALL 


There is a nice discussion of curve balls in the calculus book by Edwards and Penney. 
We summarize it here (optionally). The ball leaves the pitcher’s hand five feet off the 
ground: Ry = 0i + 0j + 5k. The initial velocity is vp = 120i — 2j + 2k (120 ft/sec is more 
than 80 miles per hour). The acceleration is — 32k from gravity, plus a new term from 
spin. If the spin is around the z axis, and the ball goes along the x axis, then this 
acceleration is in the y direction. (It comes from the cross product k x i—there is a 
pressure difference on the sides of the ball.) A good pitcher can achieve a = 16j — 32k. 
The batter integrates as fast as he can: 


v(t) = Vo + at = 1201 + (—2 + 162)j + (2 — 320)k 
R(t) = Ro + Vot + fat? = 120t i+ (—2t + 8?)j + (5 + 2t — 1607)k. 


Notice the t?. The effect of spin is small at first, then suddenly bigger (as every batter 
knows). So is the effect of gravity—the ball starts to dive. At t=4, the i component 
is 60 feet and the ball reaches the batter. The j component is | foot and the k 
component is 2 feet—the curve goes low over the outside corner. 

At t= 4, when the batter saw the ball halfway, the j component was zero. It looked 
as if it was coming right over the plate. 


Fig. 42.4 A curve ball approaches home plate. Halfway it is on line. 


42.4 EXERCISES 


Read-through questions where s measures the _g _. Then s=f__h _. The tangent 
ie : vector is in th irection as th i, but Ti i 
The position vector __a__ along the curve changes with the wee e nanae ines vs ell 
caret i ie vector. In general T=__k ___ and in the example T=__1 
parameter t. The velocity is__b _. The acceleration is __c 
If the position is i+ tj+7k, thenv=_d  anda=_e_. Steady motion along a line has a=__m_. If the line is x = 


In that example the speed is |v| = __ft _. This equals ds/dt, y=z, the unit tangent vector is T=__n__. If the speed is 
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\y= a the velocity vector is v=o __. If the initial posi- 
tion is (1, 0, 0), the position vector is R(t}=__p _. The general 
equation of a line is x=xp+0,, Y=_Q_,z=_!t_.In 
vector notation this is R(t}=__s _ . Eliminating ¢ leaves the 
equations (x — x)/v, =(¥ — yo)/v2 =__t__. A line in space 
needs 4 __ equations where a plane needs__v_. A line has 


one parameter where a plane has _w__. The line from Ry = 
(1, 0, 0) to (2, 2, 2} with fy] =3 is R(f}=_ x 


Steady motion around a circle (radius r, angular velocity 
wyhasx=_y .p=_ 2% ,z=0.Thevelocityisy=_ A_. 
The speed is |vy|=__ 8 _. Theaccelerationisa= ¢__, which 


has magnitude _O0 and direction _§& . Combining 


upward motion R= tk with this circular motion produces 
motion arounda__F _ .Theny=_G and |yj=__H_. 


1 Sketch the curve with parametric equations x =2, y=t?. 
Find the velocity vector and the speed at t= 1. 


2 Sketch the path with parametric equations x =1+1, p= 
1 ~¢, Find the xy equation of the path and the speed along it. 


3 On the circle x = cos ¢, y= sin ¢ explain by the chain rule 
and then by geometry why dy/dx = —cot t. 


4 Locate the highest point on the curve x = 6t, y = 6¢—27. 
This curve is a . What is the acceleration a? 


§ Find the velocity vector and the xy equation of the tangent 
ine to x=e', p=e ‘at t=0. What is the xy equation of the 
curve? 


6 Describe the shapes of these curves: (a) x =2', y=4'; (b) 
x=4' yp=8s (ce) x=4, y=. 
Note: To find “parametric equations” is to find x(t), p(t), and 
possibly 2(t}. 


7 Find parametric equations for the line through P= 
(1, 2,4) and @ =(5, 5, 4}. Probably your speed is 5; change the 
equations so the speed is 10. Probably your Rg is P; change 
the start to Q. 


8 Find an equation for any one plane that is perpendicular 
to the line in Problem 7. Also find equations for any one line 
that is perpendicular. 


9 On a straight line from (2, 3, 4) with velocity vy =i— k, the 
position vector is R(t) = . If the velocity vector is 
changed to ti—tk, then R(t}=__ Ss The path is still 


10 Find parametric equations for steady motion from P= 

(3,1, —2} at f=0 on a line to Q =(0, 0, 0) at t= 3, What is 

the speed? Change parameters so the speed is e'. 

11 The equations x—1=y—2)=}{z—2) describe a 
. The same path is given parametrically by x = 1 +2, 

yo ,2= . The same path is also given by 

x=1]4+2t,y= ,z= 


12 Find parametric equations to go around the unit circle 


with speed e' starting from x=1, y=0. When is the circle 
completed? 


13 The path x=2y=3z=66 is a traveled with 
speed . If ¢ is restricted by ¢ 21 the path starts at 
. If t is restricted by O<¢ <1 the pathis a 


14 Find the closest point to the origin on the line x =! +2, 
y =2-—t. When and where does it cross the 45° line through 
the origin? Find the equation of a line it never crosses. 


15 (a) How far apart are the two paralle} lines x = y and 
x = y+ 1? (b) How far is the potnt x =t, y=¢ from the point 
x=t, y=t+ 1? (c) What is the closest distance if their speeds 
are different: x =t, y=t and x= 2t, y=2t+1? 


16 Which vectors follow the same path as R = ti+ t7j? The 
speed along the path may be different. 


(a) 281+ 27f (b) Wei + 4e2j (c) — th c7j (dp 74 2} 
17 Find a parametric form for the straight line y = mx + B. 


18 The line x =1+0,¢, y= 2+ vt passes through the ongin 
provided vy + p, =0. This line crosses the 
45° line y = x unless by + pz =0. 


19 Find the velocity v and speed |v} and tangent vector T 
for these motions: {a)R = ti+t~'j (b)R=tcosti+¢ sinej 
(co) R =(t + Wit (22+ D§+2r4 Dk. 


20 If the velocity dx/dt i+ dy/dtj is always perpendicular to 
the position vector xi+ yj, show from their dot product that 
x? + y? is constant. The point stays on a circle. 


21 Find two paths R(t) with the same v = cos ti + sin tj. Find 
a third path with a different ¥ but the same acceleration. 


22 Jf the acceleration is a constant vector, the path must be 
. If the path is a straight line, the acceleration vector 
must be 


23 Find the minimum and maximum speed if x =# + cos ¢, 
y=zr—sin t, Show that |a| is constant but not a. The point is 
going around a circle while the center is moving on what line? 


24 Find x(t), p(t) so that the point goes around the circle 
(x — 1) + (y —3) =4 with speed 1, 


25 A ball that is circling with x = cos 22, y = sin 2¢ flies off on 
a tangent at t = 7/8. Find its departure point and its position at 
a later time ¢ {linear motion; compute its constant velocity ¥). 


26 Why is jal generally different from d*s/dt*? Give an 
example of the difference, and an example where they are 
equal. 


27 Change ¢ so that the speed along the helix R= 
costi+sintij+ik is | imstead of fz: Call the new 
Parameter s. 


28 Find the speed ds/dt on the line x= 1+ 61, y=2+ 3t, 
z=2t. Integrate to find the length s from (1,2,0) to 
(13, 8, 4). Check by using 12? + 6? + 4?. 


12.2 Plane Motion: Projectiles and Cyclolds 


29 Find v and |¥| and a for the curve x = tan t, y = sec t. What 
is this curve? At what time does it go to infinity, and along 
what line? 


30 Construct parametric equations for travel on a helix with 
speed r. 


31 Suppose the unit tangent vector T(t) is the derivative of 
R(t). What does that say about the speed? Give a noncircular 
example. 


32 For travel on the path y=/(x), with no parameter, it is 
impossible to find the but still possible to find the 
at each point of the path. 


Find x(z) and y(t) for paths 33-36. 


33 Around the square bounded by x =0, x=1, y=0, y= 1, 
with speed 2. The formuias have four parts. 


3M Around the unit circle with speed e~'. Do you get all the 
way around? 


38 Around a circle of radius 4 with acceleration Jal = 1. 
36 Up and down the y axis with constant acceleration —j, 
returning to (0, 0) at f= 10. 
37 True (with reason) or false (with example): 
(a) If (R} = t for all ¢ then jy! = constant. 
(b} If a = 0 then R = constant. 
(c) If ¥-¥ = constant then ¥-a = 0. 
(d} If ¥-R =0 then R-R =constant. 
(e) There is no path with v=a. 


38 Find the position vector to the shadow of ti+ t7j + °k on 
the xz plane. Is the curve ever parallel to the line x = y = z? 


39 On the ellipse x = acos t, y=5 sin t, the angle @ from the 
center ts not the same as f because 
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40 Two particles are racing from (1, 0) to (0, 1). One follows 
x=cost, y=sin¢, the other follows x=1+0,t, p= vl. 
Choose v, and v, so that the second particle goes slower but 
wins. 


41 Two lines in space are given by R(t}=P-+ ty and R(t)= 
Q + tw. Four possibilities: The lines are parallel or the same 
or intersecting or skew. Decide which is which based on the 
vectors ¥ and w and uw = Q — P (which goes between the lines): 

{a} The lines are parallel if are parallel. 

(b) The lines are the same if are parallel, 

(c} The lines intersect if are not parallel but 

lie in the same plane. 
{d) The lines are skew if the triple product u:(v x w) is 


42 If the lines are skew (not in the same plane), find a formula 
based on u, v, w for the distance between them. The vector u 
may not be perpendicular to the two lines, so project it onto 
a vector that is. 


43 The distance from Q to the line P + tv is the projection of 
u=Q-—P perpendicular to v. How far is Q =(9, 4, 5) from 
the line x=1+7, y=1 4+ 2t, z=3 + 21? 

44 Solve Problem 43 by calculus: substitute for x, y,z in 

(x — 9)? + (y — 4)? +(z — 5)? and minimize. Which (x, y, z) on 

the line is closest to (9, 4, 5)? 

45 Practice with parameters, starting from x = F(t}, y = G(t). 
(a} The mirror tmage across the 45° line is x = 
f= 

(b) Write the curve x =¢?, y=t? as y=f(x). 

(c}) Why can’t x=t?, y=¢? be written as y=/(x)? 
(d) If F is invertible then t = F ~'(x) and y= (x). 
46 From 12:00 to 1:00 a snail crawls steadily out the minute 


hand {one meter in one hour). Find its position at time ¢ 
starting from (0, 0). 


> 


The previous section started with R(t). From this position vector we computed v and 
a. Now we find R(z} itself, from more basic information. The laws of physics govern 
projectiles, and the motion of a wheel produces a cycloid (which enters problems in 
robotics). The projectiles fy without friction, so the only force is gravity. 

These motions occur in a plane. The two components of position will be x (across) 
and y (up). A projectile moves as ¢ changes, so we look for x(t) and y(t). We are 
shooting a basketball or firing a gun or peacefully watering the lawn, and we have 
to aim in the right direction (not directly at the target). If the hose delivers water at 
10 meters/second, can you reach the car 12 meters away? 
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range R = (up sin 2a)/g 


height = (up sin a) 29 
time T = (2u) sin a)/¢ 
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The usual initial position is (0,0). Some flights start higher, at (0, 4). The initial 
velocity is (pg COS @, vo sina), where vg is the speed and «@ is the angle with the 
horizontal. The acceleration from gravity is purely vertical: d*y/dt? = —g. So the 
horizontal velocity stays at its initial value. The upward velocity decreases by —gt: 


dx/dt =v, cos a, dy/dt = pp sin a— gt. 


The horizontal distance x(t} is steadily increasing. The height y(t) increases and then 
decreases, To find the position, integrate the velocities (for a high start add h to y): 


The projectile path is x{t) = (vg cos 2)t, W(t) = (vp sin a)t — $gt?. (1) 


This path is a parabola, But it is not written as y= ax? + bx +c. It could be, if we 
eliminated t. Then we would lose track of time. The parabola is yx), with no param- 
eter, where we have x(t) and y(t). 


Basic question: Where does the projectile hit the ground? For the parabola, we solve 
yx) = 0. That gives the position x. For the projectile we solve y(t}=0, That gives the 
time it hits the ground, not the place. If that time is T, then x(T} gives the place. 

The information is there. It takes two steps instead of one, but we learn more. 


EXAMPLE 4 Water leaves the hose at 10 meters/second (this is va). It starts up at the 
angle «. Find the time T when y is zero again, and find where the projectile lands. 


Solution The flight ends when y =(10 sin a)T —4gT? =0. The flight time is T= 
(20 sin a)/g. At that time, the horizontal distance is 


x(T) = {£0 cos a}T = {200 cos @ sin «)/g. This is the range R. 


The projectile (or water from the hose) hits the ground at x = R, To simplify, replace 
200 cos a sin a by 100 sin 2a. Since g = 9.8 meters/sec”, we can’t reach the car. 


The range R ={100 sin 2a}/9.8 is at most 100/9.8. This is less than 12. 


The range is greatest when sin 2a = | (a is 45°). To reach 12 meters we could stand 
on a ladder (Problem 14}. To hit a baseball against air resistance, the best angle is 
nearer to 35°. Figure 12.5 shows symmetric parabolas (no air resistance) and unsym- 
metric flight paths that drop more steeply. 


42B The flight time T and the horizontal range R = x{T) are reached when 
y =0, which means (vp sin a)T = 4¢T?: 


T = (2b, sin a)/g and R =(vp cos «)T =(v% sin 2a)/g. 


HEIGHT IN FEET 


R 


DISTANCE IN FEET 


Fig. 12.5 Equal range R, different times T. Baseballs hit at 35° with increasing v9. The dots 
are at half-seconds (from The Physics of Baseball by Robert Adatr: Harper and Row 1990}. 


42.2 Plane Motion: Projectiles and Cyclokds 


EXAMPLE 2 What are the correct angles a for a given range R and given vo? 


Solution The range is R =(v3 sin 2a)/g. This determines the sine of 2«—dut two 
angles can have the same sine. We might find 22 = 60° or 120°, The starting angles 
a= 30° and = 60° in Figure 12.5 give the same sin 2a and the same range R. The 
flight times contain sin a and are different. 

By calculus, the maximum height occurs when dy/dt = 0. Then vg sin ¢ = gt, which 
means that t = (vg sin &)/g. This is half of the total flight time T—the time going up 
equals the time coming down. The value of y at this halfway time t = $T is 


Ymax = (Vo SiN a)(vq sin a)/g — Fg(vo sin a/g)” = (vo sin a)*/2g. (2) 


EXAMPLE 3 If a ski jumper goes 90 meters down a 30° slope, after taking off at 28 
meters/second, find equations for the flight time and the ramp angle a. 


Solution The jumper lands at the point x = 90 cos 30°, y = — 90 sin 30° (minus sign 
for obvious reasons). The basic equation (2) is x = (28 cos a)t, y = (28 sin a)t — $gt?. 
Those are two equations for a and ¢. Note that ¢ is not T, the flight time to y= 0. 


Conclusion The position of a projectile involves three parameters vp, a, and t. Three 
pieces of information determine the flight (almost). The reason for the word almost is 
the presence of sin a and cos a. Some flight requirements cannot be met (reaching a 
car at 12 meters). Other requirements can be met in two ways (when the car is close). 
The equation sin « = c is more likely to have no solution or two solutions than exactly 
one solution. 

Watch for the three pieces of information in each problem. When a football starts 
at v9 = 20 meters/second and hits the ground at x =40 meters, the third fact is 
. This is like a lawyer who is asked the fee and says $1000 for three questions. 
“Isn't that steep?” says the client. “Yes,” says the lawyer, “now what’s your last 
question?” 


CYCLOIDS 


A projectile’s path is a parabola. To compute it, eliminate t from the equations for x 
and y. Problem 5 finds y = ax? + bx, a parabola through the origin. The path of a 
point on a wheel seems equally simple, but eliminating ¢ is virtually impossible. The 
cycloid is a curve that really needs and uses a parameter. 

To trace out a cycloid, rofl @ circle of radius a along the x axis. Watch the point 
that starts at the bottom of the circle. It comes back to the bottom at x = 2za, after 
a complete turn of the circle. The path in between is shown in Figure 12.6. After a 
century of looking for the xy equation, a series of great scientists (Galileo, Christopher 
Wren, Huygens, Bernoulli, even Newton and l’H6pital) found the right way to study 
a cycloid—by introducing a parameter. We will call it @; it could also be ¢. 
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Aig. 12.6 Path of P on a rolling circte is a cycloid. Fastest slide to Q, 
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The parameter is the angle @ through which the circle turns. (This angle is not at 
the origin, ike @ in polar coordinates.) The circle rolls a distance a@, radius times 
angle, along the x axis. So the center of the circle is at x = a8, y= a. To account for 
the segment CP, subtract a sin 6 from x and acos @ from y: 


The point P has x = a(@ — sin @) and y = a(l —cos 6). (3) 


At 6 =0 the position is (0, 0). At 6 = 2x the position is (2xa, 0). In between, the slope 
of the cycloid comes from the chain rule: 


ad dy/d@ asin 8 
dy _dyid@ __asind (4) 


dx dx/d@  a{l—cos #9) 
This is infinite at @= 0. The point on the circle starts straight upward and the cycloid 
has a cusp. Note how all calculations use the parameter @. We go quickly: 


Question 4 Find the area under one arch of the cycloid (@ = 0 to @ = 22). 
Answer The area is | y dx = §3" a(1 — cos 8)a(1 — cos 6)d@. This equals 3na?. 


Question 2. Find the length of the arch, using ds = ./(dx/d@)* + (dy/d@)? dé. 
Answer fds =>" a /(1 — cos 0)” + (sin BY do = §3" a. /2 2 cos 6 dé. 
Now substitute 1 —cos @ = 2 sin? 49. The square root is 2 sin $9. The length is 8a. 


Question 3 If the cycloid is turned over (y is downward), find the time to slide to 
the bottom. The slider starts with o=0 at y=0. 

Answer Kinetic plus potential energy is 4mv* — mgy =0 {it starts from zero and 
can’t change). So the speed is vp = ./2gy. This is ds/dt and we know ds: 


sliding time = [a = | Sc [sae ca ie Mt ae n/a 
/ 2ey Nein 
Check dimensions: a = distance, g = distance/{time)*, x./a/g = time. That is the short- 
est sliding time for any curve. The cycloid solves the “brachistochrone problem,” 
which minimizes the time down curves from O to Q (Figure 12.6). You might think 
a straight path would be quicker—it is certainly shorter. A straight line has the 
equation x = y/2, so the sliding time is 


§ dt =f ds/ /2gy = [2 //2? + 1 dy), /2ey =./n? +4 /alg. (5) 
This is larger than the cycloid time x/alg. It is better to start out vertically and pick 
up speed early, even if the path is longer. 

Instead of publishing his solution, John Bernoulli turned this problem into an 
international challenge: Prove that the cycloid gives the fastest slide. Most mathemati- 
cians couldn’t do it. The problem reached Isaac Newton (this was later in his life). 
As you would expect, Newton solved it. For some reason he sent back his proof with 
no name. But when Bernoulli received the answer, he was not fooled for a moment: 
“T recognize the lion by his claws.” 

What is also amazing is a further property of the cycloid: The time to Q is the same 
if you begin anywhere along the path. Starting from rest at P instead of O, the bottom 
is reached at the same time. This time Bernoulli got carried away: “You will be 
petrified with astonishment when I say...”. 


There are other beautiful curves, closely related to the cycloid. For an epicycloid, 
the circle rolls around the outside of another circle. For a Aypocycioid, the rolling 
circle is inside the fixed circle. The astroid is the special case with radii in the ratio | 
to 4. It is the curved star in Problem 34, where x = acos*@ and y= a sin*#. 
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The cycloid even solves the old puzzle: What point moves backward when a train 
starts forward? The train wheels have a flange that extends below the track, and 


dx/dt <0 at the bottom of the flange. 


12.2 EXERCISES 


Read-through questions 


A projectile starts with speed vg and angle x. At time ¢ its 
velocity is dx/de=_ 9, dy/dt=__b ~~ {the downward 
acceleration ts g). Starting from (0,0), the position at time f 
isx=_¢ ,y=_4__. The flight time back to y=Qis T= 

@ . At that time the horizontal range is R=__¢ _. The 
flight path isa__9 


The three quantities t), 4%  , ! determine the pro- 
jectile’s motion. Knowing v, and the position of the target, 
we (can) (cannot) solve for «. Knowing « and the position of 
the target, we (can) (cannot) sotve for vp. 


A __|__ is traced out by a point on a rolling ctrcle. If the 
radius is a and the turning angle is @, the center of the circle 
isatx=_k_ , y=__!__. The point is atx=_mM_, y= 

n__, starting from (0,0). It travels a distance © ina 
full turn of the circle. The curve has a__P at the end of 
every turn. An upside-down cycloid gives the 4 ___ slide 
between two points, 


Problems 1-18 and 43 are about projectiles 


1 Find the time of flight T, the range R, and the maximum 
height Y of a projectile with vo = 16 ft/sec and 
(a) a = 30° (b} a= 60° (c) «=90°. 
2 If vg = 32 ft/sec and the projectile returns to the ground 
at T =1, find the angle a and the range R. 


3 A ball is thrown at 60° with vp = 20 meters/sec to clear a 
wall 2 meters high. How far away is the wall? 


4 If 40) = 314 3j find v(0), v(1), (2) and R(¥), R(1), RQ). 


§ (a) Eliminate ¢ from x =¢, y =-— $2? to find the xy equa- 
tion of the path. At what x is y =0? 
(b) Do the same for any vy and x, 


6 Find the angle x for a ball kicked at 30 meters/second if 
it clears 6 meters traveling horizontally. 


7 How far out does a stone hit the water A feet befow, start- 
ing with velocity co at angle « = 0? 


8 How far out does the same stone go, starting at angle 2? 
Find an equation for the angle that maximizes the range. 


9 A ball starting from (0,0) passes through (5, 2) after 2 
seconds. Find ty) and a. (The units are meters.) 


*10 With x and » from equation (1), show that 


v6 2 (gx/vo}? + 2gy. 


If a fire is at height H and the water velocity is vo, how far 
can the fireman put the hose back from the fire? (The parabola 
in this problem is the “envelope” enclosing all posstble paths,} 


11 Estimate the initial speed of a 100-meter golf shot hit at 
a = 45°. Is the true vg larger or smaller, when air friction is 
included? 


12 T =29(sin x)/g is in seconds and R =(v4 sin 2u)/g is in 
meters if vo and g are in 


13 (a) What is the greatest height a ball can be thrown? Aim 
straight up with vg = 28 meters/sec. 


14 if a baseball goes 100 miles per hour for 60 feet, how long 
does it take (in seconds) and how far does it fall from gravity 
(in feet)? Use 4g7?. 


15 If you double v9, what happens to the range and maxi- 
mum height? If you change the angle by da, what happens to 
those numbers? 


16 At what point on the path is the speed of the projectile 
(a) least (b) greatest? 


17 If the hose with vp = 10m/sec is at a 45° angle, x reaches 

12 meters when t = and y= . From a lad- 

der of height the water will reach the car (12 meters). 
‘ 


18 Describe the two trajectories a golf ball can take fo land 
right in the hole, if it starts with a large known velocity vg. 
In reality (with air resistance} which of those shots would fall 
closer? 


Problems 19-34 are about cycloids and related curves 


19 Find the unit tangent vector T to the cycloid. Also find 
the speed at @=0 and @=n, if the wheel turns at d@/dt = 1. 
20 The slope of the cycloid is infinite at @ = 0: 

dy _dyiad sin 0 


dx dxjd) 1—cos@ 


By whose rule? Estimate the slope at @=74 and 0= — 7. 
Where does the slope equal one? 


21 Show that the tangent to the cycloid at P in Figure 12.6a 
goes through x = a8, y = 2a. Where is this point on the rolling 
circle? 


22 For a trochoid, the point P is a distance d from the center 
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of the rolling circle. Redraw Figure 12.6b to find x= 
a? —dsin 6 and y= 


23 Ifa circle of radius a rolls inside a circle of radius 2a, show 
that one point on the small circle goes across on a straight 
line, 


24 Find d?y/dx? for the cycloid, which is concave 


28 If d6/dt =c, find the velocities dx/dt and dy/dt along the 
cycloid. Where is dx/dt greatest and where is dy/dt greatest? 


26 Experiment with graphs of x=acos@+bsin@, y= 
ccos +d sin @ using a computer. What kind of curves are 
they? Why are they closed? 


27 A stone in a bicycle tire goes along a cycloid. Find equ- 
ations for the stone’s path if it flies off at the top (a projectile), 


28 Draw curves on a computer with x =acos 0+ bcos 36 
and y=c sin @+d sin 30. Is there a limit to the number of 
loops? 


Involute 


an 


29 When a penny rolls completely around another penny, the 
head makes turns. When it rolls inside a circle four 
times larger (for the astroid), the head makes turns. 


3 Display the cycloid family with computer graphics: 
(a} cycloid 
(b) epicyeloid x = C cos @—cos Cé, y=C sin 8 + sin Cé 
(c) hypocycloid x = ¢ cos 6+ cos cO, y=c sin @—sin cé 
(d) astroid (c = 3) 
(e) deltoid (c = 2). 


31 If one arch of the cycloid is revolved around the x axis, 
find the surface area and volume. 


32 For a hypocycloid the fixed circle has radius c + 1 and the 
circle rolling instde has radius 1. There are c + 1 cusps if c ts 
an integer. How many cusps (use computer graphics if pos- 


sible) for ¢ = 1/27? c=3/2? c= J? What curve for c= 1? 


33 When a string is unwound from a circle find x(@) and (6) 
for point P. Its path is the “inyolute” of the circle, 


34 For the point P on the astroid, explain why x= 
3 cos @+cos3@ and y=3sin@—sin3@. The angle in the 
figure is 3@ because both circular arcs have length 

Convert to x = 4 cos°0, y = 4 sin*@ by triple-angle formulas. 
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35 Find the area inside the astroid. 


36 Explain why x = 2acot 6 and y = 2a sin@ for the point P 
on the witch of Agnesi. Eliminate @ to find the xy equation. 
Note: Maria Agnesi wrote the first three-semester calculus 
text (’Hépital didn’t do integral catculus). The word “witch” 
is a total mistranslation, nothing to do with her or the curve. 


O Cardioid 


37 For a cardioid the radius C — 1 of the fixed circle equals 
the radius | of the circle rolling outside (epicyclotd with C = 
2). (a) The coordinates of P are x = —1 +2 cos 6—cos 28, 
y= . (b) The double-angle formulas yield x= 
2 cos Ol — cas 6), p= Oxr+y= So its 
square root is r= 


38 Explain the last two steps in equatton (5) for the sliding 
time down a straight path. 


39 On an upside-down cycloid the slider takes the same time 
T to reach bottom wherever it starts. Starting at 6 = 2, write 
1 —cos ? = 2 sin? 9/2 and 1 ~ cos ¢ =2 sin? 2/2 to show that 


r-[ oe x 2a*(1 —cos 8) d san [ft 


./2agicos x — cos 8) x—cos @ 


40 Suppose a heavy weight is attached to the top of the roll- 
ing circle. What is the path of the weight? 


41 The wail in Fenway Park is 37 feet high and 315 feet from 
home plate. A baseball hit 3 feet above the ground at a= 
22.5° will just go over if vo = . The time to reach the 
wall is 
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12.3) Curvature and Normal Vector [xx 


A driver produces acceleration three ways—by the gas pedal, the brake, and steering 
wheel. The first two change the speed. Turning the wheel changes the direction. All 
three change the velocity (they give acceleration). For steady motion around a circle, 
the change is from steering—the acceleration dv/dt points to the center. We now 
look at motion along other curves, to separate change in the speed |y| from change 
in the direction T. 

The direction of motion is T =y/|v|. It depends on the path but not the speed 
(because we divide by |¥|). For turning we measure two things: 


1. How fast T turns: this will be the curvature x (kappa). 
2. Which direction T turns: this will be the aormal vector N. 


« and N depend, like s and T, only on the shape of the curve. Replacing t by 2r or 
t* leaves them unchanged. For a circle we give the answers in advance. The normal 
vector N points to the center. The curvature x is 1/radius. 

A smaller turing circle means a larger curvature «: more bending. 


The curvature & is change in direction |dT| divided by change in position |ds|. There 
are three formulas for x—a direct one for graphs y{x), a brutal but valuable one for 
any parametric curve (x2), y()), and a neat formula that uses tbe vectors ¥ and a. We 
begin with the definition and the neat formula. 


DEFINITION x = |dT/ds| FORMULA x = |¥ x al/|v/ (1) 


The definition does not involve the parameter t—but the calculations do. The posi- 
tion vector R(t) yields vy = dR/dt and a = dv/dt. If t is changed to 2t, the velocity ¥ is 
doubled and a is multiplied by 4. Then |v x al and jv|? are multiplied by 8, and their 
ratio k is unchanged. 


Proof of formula (1) Start from vy =|v/T and compute its derivative a: 


a= Me T+\|¥| a by the product rule. 


Now take the cross product with y=|vIT. Remember that T x T =0: 
dT 
=|vIT ==; 2 
¥xa=|y| x We (2) 


We know that |T| = 1. Equation (4) will show that T is perpendicular to dT/dt. So 
jv x al is the first length |y| times the second length |y||dT/dt|. The factor sin @ in the 
length of a cross product is 1 from the 90° angle. In other words 


dT 
dt 


aT 
ds 


aT jdt 
ds/dt 


\¥ x af 


7 (3) 


Iv? 


_ ly xal 
WP 


and x= 


The chain rule brings the extra {ds/dt| = |¥| into the denominator. 


Before any examples, we show that dT/dt is perpendicular to T. The reason is that 
T is a unit vector. Differentiate both sides of T-T = 1: 


T pyre co or art 


= 4 
dt at (4) 
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That proof used the product rule U’-¥+U-YV’ for the derivative of U:V 
(Problem 23, with U = V = T). Think of the vector T moving around the unit sphere. 
To keep a constant length (T+ ¢T)-(f + dT) =1, we need 2T-dT=0. Movement 
dT is perpendicular to radius vector T. 


Our first examples will be plane curves. The position vector R(z} has components 
x(t) and »({t) but no 2(t). Look at the components of ¥ and a and vy x a (x’ means 
dx/dt): 

R x(t) yt) 0 


vx) yO W= Jie? +1¥P 
a x yO 0 x'y”— yx"l 


are BERRY TS 77 (3) 
vyxa 0 OO xy"—y'x” (x +(v¥yVP? 


Equation (5) is the brutal but valuable formula for x. Apply it to movement around 
a citcle. We should find x = 1/radius a: 


EXAMPLE 4 When x =acos wt and y=asin wt we substitute x’, y’, x”, y” into (5): 
(—a sin wt)(— wa sin ct) — (wa cos wt)(—w?a cos wt) awa? 
_ [(@a sin wt)? + (wa cos wt)?)}>? ~ Fax? a? 3?" 
This is w?a?/m@*a5 and w cancels. The speed makes no difference to x = 1/a. 
The third formula for « applies to an ordinary plane curve given by yx). The 
parameter t is x! You see the square root in the speed |v| = ds/dx: 


R x yx) 0 


¥ 1 dy/dx 0 jy = ./1 + (dy/dx)? 
a 0 da’ y/dx?* 0 |d*y/dx?! 


ree ll ic es 6 
vxa 0 0 dydx? = * + (aya? = 


In practice this is the most popular formula for x. The most popular approximation 
is |d*y/dx*|. (The denominator is omitted.) For the bending of a beam, the nonlinear 
equation uses « and the linear equation uses d*y/dx?, We can see the difference for 
a parabola: 


EXAMPLE 2 The curvature of y= $x? is x = |y"|/(1 + (y'P)9? = 1 + x?)>?, 


T 


Fig. 42.7 Normal! N divided by curvature x for circle and parabola and unit heltx. 


42.3 Curvature and Normal Vector 


The approximation is y” = 1. This agrees with x at x =0, where the parabola turns 
the corner. But for large x, the curvature approaches zero. Far out on the parabola, 
we go a long way for a small change in direction. 

The parabola y = —4x?, opening down, has the same «. Now try a space curve, 


EXAMPLE 3 Find the curvature of the unit helix R =cos ti+ sin ¢j + tk. 


Take the cross product of y= —sinti+costj+k and a= —coszi—sin tj: 
i jk 
yxa=|-—sint cost I] =sinti~costj+k. 


—cost —sint 90 


This cross product has length afi. Also the speed is [v| = ./sin?t + cos*t + 1 = ft 
k= |v x al/lvi? = /2/¢/2)° =4. 


Compare with a unit circle, Without the climhing term tk, the curvature would be 1. 
Because of climbing, each turn of the helix is longer and x = $. 

That makes one think: Is the helix twice as long as the circle? No. The length of a 
turn is only increased by |¥| = /2. The other _/2 is because the tangent T slopes 
upward. The shadow in the base turns a full 360°, but T turns less. 


THE NORMAL VECTOR N 


The discussion is bringing us to an important vector. Where x measures the rate of 
turning, the unit vector N gives the direction of turning. N is perpendicular to T, and 
in the plane that leaves practically no choice. Turn left or right. For a space curve, 
follow dT, Remember equation (4), which makes ¢¥ perpendicular to T. 


The normal vector N is a unit vector along dT/dt. It is perpendicular to T: 


aTjds 147  popnayta Ne Zed 


DEFINITION N= (aT /ds| re a (aT /ae\" {7) 


EXAMPLE 4 Find the normal vector N for the same helix R=costi+sintj+tk. 
Solution Copy ¥ from Example 3, divide by |vl, and compute dT/dt: 


T=vi/\vy|=(-sin tit+costj+ky//2 and dT /dt=(—costi—sin t jyy,/2. 


To change dT/dt into a unit vector, cancel the a2: The normal vector is N= 
—cos ti—sin tj. It is perpendicular to T, Since the k component is zero, N is hori- 
zontal. The tangent T slopes up at 45°—it goes around the circle at that latitude. 
The normal N is tangent to this circle (N is tangent to the path of the tangent!). 
So N stays horizontal as the helix climbs. 

There is also a third direction, perpendicular to T and N. It is the Sinermal vector 
B=TxN, computed in Problems 25-30. The unit vectors T, N, B provide the 
natural coordinate system for the path——along the curve, in the plane of the curve, 
and out of that plane. The theory is beautiful but the computations are not often 
done—we stop here. 
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TANGENTIAL AND NORMAL COMPONENTS OF ACCELERATION 


May I return a last time to the gas peda] and brake and steering wheel? The first 
two give acceleration along T. Turning gives acceleration along N. The rate of turning 
(curvature «) and the direction N are established. We now ask about the force 
required. Newton’s Law is F=ma, so we need the acceleration a—especially its 
component along T and its component along N. 


d*s ds |? 
The acceleration is a= —>T —|N. 
é acceleration is a ae + (5 {8) 


For a straight path, d*s/dt? is the only acceleration—the ordinary second derivative. 
The term x(ds/dt)? is the acceleration in turning. Both have the dimension of length; 
(time)’. 

The force to steer around a corner depends on curvature and speed—as ali drivers 
know. Acceleration is the derivative of ¥ = |v¥|T = (ds/dt)T: 


ae dt dt de dt ds de 


That last term is x(ds/dt)?N, since dT/ds = «N by formula (7). So (8) is proved. 
EXAMPLE 5 A fixed speed dsjdt = 1 gives d*s/dt? = 0. The only acceleration is KN. 
EXAMPLE 6 Find the components of a for circular speed-up R(t) = cos 1? i + sin t? j. 
Without stopping to think, compute dR/dt = v and ds/dt =[¥| and v/|¥| = T: 
v= —2tsint?i+2¢cos¢t*j, jy) =2t, T= —sin t? it+cos r?j. 
The derivative of ds/dt =|v| is d*s/de? = 2. The derivative of ¥ is a: 
a= —2sint*i+2 cos t? j—4t? cos t? i— 4f? sin ¢7j, 


In the first terms of a we see 2T. In the last terms we must be seeing x|v|’N. Certainly 
\v|? =4¢? and x =1, because the circle has radius |. Thus a=2T + 4t?N has the 
tangential component 2 and normal component 4¢*—accelcration along the circle 
and tn to the center. 


Table of Formulas 

y=adRidt a=dv/dt 

\yj=dsidt T= ¥/|¥| = \dR/ds| 
Curvature x = |dT/ds| = |v x al/|¥}? 


jx'y” — y’x"| |d?y/dx?| 
Plane curves k = — 
© (@P+OPP? 4 Gy/dxP yp? 
1aT 3 dTidt “s 
Normal vector N = -— = J 48 <Q brak 
kds |dT/at| eal 


Acceleration a = (d7s/dt?)T + xlv|?N Fig. 12.8 Components of a as car turns corner. 
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12.3 EXERCISES 


Read-through questions 


The curvature tells how fast the curve __9__. For a circle of 
radius a, the direction changes by 22 in aditstance PB _, so 
K=_¢ _. Fora plane curve » = f{x) the formula is « =|y"|/ 
d__. The curvature of y=sinxis_ 8 __. At a point where 
{ _ point) the curve is momentarily straight and 
. For a space curve k=|¥xal/_ 


=_3 


The normal vector N is perpendicular to i. It is a 
| vector along the derivative of T, so N=__k_. For 
motion around a circle N points__! _ _. Up a helix N also 
points ™ __. Moving at unit speed on any curve, the time ¢ 
is the same as the __"__s. Then |y|=__9 _ and d?s/dt?= 


P__ and a is in the direction of _g@ 


Acceleration equals _ *  T+__$s __N. At unit speed 


around a unit circle, those components are _t . An 
astronaut who spins once a second in a radius of one meter 
has jal}=_ 4 meters/sec*, which is about __¥__g. 


Compute the curvature « in Problems 1-8. 
1 y=e 
2 y=In x (where ts « largest?) 
3 x=2cost,y=2sint 
4 x=cost*, y=sin t? 
§ x=1+t?, y=3t? (the path is a ). 
6 x=cos*!, y=sin*s 
Tr=0=t(sox=tcost, y= ) 
8 x=1,y=In cost 
9 Find T and N in Problem 4. 
10 Show that N=sin ti+cos tj in Problem 6. 
11 Compute T and N in Problem 8. 
12 Find the speed |v] and curvature « of a projectile: 
xX ={(vp cos a, y =(o sin at — tee’. 


13 Find T and {vj and x for the helix R=3cosri 
+3 sin ¢j+4tk. How much longer is a turn of the heltx than 
the corresponding cirele? What is the upward slope of T? 


t4 When « = 0 the path isa 


. This happens when ¥ 
and a are ‘ 


. Then ¥x a= 
15 Find the curvature of a cycloid x=a(t—sin’), y= 
a(l — cos 1}. 


16 If all points of a curve are moved twice as far from the 
origin (x > 2x, y > 2y), what happens to xk? What happens 
to N? 


17 Find x and N at @=a for the hypocycloid x= 
4 cos 84+ cos 46, y =4 sin 6 —sin 46, 


18 From ¥ = |+|T and a in equation (8), derive x =|¥ x al/|¥|°. 


19 From a point on the curve, go along the vector N/k to 
find the center of curvature. Locate this center for the point 
(1, 0) on the circle x =cos t, y=sin¢ and the ellipse x =cos t, 
y=2sine and the parabola y=}(x*— 1). The path of the 
center of curvaturc is the “evofute” of the curve. 


20 Which of these depend only on the shape of the curve, 
and which depend also on the speed? y, T, [v|, s, x, a, N, B. 


21 A plane curve through (0, 0) and (2, 0) with constant cur- 
vature « is the circular arc . For which x is there no 
such curve? 


22 Sketch a smooth curve going through (6, 0), (1, —1), and 
(2,0). Somewhere dy/dx? is at least . Somewhere 
the curvature is at least . {Proof is for instructors 
only.} 


23 For plane vectors, the ordinary product rule applied to 
U,V, + U,V, shows that (U-¥) =U'+ V+ 


24 If vis perpendicular to a, prove that the speed is constant. 
True or false: The path is a circle. 


Problems 25-30 work with the T-N-B system—along the 
curve, in the plane of the curve, perpendicular to that plane. 


25 Compute B=T x N for the helix R =cos ti+sinij+tk 
in Examples 3-4. 


26 Using Problem 23, differentiate B-T = 0 and B- B= 1 to 
show that B’ is perpendicular to T and B. So dB/ds = —1N 
for some number ¢ called the torsion. 


27 Compute the torsion t=|dB/ds| for the helix in 
Problem 25. 


28 Find B=T x N for the curve x=1, p=t, z=. 


29 A circle lies in the xy plane. Its normat N lies 
and B= and t = |¢dB/ds| = : 


30 The Serret-Frenet formulas are dT/ds=KN, dN/ds = 
—xT+7B, dB/ds= —tN. We know the first and third. 
Differentiate N = — T x B to find the second. 


31 The angle @ from the x axis to the tangent line ts d= 
tan” ‘(dy/dx), when dyidx is the slope of the curve. 
(a} Compute d@/dx. 
(b) Divide by ds/dx =(1 ~(dyidx)*}'? to show that |d6/ds| 
is & in equation (5), Curvature is change in direction |d60| 
divided by change in position |ds|. 
32 If the tangent direction is at angle @ then T= 
cos 0 i+ sin ?j. In Problem 3] |d@/ds| agreed with « = |dT/ds| 
because |dT /d6| = : 
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In 33-37 find the T and N components of acceleration. % x=e' cost, p=e'sint,z=0 (spiral) 


7 xatyatiz=l. 


PS Rr SOE Dy 8 MLO RO ACE) 38 For the spiral in 36, show that the angle between R and 


34 x=14+2, y=14+2t,z=14+3¢ {line} a (position and acceleration) is constant. Find the angle. 


38 x=tcost, y=tsint,z=0 (spiral) 39 Find the curvature of a polar curve r = F(0). 


= 12.4 Polar Coordinates and Planetary Motion | 


This section has a general purpose—to do vector calculus in polar coordinates. It 
also has a specific purpose—to study central forces and the motion of planets. The 
main gravitational force on a planet is from the sun. It is a central force, because it 
comes from the sun at the center, Polar coordinates are natural, so the two purposes 
go together. 

You may feei that the planets are too old for this course. But Kepler’s laws are 
more than theorems, they are something special in the history of mankind—“the 
greatest scientific discovery of all time.” If we can recapture that glory we should do 
it. Part of the greatness is in the difficulty—Kepler was working sixty years before 
Newton discovered calculus. From pages of observations, and some terrific guesses, 
a theory was born. We will try to preserve the greatness without the difficulty, and 
show how elliptic orbits come from calculus. The first conclusion is quick. 


Motion in a central force field always stays in a plane. 


F is a multiple of the vector R from the ongin (central force). F also equals ma 
(Newton’s Law). Therefore R and a are in the same direction and R x a=0, Then 
R x ¥ has zero derivative and is constant: 


d 
by the product rule: qk x Varxvt+Rxa=0+0. {1} 


R x v¥ is a constant vector H. So R stays in the plane perpendicular to H. 


How does a planet move in that plane? We turn to polar coordinates. At each 
point except the origin (where the sun is), u, is the unit vector pointing outward. It is 
the position vector R divided by its length r (which is ./x? + y’): 


u, = R/r = (xi + yj)/r = cos Oi+ sin @j. (2) 


That is a unit vector because cos?@+sin79=1. It goes out from the center. 
Figure 12.9 shows u, and the second unit vector uy, at a 90° angle: 


Up = — sin Gi+cos Aj. (3) 
The dot product is u, ‘us = 0. The subscripts r and @ indicate direction (not derivative). 
Question 4: How do u, and u, change as r changes (out a ray)? They don't. 
Question 2: How do u, and up, change as @ changes? Take the derivative: 
du,/dé = —sin 6 i+cos 8 j= up, 
(4) 


du,/d@ = —cos @i—sin 0 j= —u,. 
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r@"+2r'@" 


Ag. 12.9 uy, is outward, u, is around the center. Components of y and a in those directions. 


Since u, = R/r, one formula is simple: The position vector is R = ru,. For its derivative 
vy = dR/dt, use the chain rule du,/dt = (du, /d0)(d@/de) = (d0/dt)u,: 


ig d dr dé 
The velocity is y= ai (ru,) = ae +r 7 (5) 


The outward speed is dr/dt. The circular speed is r d6/dt. The sum of squares is |y|?. 

Return one more time to steady motion around a circle, say r= 3 and @ = 2t. The 
velocity is ¥ = 6ug, all circular. The acceleration is —12u,, all inward. For circles ug 
is the tangent vector T. But the unit vector u, points outward and N points inward— 
the way the curve turns. 

Now we tackle acceleration for any motion in polar coordinates. There can be 
speedup in r and speedup in @ (also change of direction). Differentiate ¥ in (5) by the 
product rule: 


hod” dd age ae” dee 


For du,/dt and du,/dt, multiply equation (4) by d0/dt. Then all terms contain u, or ug. 
The formula for a is famous but not popular (except it got us to the moon): 


a ar ( 2ey + 28 | .ard8 (6) 

aa \ae at) } de? drat} 
In the steady motion with r=3 and @= 22, only one acceleration term is nonzero: 
a = — 12u,. Formula (6) can be memorized (maybe). Problem 14 gives a new way to 


reach it, using re’®. 


EXAMPLE 4 Find R and vy and a for speedup @ = ¢? around the circle r= 1. 
Solution The position vector is R= u,. Then ¥ and a come from (5-6): 

v=(r d0/dtu,=2tu, a= —(2t)*u,+ 2uy. 
This question and answer were also in Example 6 of the previous section. The acceler- 
ation was 2T + 4¢?N. Notice again that T = u, and N = — u,, going round the circle. 
EXAMPLE 2 Find R and ¥ and |¥| and a for the spiral motion r = 31, 6 = 2¢. 
Solution The position vector is R = 3r u,. Equation (5) gives velocity and speed: 


v=3u,+6tu, and = [y|=./(3)? +(62)*. 
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The motion goes out and also around. From (6) the acceleration is —12f u, + 12ug. 
The same answers would come more slowly from R = 3 cos 2t i+ 3¢ sin 2t j. 

This example uses polar coordinates, but the motion is not circular. One of Kepler’s 
inspirations, after many struggles, was to get away from circles. 


KEPLER'S LAWS 


You may know that before Newton and Leibniz and calculus and polar coordinates, 
Johannes Kepler discovered three laws of planetary motion. He was the court mathe- 
matician to the Holy Roman Emperor, who mostly wanted predictions of wars. 
Kepler also determined the date of every Easter—no small problem. His triumph 
was to discover patterns in the observations made by astronomers (especially by 
Tycho Brahe). Galileo and Copernicus expected circles, but Kepler found ellipses. 


Law: Each planet travels in an ellipse with one focus at the sun. 


Law 2; The vector from sun to planet sweeps out area at a steady rate: dA/dt = 
constant. 


Law 3: The length of the planet’s year is T = ka*’*, where a = maximum distance 
from the center (not the sun) and k = 27/,/GM is the same for all planets. 


With calculus the proof of these laws is a thousand times quicker. But Law 2 is the 
only easy one. The sun exerts a central force. Equation (1) gave R x v =H = constant 
for central forces. Replace R by ru, and replace v by equation (5): 


dr dé} dé 
Hm ray x (Tu,+r Fito) =P Tx) (7) 


This vector H is constant, so its length h =r?d6/dt is constant. In polar coordinates, 
the area is dA = 4r7d@, This area dA is swept out by the planct (Figure 12,10), and 
we have proved Law 2: 


d Aidt =4r? dOjdt = th = constant. (8) 


Near the sun r is small. So d@/de is big and planets go around faster, 


roe 


NOT 
CIRCLES! 


Fig. 12.10 The planet is on an ellipse with the sun at a focus. Note a, 8, c, g. 


Now for Law 1, about ellipses. We are aiming for 1/r = C — Dcos @, which is the 
polar coordinate equation of an ellipse. It is easier to write q than |/r, and find an 
equation for q. The equation we will reach is d7q/d8?+q=C. The desired q= 
C — Dos @ solves that equation (check this), and gives us Kepler’s ellipse. 
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The first step is to connect dr/dt to dg/d@ by the chain rule: 


dr_d(l\_-1dg_-1dqd0__, dy ~ 
dt dt\q/ @g@@ dt gq? dédt ad 
Notice especially d@/dt = h/r? = hq”. What we really want are second derivatives: 
d?r d {dq d fdq\ dé aigd'@ 
rT me | (4) --15 (2S eas O77 ” 


After this trick of introducing q, we are ready for physics. The planet obeys Newton’s 
Law F = ma, and the central force F is the sun’s gravity: 


F a. GM dr ‘ doy 
m r2 dt? dt}° 
That right side is the u, component of a in (6). Change r to 1/g and change d@/dt to 


hq’. The preparation in (10) allows us to rewrite d?r/dt? in equation (11), That 
equation becomes 


(11) 


2 
1 
_6M ¢=— Wt att 


d@? 
Dividing by —h*q* gives what we hoped for—the simple equation for gq: 
a*q/d0? + q = GM/h? = C (a constant). (12) 


The solution is gq = C — D cos @. Section 9.3 gave this polar equation for an ellipse or 
parabola or hyperbola, To be sure it is an ellipse, an astronomer computes C and D 
from the sun's mass M and the constant G and the earth’s position and velocity. The 
main point is that C > D. Then q is never zero and r is never infinite. Hyperbolas and 
parabolas are ruled out, and the orbit in Figure 12.10 must be an ellipse.f 


Astronomy is really impressive. You should visit the Greenwich Observatory in 
London, to see how Halley watched his comet. He amazed the world by predicting 
the day it would return. Also the discovery of Neptune was pure mathematics— 
the path of Uranus was not accounted for by the sun and known planets. LeVerrier 
computed a point in the sky and asked a Berlin astronomer to look. Sure enough 
Neptune was there. 

Recently one more problem was solved—to explain the gap in the asteroids around 
Jupiter. The reason is “chaos’—the three-body problem goes unstable and an 
asteroid won’t stay in that orbit. We have come a long way from circles. 


Department of Royal Mistakes The last pound note issued by the Royal Mint 
showed Newton looking up from his great book Principia Mathematica. He is not 
smiling and we can see why. The artist put the sun at the center! Newton has just 
proved it is at the focus. True, the focus is marked S§ and the planet is P. But those 
rays at the center brought untold headaches to the Mint—the note is out of circula- 
tion. I gave an antique dealer three pounds for it (in coins). 


Kepler’s third faw gives the time T to go around the ellipse—the planet’s year. 
What is special in the formula is a*/*—and for Kepler himself, the 15th of May 1618 
was umforgettable: “the right ratio outfought the darkness of my mind, by the great 
proof afforded by my labor of seventeen years on Brahe’s observations.” The second 


tAn amateur sees the planet come around again, and votes for an ellipse. 


A67 
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law dA/dt = this the key, plus two facts about an ellipse—its area nab and the height 
b?/a above the sun: 


3 1 
1. The area A= aA dt = =hT must equal zab, so T= = 
o at 2 h 


2. The distance r= 1/C at @ = 7/2 must equal b?/a, so b = ,/a/C. 


The height b?/a is in Figure 12.10 and Problems 25-26. The constant C= GM /h? is 
in equation (12). Put them together to find the period: 


_ 2nab_ 2na a 2m 437 
T= hoe a= Tis 5 (13) 


To think of Kepler guessing a*” is amazing. To think of Newton proving Kepler’s 
laws by calculus is also wonderful—because we can do it too. 
EXAMPLE 3. When a satellite goes around in a circle, find the time T. 


Let r be the radius and @ be the angular velocity. The time for a complete circle 
(angle 2x) is T =2n/w. The acceleration is GM/r? from gravity, and it is also rw” for 
circular motion. Therefore Kepler is proved right: 


rw? = GM/r? => o= /GM/r3 =e T =2n/w = 2nr?!/,/GM. 


42.4 EXERCISES 


en SSS — 
Read-through questions 


A central force points toward _@ _. Then R x d?R/dt? =0 For motion under a circular force, r? times __!___ is con- 


because 6. Therefore R x dR/dt isa __¢ (called H). stant. Dividing by 2 gives Kepler’s second law dA/dt = __m 
; s ‘ The first law says that the orbit is an __n___ with the sun at 
In polar coordinates, the outward unit vector Is u,= ‘ Tiss tbls eauuitions: Tat a Conic section i «id 
cos i+ __4 _. Rotated by 90° this becomes uy =__e __. The oe P = nis —2_* 


position vector R is the distance r times ___ _. The velocity aa ri cone. Using ¥ re fouitic doe +_4_ = C. Seite 
path is a conic section; it must be an ellipse because __ 


v=dR/dt is_9 _u,+__h__ uy. For steady motion around ; é : ee 
the circle r=5 with 0=4t,vis__| and |vjis__j anda 4 te ase ~ ' “ ellipse lead to the period T=_s_, 
a which is Kepler's third law. 
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1 Find the unit vectors a, and uy at the point (0, 2). The u, 
and uy components of v=i+j at that point are 


2 Find u, and u, at (3,3). If v=i+j then ¥= u,. 


Equation (5} gives dr/dt = and d0/dt = : 
3 At the point (1, 2), velocities in the direction will 
give dr/dt =0. Velocities in the direction will give 


d0/dt =0. 


4 Traveling on the cardioid r=] —cos 4 with d6/dt = 2, 
what is ¥? How long to go around the cardioid (no integration 
involved}? 


5 Ifr=e and @ = 3t, find v and a when ¢ = 1. 


6 Ifr=1 and 6=sin t, describe the path and find v and a 
from equations (5-6). Where is the velocity zero? 


7 (important) R=4 cos 5ti+4simn 5tj=4u, travels on a 
circle of radius 4 with 6 = St and speed 20. Find the compo- 
nents of ¥ and a in three systems: i and j, T and N, u, and uy. 


§ When is the circle r = 4 completed, if the speed is 8¢? Find 
v and a at the return to the starting point (4, 0). 


9 The u, component of acceleration is =0 fora 
central force, which is in the direction of . Then 
r*d6/dt is constant (new proof) because its derivative is r times 


10 If r?d0/dt = 2 for travel up the line x = 1, draw a triangle 
to show that 7 = sec 6 and integrate to find the time to reach 


(1, 1). 


11 A satellite is r = 10,000 km from the center of the Earth, 
traveling perpendicular to the radius vector at 4 km/sec. Find 
dO/dt and h. 


12 From |u,|=1, it follows that du,/dr and du,/d6 are 
to u, (Section 12.3). In fact du,/dr is and 
du, /d@ is ; 


13 Momentum is my and its derivative is ma = force. Angular 
momentum is mH = mR x vy and its derivative is = 
torque. Angular momentum is constant under a central force 
because the is zero, 


14 To find (and remember) ¥ and a in polar coordinates, start 
with the complex number re’ and take its derivatives: 


, dé. 
R=re® “ = é el + iro 
d?R 
Phe + + + + : 


Key idea: The coefficients of e and ie” are the u, and u, 
components of R, y¥, a: 


K=ru, + Ou, 


(a) Fill in the five terms from the derivative of dR/dt 
(b) Convert e to u, and ie® to uy to find a 


(c) Compare R, v, a with formulas (5-6) 
{d) (for instructors only) Why does this method work? 


Note how e=cos@+isin4@ corresponds to u,=cos@i 
+sin @j. This is one place where electrical engineers are 
allowed to write j instead of i for ./ —1. 


15 If the period is T find from (13) a formula for the distance 
a, 


16 To stay above New York what should be the period of a 
satellite? What should be its distance a from the center of the 
Earth? 


47 From T and a find a formula for the mass M. 


18 If the moon has a period of 28 days at an average distance 
of a = 380,000 km, estimate the mass of the 


19 The Earth takes 365; days to go around the sun at a 
distance a 93 million miles ~ 150 million kilometers. Find 
the mass of the sun, 


20 True or false: 


(a) The paths of all comets are ellipses. 
(b} A planet in a circular orbit has constant speed. 
{c) Orbits in central force fields are conic sections. 


21 ./GM = 2°10’ in what units, based on the Earth’s mass 
M =6-102*kg and the constant G = 6.67+10~'! Nm?/kg?? 
A force of one kg + meter/sec? is a Newton N, 


22 If a satellite circles the Earth at 9000 km from the center, 
estimate its period T in seconds. 


23 The Viking 2 orbiter around Mars had a period of about 
10,000 seconds. If the mass of Mars is M = 6.4 107 kg, what 
was the value of a? 


24 Convert l/r=C—Dcos @, or 1=Cr—Dx, into the xy 
equation of an ellipse. 


25 The distances a and ¢ on the ellipse give the constants 
in r=!(C—Dcos 8). Substitute 9=0 and @=n as in 
Figure 12.10 to find D=¢/{a?—c*) and C=af(a*—-c*)= 
a/b?. 


26 Show that x=-—c, y=b7/a lies on the ellipse 
x?j/a? + y?/b? = 1. Thus y is the height 1/C above the sun in 
Figure 12.10. The distance from the sun to the center has c? = 
a? — b?, 

27 The point x =a cos 2nt/T, y= 6 sin 2nt/T travels around 
an ellipse centered at (0, 0) and returns at time T. By symmetry 
it sweeps out area at the same rate at both ends of the major 
axis. Why does this break Kepler’s second law? 


28 If a central force is F=—ma(rju,, explain why 
d?r/dt? — r(a0/at)? = — afr). What is afr) for gravity? 
Equation (12} for g = 1/r leads to ggg + q = r7a(r). 


29 When F = 0 the body should travel in a straight 
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The equation gg, +g = 0 allows g =cos 6, in which case the 
path l/r =cos @ ts . Extra credit: Mark off equal 
distances on a line, connect them to the sun, and explain why 
the triangles have equal area. So dA/d¢ is still constant. 


30 The strong nuclear force increases with distance, a(r) =r. 
It binds quarks so tightly that up to now no top quarks have 
been seen (reliably), Problem 28 gives gos + q = 1iq°. 
(a) Multiply by gq, and integrate to find $3 - }q?= 
+C. 
*(b) Integrate again (with tables) after setting u = g?, uy = 
2440. 


31 The path of a quark in 30(b) can be written as 
r?(A + B cos 26) = 1. Show that this is the same as the ellipse 
(4+ B)x? + (A — By? = 1 with the origin at the center. The 
nucleus is not at a focus, and the pound note is correct for 
Newton watching quarks. (Quantum mechanics not 
accounted for.} 


32 When will Halley’s comet appear again? It disappeared in 


1986 and its mean distance to the sun (average of a+c¢ and 
a—c) 1s a=1.6+10° kilometers. 

33 You are walking at 2 feet/second toward the center of a 
merry-go-round that turns once every ten seconds. Starting 
from r = 20, @ = 0 fiud r(#), 6(£), (2), a(t) and the length of your 
path to the center. 


34 From Kepler's laws r = 1/(C — D cos @) and r7d@/dt =h, 
show that 


l. drjdt= —Dhsin@ = 2, d?ridt? = (: = cain 
r 
3. d?ridt? — r(dOide)? = — Ch? jr’. 


When Newton reached 3, he knew that Kepler's laws required 
a central force of Ch?/r?. This is his inverse square law. Then 
he went backwards, in our equations (8-12), to show that this 
force yields Kepler’s laws. 


35 How long is our year? The Earth’s orbit has a= 
149.57- 10° kilometers. 
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CHAPTER 13 


Partial Derivatives 


This chapter is at the center of multidimensional calculus. Other chapters and other 
topics may be optional; this chapter and these topics are required. We are back to 
the basic idea of calculus—rhe derivative. There is a function f, the variables move a 
little bit, and f moves. The question is how much f moves and how fast. Chapters 
1-4 answered this question for f(x), a function of one variable. Now we have f{x, y) 
or fix, y, z)—with two or three or more variables that move independently. As x and 
y change, f changes. The fundamental problem of differential calculus is to connect 
Ax and Ay to Af. 

Calculus solves that problem in the limit. It connects dx and dy to df. In using this 
language I am building on the work already done. You know that df/dx is the limit 
of Af/Ax. Calculus computes the rate of change—which is the slope of the tangent 
line, The goal is to extend those ideas to 


Slx, y)= x? - y? fix y= Jx? + y? 
These functions have graphs, they have derivatives, and they must have tangents. 
The heart of this chapter is summarized in six lines. The subject is differential 
calculus—smal] changes in a short time. Still to come is integra! calculus—adding 
up those small changes. We give the words and symbols for f(x, y), matched with the 
words and symbols for f(x). Please use this summary as a guide, to know where 
calculus is going. 


or or = f(x, y, z) = 2x + 3y + 4z. 


Curve y=f(x} vs. Surface z=f{x, y) 


af of of 
ax becomes two partial derivatives ax F and J ay 
d?f — Pf Pf PF OF 
—— ti se a Pee 
Bt becomes four second derivatives axz? xdy’ Byox’ dy? 
Af oa becomes the linear approximation Afx Fay + Fay 
2 ae 
tangent line becomes the tangent plane z—z = ax Xo) + ay! y— yo) 
dy dydx . dz _Ozdx , oz dy 
a an di becomes the chain rule a ac ae ay at 
a =0 becomes two maximum-minimum equations Zz =0 and z =0. AT‘ 
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The graph of y=f(x) is a curve in the xy plane. There are two variables—x is 
independent and free, y is dependent on x. Above x on the base line is the point (x, y) 
on the curve. The curve can be displayed on a two-dimensional printed page. 

The graph of z = f(x, y) is a surface in xyz space. There are three variables—x and 
y are independent, z is dependent. Above (x, y) in the base plane is the point (x, y, z) 
on the surface (Figure 13.1). Since the printed page remains two-dimensional, we 
shade or color or project the surface. The eyes are extremely good at converting two- 
dimensional images into three-dimensional understanding—they get a lot of practice. 
The mathematical part of our brain also has something new to work on—two partial 
derivatives. 

This section uses examples and figures to illustrate surfaces and their level curves. 
The next section is also short. Then the work begins. 


EXAMPLE 1 Describe the surface and the level curves for z = f(x, y) = ./x? + y?. 


The surface is a cone. Reason: ./x? + y? is the distance in the base plane from (0, 0) 
to (x, y). When we go out a distance 5 in the base plane, we go up the same distance 
5 to the surface. The cone climbs with slope 1. The distance out to (x, y) equals the 
distance up to z (this is a 45° cone). 

The level curves are circles. At height 5, the cone contains a circle of points—all 
at the same “‘level” on the surface. The plane z = 5 meets the surface z = ./x? + y? at 
those points (Figure 13.1b). The circle below them (in the base plane) is the level 
curve. 


DEFINITION A /evel curve or contour line of z = f(x, y) contains all points (x, y) that 
share the same value f(x, y) = c. Above those points, the surface is at the height z = c. 


There are different level curves for different c. To see the curve for c= 2, cut 
through the surface with the horizontal plane z= 2. The plane meets the surface 
above the points where f(x, y)= 2. The level curve in the base plane has the equation 
S(x, y) = 2. Above it are all the points at “level 2” or “level c’”’ on the surface. 

Every curve f(x, y)=c is labeled by its constant c. This produces a contour map 
(the base plane is full of curves). For the cone, the level curves are given by 
,/x* + y? =c¢, and the contour map consists of circles of radius c. 


Question What are the level curves of z= f(x, y)= x? + y?? 
Answer Still circles. But the surface is not a cone (it bends up like a parabola). The 
circle of radius 3 is the level curve x? + y? = 9. On the surface above, the height is 9. 


Vx? + y? 


y 


y 


<— base plane —> 


Fig. 43.4 The surface for z = f(x, y)=./x? + y? is a cone. The level curves are circles. 
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EXAMPLE 2 For the linear function f(x, y) = 2x + y, the surface is a plane. Its level 
curves are straight lines. The surface z=2x+ y meets the plane z=c in the line 
2x + y=c. That line is above the base plane when c is positive, and below when c is 
negative. The contour lines are in the base plane. Figure 13.2b labels these parallel 
lines according to their height in the surface. 


Question If the level curves are all straight lines, must they be parallel? 
Answer No. The surface z= y/x has level curves y/x = c. Those lines y = cx swing 
around the origin, as the surface climbs like a spiral playground slide. 


Plane z=2x+y 


Nm 


ad 


Fig. 13.2 A plane has parallel level lines. The spiral slide z = y/x has lines y/x =c. 


EXAMPLE 3 The weather map shows contour lines of the temperature function. Each 
level curve connects points at a constant temperature. One line runs from Seattle to 
Omaha to Cincinnati to Washington. In winter it is painful even to think about the 
line through L.A. and Texas and Florida. USA Today separates the contours by 
color, which is better. We had never seen a map of universities. 


oO 
WM wu MS) 
N Q 
NITC NDTB CW A 
Pp OS ' \ 
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Fig. 13.3. The temperature at many U.S. and Canadian universities. Mt. Monadnock in New Hampshire is said to be the most 
climbed mountain (except Fuji?) at 125,000/year. Contour lines every 6 meters. 
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Question From a contour map, how do you find the highest point? 
Answer The level curves form loops around the maximum point. As c increases the 
loops become tighter. Similarly the curves squeeze to the lowest point as c decreases. 


EXAMPLE 4 A contour map of a mountain may be the best example of all. Normally 
the level curves are separated hy 100 feet in height. On a steep trail those curves are 
bunched together—the trail climbs quickly. In a fiat region the contour lines are far 
apart. Water runs perpendicular to the level curves. On my map of New Hampshire 
that is true of creeks but looks doubtful for rivers. 


Question Which direction in the base plane is uphill on the surface? 
Answer The steepest direction is perpendicular to the level curves. This is important. 
Proof to come. 


EXAMPLE 5 In economics x”y is a utility function and x*y=c is an indifference curve. 


The utility function x?y gives the value of x hours awake and y hours asleep. Two 
hours awake and fifteen minutes asleep have the value f= (27)(4). This is the same as 
one hour of each: f=(17){1). Those lie on the same level curve in Figure 13.4a. We 
are indifferent, and willing to exchange any two points on a level curve. 

The indifference curve is “‘convex.” We prefer the average of any two points. The 
line between two points is up on higher level curves. 

Figure 13.4b shows an extreme case. The level curves are straight lines 4x + y=c. 
Four quarters are freely substituted for one dollar. The value is f= 4x + y dollars. 

Figure 13.4c shows the other extreme. Extra left shoes or extra right shoes are 
useless. The value (or utility} is the smailer of x and y. That counts pairs of shoes. 


y quarters right shoes 


min (x, y) 


ae x dojlars rk 
awake 2 1 2 shoes 
Aig. 13.4 Utility functions xy, 4x + y, min(x, y). Convex, straight substitution, complements. 
43.4 EXERCISES 

Read-through questions 
The graph of z=f(x,y) is a 9 in __b_ -dimensional For z =f{x, y) =x? — y?, the equation for a level curve is 
space. The curve f{x, y) = 7 lies down in the base plane. 1. This curve isa __J_. For z=x-—y the curves are 
Above this ievel curve are all points at height _ 4 _ in the k__. Level curves never cross because ___|__. They crowd 
surface. The __©  z=-==7 cuts through the surface at those together when the surface is _™_. The curves tighten to a 
points, The level curves f(x, y)=__f are drawn in the xy point when __"_. The steepest direction on a mountain is 
plane and labeled by __9g _. The family of iabeled curves is © tothe Pp 


ach 
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1 Draw the surface z =f[x, y) for these four functions: 


f= /4-%-y¥ 
fg =2~4x? + y’) 
2 The level curves of all four functions are . They 


enclose the maximum at . Draw the four curves 
flx, y) =1 and rank them by increasing radius. 


fr=2—-Jxity 
frzlzern-r 


3 Set y=0 and compute the x derivative of each function 
at x= 2, Which mountain is flattest and which is steepest at 
that point? 


4 Set y=1 and compute the x derivative of each function 
at x=1. 


For f; to f,. draw the level curves f= 0, 1,2. Also f= — 4, 


5 fs=x—y 6 fe=(e+y/ 
7 fz=xe ” 8 fs =sin(x — y) 
9 f=y—x? 10 tio = yix? 


11 Suppose the level curves are parallel straight lines. Does 
the surface have to be a plane? 


12 Construct a function whose level curve f=0 is in two 
separate pieces. 


13 Construct a function for which f=0 is a circle and f= 1 
is not. 


14 Find a function for which f= 0 has infinitely many pieces. 


15 Draw the contour map for f=xy with level curves f= 
—2, —1, 0, 1, 2. Describe the surface. 


16 Find a function f{x, y) whose level curve f= 0 consists of 
a circle and afl points inside it. 


Draw two level curves in 17-20. Are they ellipses, parabolas, 


or hyperbolas? Write ./ —2x=c as ./ =cet+2x 


before squaring both sides. 
17 f= /4x? + y? 18 f=./4x? + y?— 2x 
19 f=. /5x? + y? — 2x 20 f= /3x? + y? — 2x 


21 The level curves of f=(y—2}/(x—-—1) are 
through the point (1, 2) except that this point is not 
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22 Sketch a map of the US with lines of constant temperature 
(isotherms) based on today’s paper. 


23 (a) The contour lines of z =x? + y* — 2x —2y are circles 
around the point ; Where z is # minimum. 
(b) The contour lines of f= are the circles 
x? + y?=¢+1! on which f=c. 


24 Draw a contour map of any state or country (lines of 
constant height above sea level). Florida may be too flat. 


25 The graph of w = F(x, y, z)isa -dimensional sur- 
face in xyzw space. Its level sets F(x, y, z)=c are ~ 
dimensional surfaces in xyz space. For w=x — 2y +z those 
level sets are . For w =x? + y? +27 those level sets 
are 


26 The surface x? + y? —z?* = —1 is in Figure 13.8. There is 
empty space when z° is smaller than { because . 


27 The level sets of F=x?+y?+qz? look like footballs 
when g is , like basketballs when q is ; 
and like frisbees when q is 


28 Let T(x, y) be the driving ume from your home at (0, 0} 
to nearby towns at (x, y). Draw the level curves. 


29 (a) The level curves of f(x, y} =sin(x — y) are 

(b) The level curves of g(x, y) = sin(x* — y*) are 

(c) The level curves of A{x, y) = sin(x — y?) are 
30 Prove that if x,y, =1 and x,y, =1 then their average 
x=4x,+x2), y=4(y1 + 2) has xy 2 1. The function f= xy 
has convex level curves (hyperbolas). 


31 The hours in a day are limited by x + y= 24, Write xy 
as x7(24—~x) and maximize to find the optimal number of 
hours to stay awake. 


32 Near x = 16 draw the level curve x?y = 2048 and the line 
x+y=24. Show that the curve is convex and the line is 
tangent. 


33 The surface z=4x+y is a . The surface z= 
min(x, y) is formed from two . We are willing to 
exchange 6 left and 2 right shoes for 2 left and 4 right shoes 
but better is the average ‘ 


34 Draw a contour map of the top of your shoe. 


The central idea of differential calculus is the derivative. A change in x produces a 
change in f. The ratio Af/Ax approaches the derivative, or slope, or rate of change. 


What to do if f depends on both x and y? 


The new idea is to vary x and y one at a time. First, only x moves. If the function 
is x + xy, then Afis Ax + yAx. The ratio Af/Ax is 1+ y. The “x derivative” of x + xy 
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is 1 + y. For all functions the method is the same: Keep y constant, change x, take the 
limit of AffAx: 


Fo, y= lim Mn in St ofl) 


ox Ax-0AxX 4x70 Ax 


DEFINITION 


(1) 


On the left is a new symbol of/éx. It signals that only x is allowed to vary—@f/dx is 
a partial derivative. The different form @ of the same letter (still say ““d”) is a reminder 
that x is not the only variable. Another variable y is present but not moving. 


EXAMPLE 4 f(x, y)=x?y?+ xy+ y 2, y)= 2xy? + y+0. 


Do not treat y as zero! Treat it as a constant, like 6. Its x derivative is zero. 
If f(x) = sin 6x then df/dx = 6 cos 6x. If f(x, y) = sin xy then df/dx = y cos xy. 
Spoken aloud, of/éx is still “d fd x.” It is a function of x and y. When more is 
needed, call it “the partial of fwith respect to x.” The symbol f’ is no longer available, 
since it gives no special indication about x. {ts replacement f, is pronounced “fx”’ or 
“f sub x,” which is shorter than 6f/dx and means the same thing. 
We may also want to indicate the point (xp, yo) where the derivative is computed: 


af af (a 
Ax Oro» Yo) or fi(Xp,¥o) OF ax — or just (2). 


EXAMPLE 2 f(x, y)=sin2xcosy f,=2cos2xcosy (cos y is constant for d/éx) 


The particular point (x9, ¥o) is (0,0). The height of the surface is f(0,0)=0. 
The slope in the x direction is f,=2. At a different point xy=7, yo =x we find 


Sn, x) = — 2. 
Now keep x constant and vary y. The ratio Af/Ay approaches df/dy: 


Sfx y= tim & = tim Mt od SHY) 
al Ayo A : 


y  Ay70 Ay 2) 


This is the slope in the y direction. Please realize that a surface can go up in the x 
direction and down in the y direction. The plane f(x, y) = 3x ~ 4y has f, = 3 (up) and 
J, = — 4 (down). We will soon ask what happens in the 45° direction. 


Se pe ee Oe ed 
EXAMPLE 3 f(x, y) x*+y ax Seay j ay Jone i 
The x derivative of \/x? + y? is really one-variable calculus, because y is constant. 
The exponent drops from } to — }, and there is 2x from the chain rule. This distance 
function has the curious derivative Of/0x = x/f. 

The graph is a cone. Above the point (0,2) the height is ./0? + 27=2. The 
partial derivatives are f, = 0/2 and f, = 2/2. At that point, Figure 13.5 climbs in the 
y direction. It is level in the x direction. An actual step Ax will increase 0? + 2? to 
(Ax)? + 2?. But this change is of order (Ax)? and the x derivative is zero. 


Figure 13.5 is rather important. It shows how df/ax and df/@y are the ordinary 
derivatives of f{x, yo) and f(xo, y). It is natural to call these partial functions. The first 
has y fixed at yg while x varies. The second has x fixed at xp while y varies. Their 
graphs are crass sections down the surface—cut out by the vertical planes y = yo and 
X = Xo. Remember that the level curve is cut out by the horizontal plane z = c. 
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FO, y) = VO? + y? 


Fig. 13.5 Partial functions ./x? + 2? and ./0? + y? of the distance function f= ./x? + y’. 


The limits of Af/Ax and Af/Ay are computed as always. With partial functions 
we are back to a single variable. The partial derivative is the ordinary derivative of a 
partial function (constant y or constant x). For the cone, éf/dy exists at all points 
except (0, 0). The figure shows how the cross section down the middle of the cone 
produces the absolute value function: f(0, y) = |y|. It has one-sided derivatives but not 
a two-sided derivative. 


Similarly éf/éx will not exist at the sharp point of the cone. We develop the idea 
of a continuous function f(x, y) as needed (the definition is in the exercises). Each 
partial derivative involves one direction, but limits and continuity involve all direc- 
tions. The distance function is continuous at (0, 0), where it is not differentiable. 


EXAMPLE 4 fix, y)=y?—x? af/éx=—2x Af ay =2y 


Move in the x direction from (1, 3). Then y? — x? has the partial function 9 — x?. 
With y fixed at 3, a parabola opens downward. In the y direction (along x = 1) the 
partial function y” — 1 opens upward. The surface in Figure 13.6 is called a hyperbolic 
paraboloid, because the level curves y? — x? = c are hyperbolas. Most people call it a 
saddle, and the special point at the origin is a saddle point. 

The origin is special for y? — x? because both derivatives are zero. The bottom of 
the y parabola at (0, 0) is the top of the x parabola. The surface is momentarily flat in 
all directions. It is the top of a hill and the bottom of a mountain range at the same 


f(x.0) = =-x? 


Fig. 13.6 A saddle function, its partial functions, and its level curves. 
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time. A saddle point is neither a maximum nor a minimum, although both derivatives 
are zero. 


Note Do not think that f(x, y) must contain y? and x? to have a saddle point. The 
function 2xy does just as well. The level curves 2xy=c are still hyperbolas. The 
partial functions 2xyp and 2xgy now give straight lines——which is remarkable. Along 
the 45° line x = y, the function is 2x? and climbing. Along the —45° line x = — y, 
the function is ~ 2x? and falling. The graph of 2xy is Figure 13.6 rotated hy 45°. 


EXAMPLES 5-6 f(x, y,z)=x?7+y? +2? 9 P(T. V)=nRTIV 


Example 5 shows more variables. Example 6 shows that the variables may not be 
named x and y. Also, the function may not be named f! Pressure and temperature 
and volume are P and T and V. The letters change but nothing else: 


6P/eT=nR{V e@P/eV=—nRT/V? (note the derivative of 1/V). 


There is no 0P/8R because R is a constant from chemistry—not a variable. 

Physics produces six variables for a moving body—the coordinates x, y, z and the 
momenta p,, p,, P;. Economics and the social sciences do better than that. If there 
are 26 products there are 26 variables—sometimes 52, to show prices as well as 
amounts. The profit can be a complicated function of these variables. The partial 
derivatives are the marginal profits, as one of the 52 variables is changed. A spreadsheet 
shows the 52 values and the effect of a change. An infinitesimal spreadsheet shows 
the derivative. 


SECOND DERIVATIVE 


Genius is not essential, to move to second derivatives. The only difficulty is that two 
first derivatives f, and f, lead to four second derivatives f,, and f,, and f,, and f,,. 
(Two subscripts: f,, is the x derivative of the x derivative. Other notations are 
6? f/dx? and é?f/dxdy and 4? f/éyéx and 0 f/dy”.) Fortunately f,, equals f,,, as we 
see first by example. 


EXAMPLE7 f=x/y has f,=1/y, which has f,,=0 and f,,= — 1/y?. 


The function x/y is linear in x (which explains f,, = 0). Its y derivative is f, = — x/y?. 
This has the x derivative f,, = —1/y’. The mixed derivatives f., and f,, are equal. 

In the pure y direction, the second derivative is f,, = 2x/y°®. One-variable calculus 
is sufficient for all these derivatives, because only one variable is moving. 


EXAMPLES f{=4x?+3xy+y? has f,=8x+3y and f,=3x+2y. 


Both “‘cross derivatives” f,, and f,, equal 3. The second derivative in the x direction 
is d*f/dx?=8 or f,,=8. Thus “f{xx" is “d second fdx squared.” Similarly 
& f/éy" = 2. The only change is from d to 4. 


If {(x, y) has continuous second derivatives then {,, =f,,. Problem 43 sketches a proof 
based on the Mean Value Theorem. For third derivatives almost any example shows 
that foxy =Sayx =Syxx 18 different from f,,. =Spxy =Sieyy- 


Question How do you plot a space curve x(t), y(t), z(t) in @ plane? One way is to look 
parallel to the direction (1,1, 1). On your XY screen, plot X¥ =(y— x)/,/2 and 
Y = (22 — x — y)/,/6. The line x = y =z goes to the point (0, 0)! 
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How do you graph a surface z= {{x, yy? Use the same X and Y. Fix x and let y 
vary, for curves one way in the surface. Then fix y and vary x, for the other partial 
function, For a parametric surface like x = (2 + v sin $u) cos u, y=(2 + v sin Zu) sin u, 
2 =n 08 4u, vary u and then vp. Dick Williamson showed how this draws a one-sided 


“Mobius strip.” 


43.2 EXERCISES 


Read-through questions 


The _@_ derivative af/éy comes from fixing _b and 
moving _¢ _. It is the limit of _d__. If f=e?* sin y then 
Oidx=_@ and afjay=_¢ __. If f=(x? + y*)'? then f, = 

gq andf,=_h__. At (xo, yo) the partial derivative f, is 
the ordinary derivative of the {| function ffx, y9). Simi- 
larly f, comes from f{__|___). Those functions are cut out by 
vertical planes x =x, and __k __, while the level curves are 
cutout by |! planes. 


The four second derivatives are f,,, __™ n o.. 


For f= xy they are __p__. For f=cos 2x cos 3y they are 
q__. In those examples the derivatives «and __3 
are tbe same, That is always true when the second derivatives 
are _f . At the origin, cos 2x cos Jy is curving ou in 
the x and y directions, while xy goes__v___ in the 45° direc- 


tion and win the —45° direction. 


Find 2f/x and of/dy for the functions in 1-12. 


1 3x—y+x"*y? 2 sin(3x —y)+y 
3 x#y?-x?-e@ 4 xe*+ 

5 (x+y(x—y) 6 Ja? ty? 

7 (tty? 8 In(x + 2y) 


9 In,/x? + y? 10 + 


11 tan~+(y/x) 12 In(xy) 


Compute f.,, fi, =f, and f,, for the functions in 13-20. 


13 x7 4+ 3xy+42y? 14 (x +39)? 
15 (x + iy)? 16 et” 
17 f/x? + y? 18 (x + yy" 
19 cos ax cos by 20 1/(x + iy) 


Find the domain aad range {all inputs aad outputs) for the 
functions 21-26. Then compute J, f, f.,- 


21 1/{x— yP 22 /x* + y?—t? 


23 (y —x)(z —8) 24 In(x + 8) 


25 x! Why does this equal ¢°*? 26 cos x cos” 'y 


27 Verify f,, =f, for f=x"y’. If f,,=0 then f, does not 
depend on and f, is independent of . The 
function must have the form f(x, y) = G(x)+ 


28 In terms of », compute f, and f, for f(x, y) = J” oft} dt. First 
vary x. Then vary y. 


29 Compute af/éx for f= J} v{t}dt. Keep y constant. 
30 What is f(x, y)= |” dt/t and what are f, and f,? 


31 Calculate all eight third derivatives fr... fixp, ++ 
x?y>. How many are different? 


of f= 


In 32-35, choose g{y} so that ftx, y}=e*g{y) satisfies the 
equation. 


32 f,+f,=0 33 f£,=Tf, 
34 fp =Sex 35 fir =4fy 


36 Show that 2~'/2e~*/*" satisfies the Aeat equation f,~f,.. 
This f{x, t) is the temperature at position x and time t due to 
a point source of heat at x =0, ¢ =0. 


37 The equation for heat flow in the xy plane is f, =f... + fy. 
Show that fx, y, {:=e7*' sin x sin y is a solution, What expo- 
nent in f=e sin 2x sin 3p gives a solution? 


38 Find solutions f(x, y) =e sin mx cos ny of the heat 
equation f,=f..+f,,- Show that r7'e7**te-7"* is also a 
solution. 


39 The basic wave equation is f,=/f,,. Verify that fix, t}= 
sin(x + ¢} and f(x, t)=sin(x—t) are solutions. Draw hoth 
graphs at t= 2/4. Which wave moved to the left and which 
moved to the right? 


40 Continuing 39, the peaks of the waves moved a distance 
Ax= in the time step At = 1/4. The wave velocity 
is Ax/At= ; 


41 Which of these satisfy the wave equation f,, =c?f,.? 


sin(x—ct), cos{x+ct}), e~"%, e&—e, & cos ct. 


42 Suppose af/dt = af/éx. Show that 07//at? = 87 f/ax?. 
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43 The proof of /,,=/f,, studies f(x, y} in a small rectangle. 
The top-bottom difference is g{x}=/f{x, B}—f(x, A). The 
difference at the corners 1, 2, 3, 4 is: 


Q@=(f.—-f3]-[h—-fl 
= g(b)— g(a) (definition of g} 
=(b—a)g.(c) (Mean Value Theorem) 


=(b—a)L fac, B)—f,{c, A)] (compute g,) 
={b—a\(B—A}ffc,C) (MVT again) 
(a} The right-left difference is A(y) =/(6, y) —f(a, py). The 
same @ is A{(B)—}{A), Change the steps to reach Q@ = 
{B— A)(b — a}f,.fc*, C*). 
(b} The two forms of @ make f,, at {c, C} equal to f,, at 
(c*, C*), Shrink the rectangle toward (a, A). What assump- 
tion yields f,, =f,, at that typtcal point? 


44 Find éf/éx and éf/dy where they exist, based on equations 
{1) and (2). 


faf=lxyl  (b)f=x?+y? ifx #0, f=0if x=0 


Questions 45-52 are about limits in two dimensions. 


45 Complete these four correct definitions of limit: 1 The 
points {x,, y,) approach the point (a, 5) if x, converges to a 
and . 2 For any circle around {a, 5), the points (x,, ¥,) 
eventually go the circle and stay . 3 The 
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distance from (x,, y,} to {a, 5) is and it approaches 
. 4For any ¢ > O there is an N such that the distance 
<e for all n> 


46 Find (x3, y2} and (x, y,) and the limit (a, 5) if it exists. 
Start from (x9, Yo) = {L, 0). 

(a) (X2, Yn} = 1/(n + 1), afl + 1)) 

(b} (x, Yn) = (x,- 1s Va- 1) 

(c} (x, Ya) =(),- 1: X,-1) 

(d} (x,, Ya) ={(Xq-1 ny Ya-19 Xa-1 — ¥n-1) 
47 (Limit of f{x, y)} 1 Informal definition: the numbers 
S(%, ¥,) approach L when the points (x,, y,) approach (a, 5). 
2 Epsilon-delta definition: For each ¢ > 0 there is a 6 >0 such 
that | f(x, ») — L] is less than when the distance from 
(x, ¥) to (a, 5) is . The value of f at {a, 5) is not 
involved. 


48 Write down the limit L as (x, y) — (a, b}. At which points 
(a, b) does f(x, y} have no limit? 

(a) fix, y}= f/x? + y? (b) flx, y) = x/y 

(c) fly y) = 1x + y) (d).flx, ») = xpfx? + y’) 
In (d) find the limit at (0, 0) along the line y = mtx. The limit 
changes with m, so L does not exist at (0, 0). Same for x/y. 
49 Definition of continuity: f(x, y) is continuous at (a, b) if 
J (a, b) is defined and f(x, y) approaches the limit as 
(x, y) approaches (a, b). Construct a function that is not con- 
tinuous at (1, 2). 
50 Show that x7y/(x*+y?)-»0 along every straight line 
y = mx to the origin. But traveling down the parabola y = x’, 
the ratio equals 
51 Can you define f(0, 0}so that f(x, p)is continuous at (0, 0)? 

fajf=ild+iy—t (b)fHU+xP ()f=x'*”, 
52 Which functions approach zero as (x, y} -> (0, 0} and why? 

xy? x?y? xy 
(a) 4p 


Over a short range, a smooth curve y = f(x) is almost straight. The curve changes 
direction, but the tangent line y — yg = f'(xq)(x — Xq) keeps the same slope forever. 
The tangent line immediately gives the linear apptoximation to y=/f(x): 


¥® Yo + f'(Xo)(X — Xo)- 


What happens with two variabies? The function is z=f(x, y), and its graph is a 
surface. We are at a point on that surface, and we are near-sighted. We don’t see far 
away. The surface may curve out of sight at the horizon, or it may be a bowl or a 
saddle. To our myopic vision, the surface looks flat. We believe we are on a plane 
(not necessarily horizontal), and we want the equation of this tangent plane. 


13.3 Tangent Planes and Linear Approximations 


Notation The basepoint has coordinates x9 and yo. The height on the surface is 
29 =f (Xo, Yo). Other letters are possible: the point can be (a, b) with height w. The 
subscript 9 indicates the value of x or y or z or Of/0x or Of/dy at the point. 


With one variable the tangent line has slope df/dx. With two variables there 
are two derivatives df/Ox and éf/éy. At the particular point, they are (€f/0x), and 
(Of/Cy)o. Those are the slopes of the tangent plane. Its equation is the key to this 
chapter: 


13A The tangent plane at (xo, yo, 29) has the same slopes as the surface z = 
J (x, y). The equation of the tangent plane (a linear equation) is 


0 of 
Z-Zo= (2) —Xo)'+ (Z) 0 — Yo). (1) 


The normal vector N to that plane has components (@f/@x)o, (@f/@y)o, —1. 


EXAMPLE 1 Find the tangent plane to z= 14— x? — y? at (xo, Vo. Zo) = (I, 2, 9). 


Solution The derivatives are 0f/6x=—2x and of/éy= —2y. When x=1 and y=2 
those are (0f/0x)y = — 2 and (@f/dy)) = —4. The equation of the tangent plane is 


2—9= —Ax—-—1)—4y—2) or z+2x+4y=19. (2) 


This z(x, y) has derivatives —2 and —4, just like the surface. So the plane is tangent. 

The normal vector N has components —2, —4, —1. The equation of the normal 
line is (x, y, z) = (1, 2, 9) + (—2, —4, —1). Starting from (1, 2, 9) the line goes out along 
N—perpendicular to the plane and the surface. 


x 


Fig. 13.7 The tangent plane contains the x and y tangent lines, perpendicular to N. 


Figure 13.7 shows more detail about the tangent plane. The dotted lines are the x 
and y tangent lines. They lie in the plane. All tangent lines lie in the tangent plane! 
These particular lines are tangent to the “partial functions”—where y is fixed at yp = 
2 or x is fixed at x9 = 1. The plane is balancing on the surface and touching at the 
tangent point. 

More is true. In the surface, every curve through the point is tangent to the plane. 
Geometrically, the curve goes up to the point and “kisses” the plane.t The tangent 
T to the curve and the normal N to the surface are perpendicular: T+ N = 0. 


TA safer word is “osculate.”” At saddle points the plane is kissed from both sides. 
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EXAMPLE 2 Find the tangent plane to the sphere z* = 14 — x? — y? at (1, 2, 3). 


Solution Instead of z= 14— x? — y? we have z=./14—x?— y?. At x9 =1, yo =2 
the height is now Zp = 3. The surface is a sphere with radius ./14. The only trouble 
from the square root is its derivatives: 


«@ Ta a aaa oz___—-2y) (3) 


éx  éx 14—x?—y? éy /14-x?-y? 
At (1, 2) those slopes are — 4 and — 4. The equation of the tangent plane is linear: 
z—3=-—4(x—-1)—3(y—2). I cannot resist improving the equation, by multiplying 
through by 3 and moving all terms to the left side: 


tangent plane to sphere: 1(x — 1) + 2(y — 2) + 3(z — 3)=0. (4) 


If mathematics is the “science of patterns,” equation (4) is a prime candidate for study. 
The numbers 1, 2,3 appear twice. The coordinates are (Xo, yo, Zo) =(1, 2, 3). The 
normal vector is li + 2j + 3k. The tangent equation is 1x + 2y + 3z = 14. None of this 
can be an accident, but the square root of 14— x? — y? made a simple pattern look 
complicated. 

This square root is not necessary. Calculus offers a direct way to find dz/dx— 
implicit differentiation. Just differentiate every term as it stands: 


x? + y?+27=14 leadsto 2x+2z 6z/éx=0 and 2y+2zé2/éy=0. (5) 
Canceling the 2’s, the derivatives on a sphere are — x/z and — y/z. Those are the same 
as in (3). The equation for the tangent plane has an extremely symmetric form: 


x 
2—29= — 2(x-xo)— 2(y— Yo) OF XolX— Xo) + Yo(V— Yo) + 202-20) =0. 6) 
0 tt) 


Reading off N = xoi+ yoj + Zok from the last equation, calculus proves something 
we already knew: The normal vector to a sphere points outward along the radius. 


= Xi + Yod — Zgk N = Xi + Yod - Zok 


Fig. 13.8 Tangent plane and normal N for a sphere. Hyperboloids of 1 and 2 sheets. 


THE TANGENT PLANE TO Ax, y, 2 =c 


The sphere suggests a question that is important for other surfaces. Suppose the 
equation is F(x, y, z)=c instead of z = f(x, y). Can the partial derivatives and tangent 
plane be found directly from F? 

The answer is yes. It is not necessary to solve first for z. The derivatives of F, 


° ° > 
x*+y*-2-=]1 xX“ +y*-z 
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computed at (X9, Yo, Zo), give a second formula for the tangent plane and normal 
vector. 


43B The tangent plane to the surface F(x, y, z)=c has the linear equation 


(F) xa) (F )o- ve)+(F) @-20= 0. (7) 


The normal vector is N = ge i+ of j+ Ue k. 
Ox Jo Oy )o 62 Jo 


Notice how this includes the original case z=/f(x, y). The function F becomes 
fix, y)— z. Its partial derivatives are 0f/6x and éf/dy and —1. (The —1 is from the 
derivative of — z.) Then equation (7) is the same as our original tangent equation (1). 


EXAMPLE 3 The surface F = x? + y* — z* =c is a Ayperboloid. Find its tangent plane. 
Solution The partial derivatives are F, = 2x, F, = 2y, F, = — 2z. Equation (7) is 
tangent plane: 2x0(X — Xo) + 2yol(¥ — Yo) — 2Z0(z — Zo) = 0. (8) 


We can cancel the 2’s. The normal vector is N = xgi+ yoj — zok. For c>0 this 
hyperboloid has one sheet (Figure 13.8). For c=0 it is a cone and for c < 0 it breaks 
into two sheets (Problem 13.1.26). 


DIFFERENTIALS 


Come back to the linear equation z — zy = (62/0x)o(x — Xo) + (62/Ay)o(y — Yo) for the 
tangent plane. That may be the most important formula in this chapter. Move along 
the tangent plane instead of the curved surface. Movements in the plane are dx and 
dy and dz—while Ax and Ay and Az are movements in the surface. The d’s are 
governed by the tangent equation— the A’s are governed by z = f(x, y). In Chapter 2 
the d’s were differentials along the tangent line: 


dy =(dy/dx)dx (straight line) and Ayx(dy/dx)Ax (on the curve). (9) 


Now y is independent like x. The dependent variable is z. The idea is the same. The 
distances x — Xp and y— yo and z — Zp (on the tangent plane) are dx and dy and dz. 
The equation of the plane is 


z = (0z/@x)gdx + (0z/Oy)ody or df=f,dx +f,dy. (10) 


This is the total differential. All letters dz and df and dw can be used, but éz and of 
are not used. Differentials suggest small movements in x and y; then dz is the resulting 
movement in z. On the tangent plane, equation (10) holds exactly. 

A “centering transform” has put Xo, Yo, Zo at the center of coordinates. Then the 
“zoom transform” stretches the surface into its tangent plane. 


EXAMPLE 4 The area of a triangle is A =4ab sin @. Find the total differential dA. 


Solution The base has length b and the sloping side has length a. The angle between 
them is 6. You may prefer A = $bh, where h is the perpendicular height a sin @. Either 
way we need the partial derivatives. If A = 3ab sin 0, then 


A = 5bsino oS aid os 2 
Ja 


2 db 2 ap paren’. Gh 
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These lead immediately to the total differential dA (like a product rule): 


aA GA 0A bo. _ \ 
=f — ES + a sist = i I 
dA ($5 a A ($3 )ao ($5 ao 56 sin 0 da+ 54 sin 6 db+ sab cos 6 dé. 


EXAMPLE 5 The volume of a cylinder is V = mr*h. Decide whether V is more sensitive 
to a change from r= 1.0 to r= 1.1 or from h= 1.0 to h= 1.1. 


Solution The partial derivatives are @V/ér = 2nrh and GV/@h=nr?. They measure 
the sensitivity to change. Physically, they are the side area and base area of the 
cylinder. The volume differential dV comes from a shell around the side plus a layer 
on top: 

dV = shell + layer = 2arh dr + ar7dh. (12) 


Starting from r=h=1, that differential is dV=2ndr + ndh. With dr = dh=.1, the 
shell volume is .2x and the layer volume is only .1z. So V is sensitive to dr. 

For a short cylinder like a penny, the layer has greater volume. V is more sensitive 
to dh. In our case V = ar7h increases from z(1)° to 2(1.1)°. Compare AV to dV: 


AV=nx(1.1)? — n(1)° = .3312 and dV = 2n(.1) + n(.1) = .300z. 


The difference is AV—dV=.031n. The shell and layer missed a small volume in 
Figure 13.9, just above the shell and around the layer. The mistake is of order 
(dr)? + (dh)?. For V= nr7h, the differential dV = 2arh dr + nr?dh is a linear approxima- 
tion to the true change AV. We now explain that properly. 


LINEAR APPROXIMATION 


Tangents lead immediately to linear approximations. That is true of tangent planes as 
it was of tangent lines. The plane stays close to the surface, as the line stayed close 
to the curve. Linear functions are simpler than f(x) or f(x, y) or F(x, y, z). All we 
need are first derivatives at the point. Then the approximation is good near the point. 

This key idea of calculus is already present in differentials. On the plane, df equals 
f,dx + f,dy. On the curved surface that is a linear approximation to Af: 


43C The linear approximation to f(x, y) near the point (xo, yo) is 


F Fy 
f(x, y) = f(X0, Yo) + (2) — Xo) + (2) 0 — Yo). 


In other words Af~f,Ax +f,Ay, as proved in Problem 24. The right side of (13) 
is a linear function f,(x, y). At (Xo, Vo), the functions f and f, have the same slopes. 
Then f(x, y) curves away from f, with an error of “second order:” 


fix, y)—filx, y)| < M(x — Xo)? + (y — Yo)?]. (14) 


This assumes that f,,, f.y, and f,, are continuous and bounded by M along the line 
from (Xo, Vo) to (x, y). Example 3 of Section 13.5 shows that | f,,] < 2M along that line. 
A factor + comes from equation 3.8.12, for the error f—f;, with one variable. 

For the volume of a cylinder, r and h went from 1.0 to 1.1. The second derivatives 
of V = nr7h are V,, = 27h and V,, = 2zr and V,,,=0. They are below M = 2.22. Then 
(14) gives the error bound 2.2z(.1? + .1*) = .044z, not far above the actual error .0317z. 
The main point is that the error in linear approximation comes from the quadratic 
terms—those are the first terms to be ignored by f,. 
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P 
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AY 
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q 
PRprerReN Eee 20 40 60 

Fig. 43.9 Shell plus layer gives dV = .300z. Fig. 43.40 Quantity Q and price P move with the lines. 


Including top ring gtves AV = .331n. 


EXAMPLE 6 Find a-linear approximation to the distance function r = ,/x? + y?. 
Solution The partial derivatives are x/r and y/r. Then Ar =(x/rJAx + (p/r)Ay. 


For (x, y, r) near (1, 2, /5): x? + y? & /1? #2? + (x — A/S + Ay — D//S. 


If y is fixed at 2, this is a one-variable approximation to ./x? + 2?. If x is fixed at 1, 
it is a linear approximation in y. Moving both variables, you might think dr would 
involve dx and dy in a square root, It doesn’t. Distance involves x and y in a square 
root, but change of distance is lincar in Ax and Ay—to a first approximation. 

There is a rough point at x=0, y=0. Any movement from (0, 0) gives Ar = 
Jax + (Ay)?. The square root has returned. The reason is that the partial deriva- 
tives x/r and y/r are not continuous at (0,0). The cone has a sharp point with no 
tangent plane. Linear approximation breaks down. 


The next example shows how to approximate Az from Ax and Ay, when the 
equation is F(x, y,z)=¢. We use the implicit derivatives in (7) instead of the explicit 
derivatives in (1). The idea is the same: Look at the tangent equation as a way to 
find Az, instead of an equation for z. Here is Example 6 with new letters. 


EXAMPLE? From F = — x*— y? +2? =0 find a linear approximation to Az. 
Solution (implicit derivatives) Use the derivatives of F: —2xAx — 2yAy + 2zAz x 0. 
Then solve for Az, which gives Az = (x/z)Ax + (y/z)/Ay—the same as Example 6. 
EXAMPLE 8 How does the equilibrium price change when the supply curve changes? 


The equilibrium price is at the intersection of the supply and demand curves 
(supply = demand). As the price p rises, the demand q drops (the slope is — .2): 


demand line DD: p= — .2g + 40. (15) 
The supply (also q} goes up with the price. The slope s is positive (here s = .4): 
supply line SS: p=sq+¢t= .4q + 10. 


Those lines are in Figure 13.10. They meet at the equilibrium price P = $30. The 
quantity Q = 50 is available at P (on SS) and demanded at P(on DD). So it is sold. 

Where do partial derivatives come in? The reality is that those lines DD and SS 
are not fixed for all time. Technology changes, and competition changes, and the 
value of inoney changes. Therefore the lines move. Therefore the crossing point (Q, P) 
also moves. Please recognize that derivatives are hiding in those sentences, 
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Main point: The equilibrium price P is a function of s and t. Reducing s by better 
technology lowers the supply line to p= .3q + 10. The demand line has not changed. 
The customer is as eager or stingy as ever. But the price P and quantity Q are 
different. The new equilibrium is at @ = 60 and P = $28, where the new line XX 
crosses DD. 

If the technology is expensive, the supplier will raise t when reducing s. Line YY 
is p= .3g + 20. That gives a higher equilibrium P = $32 at a lower quantity Q = 40— 
the demand was too weak for the technology. 


Calculus question Find éP/cs and ¢P/ét. The difficulty is that P is not given as 
a function of s and ¢. So take implicit derivatives of the supply = demand equations: 


supply =demand: P= — .20+40=sQ+t (16) 
s derivative: P,= —.20,=50,+@Q (note t,=0) 
t derivative: P,= ~ .20,=sQ,+1 {note t,= 1) 


Now substitute s = .4, t= 10, P = 30, @ = 50. That is the starting point, around which 
we are finding a linear approximation. The last two equations give P, = 50/3 and 
P, = 1/3 (Problem 25), The linear approximation is 


P = 30+ 50(s — .4)/3 + (t — 10)/3. (17) 


Comment This example turned out to be subtle (so is economics). I hesitated before 
including it. The equations are linear and their derivatives are easy, but something 
in the problem is hard—there is no explicit formula for P. The function P(s, t) is not 
known. Instead of a point on a surface, we are following the intersection of two lines. 
The solution changes as the equation changes. The derivative of the solution comes from 
the derivative of the equation. 


Summary The foundation of this section is equation (1) for the tangent plane. Every- 
thing builds on that—total differential, linear approximation, sensitivity to small 
change. Later sections go on to the chain rule and “directional derivatives” and 
“gradients.” The central idea of differential calculus is Af = f,Ax + f,Ay. 


NEWTON'S METHOD FOR TWO EQUATIONS 


Linear approximation is used to solve equations. To find out where a function is zero, 
look first to see where its approximation is zero. To find out where a graph crosses 
the xy plane, look to see where its tangent plane crosses. 

Remember Newton’s method for f(x) =0. The current guess is x,. Around that 
point, f(x) is close to f(x,) +(x — x,)f"(x,). This is zero at the next guess x,., = 
x, ~fix,/f'(x,). That is where the tangent line crosses the x axis. 


With two variables the idea is the same— but two unknowns x and y require two 
equations. We solve g(x, y)=0 and A(x, y)=0. Both functions have linear approxi- 
mations that start from the current point (x,, y,)—where derivatives are computed: 


&(X, Y) B(Xn> Va) + (CRIEx)(x — X,) + (Eg/EYY — Yn) 


18 
h(x, y) © A(X_s Yn) + (Ch/x){x — x,) + (Ch/Ey)y ~ Yn). os 


The natural idea is to set these approximations to zero. That gives linear equations 
for x — x, and y— y,. Those are the steps Ax and Ay that take us to the next guess 


13.3 Tangent Planes and Linear Approximations 


in Newton’s method: 


43D Newton’s method to solve g(x, y) = 0 and h(x, y) = 0 has linear equations 
for the steps Ax and Ay that go from (x,, y,) to (X,41, Yn+1): 


og 5 oh OWN es 
(2)ax+(*)ar=—atsnr9 and (Hace (B)ay= en sed 09 


EXAMPLE9 g=x°— y=Oandh= y?— x=O have 3 solutions (1, 1), (0, 0),(—1, —1). 


I will start at different points (xo, yo). The next guess is x; = X9 + Ax, y; = Yo + Ay. 
It is of extreme interest to know which solution Newton’s method will choose—if it 
converges at all. I made three small experiments. 


1. Suppose (xo, yo) = (2, 1). At that point g = 2? - 1=7 and h= 13-—2=~—1. The 
derivatives are g, = 3x? = 12, g,= — 1, h, = — 1, h, = 3y? = 3. The steps Ax and Ay 
come from solving (19): 


12Ax— Ay=-7 Ax = - 4/7 X; =X9 t+ Ax= 10/7 
= => 
—Ax + 3Ay= +1 Ay=+ 1/7 Y; = Yo t+ Ay= 8/7. 


This new point (10/7, 8/7) is closer to the solution at (1, 1). The next point is (1.1, 
1.05) and convergence is clear. Soon convergence is fast. 


2. Start at (x9, yo) = (4, 0). There we find g= 1/8 and h= — 1/2: 
(3/4)Ax -— Ay=-— 1/8 Ax = — 1/2 X; =X9 + Ax=0 
> => 
—Ax+ OAy= + 1/2 Ay=+ 1/4 V1 = Yo t Ay= — 1/4. 


Newton has jumped from (3, 0) on the x axis to (0, — 4) on the y axis. The next step 
goes to (1/32, 0), back on the x axis. We are in the “basin of attraction”’ of (0, 0). 


3. Now start further out the axis at (1, 0), where g= 1 and h= — 1: 


3Ax- Ay=-1 Ax=-1 X;=X9 t+ Ax=0 
=> 


=> 
—Ax + O0Ay= +1 Ay=-2 Y= yot Ay= —2. 


Newton moves from (1,0) to (0, —2) to (16,0). Convergence breaks down—the 
method blows up. This danger is ever-present, when we start far from a solution. 


Please recognize that even a small computer will uncover amazing patterns. It can 
start from hundreds of points (x9, yo), and follow Newton’s method. Each solution 
has a basin of attraction, containing all (xo, yo) leading to that solution. There is also 
a basin leading to infinity. The basins in Figure 13.11 are completely mixed together— 
a color figure shows them as fractals. The most extreme behavior is on the borderline 
between basins, when Newton can’t decide which way to go. Frequently we see chaos. 

Chaos is irregular movement that follows a definite rule. Newton’s method deter- 
mines an iteration from each point (x,, y,) to the next. In scientific problems it 
normally converges to the solution we want. (We start close enough.) But the com- 
puter makes it posible to study iterations from faraway points. This has created a 
new part of mathematics—so new that any experiments you do are likely to be 
original. 
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Section 3.7 found chaos when trying to solve x? + 1 = 0. But don’t think Newton’s 
method is a failure. On the contrary, it is the best method to solve nonlinear equations. 
The error is squared as the algorithm converges, because linear approximations have 
errors of order (Ax)? + (Ay)?. Each step doubles the number of correct digits, near 
the solution. The example shows why it is important to be near. 


Fig. 13.44 The basins of attraction to (1, 1), (0, 0), (—1, —1), and infinity. 


43.3 EXERCISES 


Read-through questions 
The tangent line to y=/(x) is y—yo =__9 _. The tangent 


plane to w=/f(x, y) is w— Wo = __®__. The normal vector is 
N=_¢__. For w=x*+y* the tangent equation at (1, 1, 2) 
is__d__. The normal vector is N=__@ _. Fora sphere, the 


direction of Nis __f 


The surface given implicitly by F(x, y,z)=c has tangent 
equation (0F/0x)o(x —X9) + __9__. For xyz=6 at (1, 2, 3) 
the tangent plane is _h__. On that plane the differentials 
satisfy | = dx +__J__dy+__k__dz=0. The differential 


of z=f{x, y) isdz=__!__. This holds exactly on the tangent 
plane, while Az~__™__ holds approximately on the __" 


The height z = 3x + 7y is more sensitive to a change in__° 
than in x, because the partial derivative __P ___ is larger than 
q 


The linear approximation to f(x, y) is f(X9, Yo) +__! 
This is the same as Af~ __s__ Ax +__t___ Ay. The error is 
of order _u_. For f=sinxy the linear approximation 
around (0, 0) is f, =__v__. We are moving along the __W 
instead of the __x . When the equation is given as 
F(x, y,z)=c, the linear approximation is y  Ax+ 


z Ay+_A_Az=0. 


Newton’s method solves g(x, y)=0 and h(x, y)=0 by a 
8 approximation. Starting from x,, y, the equations are 
replaced by © and_© _. The steps Ax and Ay go to the 


next point __— ___. Each solution has a basin of _ F _. Those 


basins are likely to be_G_. 


In 1-8 find the tangent plane and the normal vector at P. 
1 z=./x*+y', P=(0, 1,1) 
2x+y+z=17, P=(3, 4, 10) 
3'z2:=x/y, P=(6,-3; 2) 

4 z=e**, P=(0, 0, 1) 

5 x?+y?+27=6, P=(I, 2, 1) 
6 x? + y?+227=7, P=(I, 2, 1) 
7 z= x’, P=(1, 1, 1) 

8 V=nr7h, P =(2, 2, 87). 


9 Show that the tangent plane to z*>—x?—y?=0 goes 
through the origin and makes a 45° angle with the z axis. 


10 The planes z=x+4y and z=2x+3y meet at (1, 1, 5). 
The whole line of intersection is (x, y,z)=(1, 1, 5)+ ve. 
Find yv=N, x N3. 


11 If z=3x—2y find dz from dx and dy. If z= x°/y? find dz 
from dx and dy at x» =1, yo =1. If x moves to 1.02 and y 
moves to 1.03, find the approximate dz and exact Az for both 
functions. The first surface is the to the second 
surface. 
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12 The surfaces z = x? + 4y and z = 2x + 3y? meet at (1, 1, 5). 
Find the normals N, and N, and also y=N, x No. The line 
in this direction ¥ is tangent to what curve? 


13 The normal N to the surface F(x, y, z) = 0 has components 
F,, Fy, F,. The normal line has x=xp+Fyt, y=yot Fit 
z= . For the surface xyz — 24 =0, find the tangent 
plane and norma! line at (4, 2, 3). 


14 For the surface x?y? — 2 =0, the normal line at (f, 2, 4) 
has x = y= Ses 


15 For the sphere x? + }? +2? =9, find the equation of the 
tangent plane through (2, 1, 2). Also find the equation of the 
normal line and show that it goes through (0, 0, 0). 


16 If the normat line at every potnt on F(x, ,z)=0 goes 
through (0, 0, 0), show that F, = cx, Fy =cy, F, =cz. The sur- 
face must be a sphere. 


17 For w = xy near (xp, yg), the linear approximation Is dw = 
. This looks like the rule for derivatives. 

The difference between Aw = xy ~ xp¥q and this approxima- 

tion is 

18 If f= xyz (3 independent variables) what is df? 


19 You invest P = $4000 at R=8% to make / = $320 per 
year. If the numbers change by dP and dR what is di? If the 
rate drops by dR = .002 (to 7.8%) what change dP keeps di = 
0? Find the exact interest J after those changes in R and P. 


26 Resistances R, and R, have parallel resistance R, where 
1/R = 1/R, + 1/Rz. Is R more sensitive to AR, or AR, if R; = 
{ and R, = 2? 


21 (a) If your batting average is 4 = (25 hits)/(100 at bats} = 

.250, compute the increase (to 26/101} with a hit and the 
decrease (to 25/101} with an out. 
(b) If A =x/ythendA = dx+ dy. A hit 
{dx = dy = 1) gives dA =(1 — A)/y. An out (dy = 1) gives 
dA=—A/sy, So at A =.250 a hit has times the 
effect of an out. 


22 (a) 2 hits and 3 outs (dx = 2, dy = 5) will raise your average 
(dA > 0) provided A is less than 
(b) A player batting A =.500 with p= 400 at bats needs 
dx= hits to raise his average to ,505. 


23 If x and y change by Ax and Ay, find the approximate 
change A@ in the angle @ == tan~ '{y/x}. 


24 The Fundamental Lemma behind equation (13) writes 
Af=aAx+bAy. The Lemma says that a>f.(xo, Yo) and 
b > f,(Xo, ¥o) when Ax +0 and Ay + 0. The proof takes Ax 
first and then Ay: 


(1) f(xo + AX, yo) —flxo. ¥o) = Axf(c, Yo) where cis 


between and {by which theorem?} 
(2) fixg + Ax, yo + Ay) —flxo + Ax, ii aes C) 
where C is between and 


(3)a =fAc, Yo) + fxlxo, ¥o) provided fy is. 
(4) b =flxo + Ax, C) +f(xo, ¥o) provided f,is 


25 If the supplier reduces s, Figure 13.10 shows that P 
decreases and @ 
(a) Find P, = 50/3 and P, = 1/3 tn the economics equation 
(17) by solving the equations above it for @, and Q,, 
(b} What is the linear approximation to @ around s= 4, 
1=10, P= 30, Q = 50? 


26 Solve the equations P = —.29 +40 and P=sQ +t for P 
and Q. Then find éP/és and é@P/ct explicitly. At the same 
s, t, P,Q check 50/3 and 1/3. 


27 If the supply =demand equation (16) changes to P= 
sQ4+£=—0+50, find P, and P, ats=1,¢1=10. 


28 To find out how the roots of x? + bx +c =0 vary with 6, 
take partial derivatives of the equation with respect to 

. Compare dx/éb with @x/@c to show that a root at 
x =2 Is more sensitive to 6. 


29 Find the tangent planes to z = xy and 2=x? — y? at x= 
2, y= 1. Find the Newton point where those planes meet the 
xy plane (set z= 0 in the tangent equations). 


30 (a) To solve g(x, y) = 0 and A(x, y) = 01s to find the meeting 


point of three surfaces: z= g(x, y) and z=A(x, y) and 


(b} Newton finds the meeting point of three planes: the 
tangent plane to the graph of g, , and 


Problems 31-36 go further with Newton’s method for g = 
x? — y and & = yp? — x. This is Example 9 with solutions (J, 1), 
(0, 0),(-1, —1). 


31 Start from xo = 1, yo =1 and find Ax and Ay. Where are 
x, and y,, and what line is Newton's method moving on? 


32 Start from (4, $) and find the next point. This is in the 
basin of attraction of which solution? 


33 Starting from (2, —a) find Ay which is also — Ax. Newton 
goes toward (0,0). But can you find the sharp point in 
Figure 13.11 where the lemon meets the spadc? 


34 Starting from (a, 0) show that Newton’s method goes to 
(0, —2a*) and find the next point (x2, y2). Which numbers a 
lead to convergence? Which special] number a leads to a cycle, 
in which (x, yz) is the same as the starting point (a, 0)? 


35 Show that x? =, y? =x has exactly three solutions. 
36 Locate a point from which Newton’s method diverges. 


37 Apply Newton's method to a linear problem: g= 
x+2y—5=0,h =3x—3=0. From any starting point show 
that (x,, ¥,} 1s the exact solution (convergence in one step). 
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38 The complex equation (x + iy)* = 1 contains two real equ- 
ations, x*— 3xy?= 1 from the real part and 3x7y—y°>=0 
from the imaginary part. Search by computer for the basins 
of attraction of the three solutions (1, 0}, (—1/2, /3/2), and 
(-1/2, —./3/2)—which give the cube roots of 1. 


39 In Newton’s method the new guess comes from (x,, y;,) by 
an iteration: x,,, = G(x,, y,} and y,41 = A{x,, y,). What are 
G and H for g=x*-—y=0,h=x—y=( First find Ax and 
Ay; then x, + Ax gives G and y, + Ay gives H. 


40 In Problem 39 find the basins of attraction of the solution 
(0, 6) and (1, 1). 
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44 The matrix in Newton’s method is the Jacobian: 


ég/éx égiéy Ax —&, 
J= = : 
ee Shey: ee sl eke 
Find J and Ax and A} for g=e*—1, h=e*4 x. 


42 Find the Jacobian matrix at (1, 1} when ¢ =x? +»? and 
h=xy. This matrix is and Newton’s method fails. 
The graphs of g and h have tangent planes. 


43 Solve g=x?— y?+1=0 and h=2xy=0 by Newton's 
method from three starting points: (0, 2) and (—1, 1) and (2, 0). 
Take ten steps by computer or one by hand. The solution 
(0, L) attracts when yo > 0. If yp =O you should find the chaos 
iteration x,.,; =3(x, —x, |). 


As x changes, we know how f(x, y) changes. The partial derivative éf/@x treats y as 
constant. Similarly éf/éy keeps x constant, and gives the slope in the y direction. But 
east-west and north-south are not the only directions to move. We could go along a 
45° line, where Ax = Ay. In principle, before we draw axes, no direction is preferred. 
The graph is a surface with slopes in af! directions. 

On that surface, calculus looks for the rate of change (or the slope). There is a 
directional derivative, whatever the direction. In the 45° case we are inclined to divide 


Af by Ax, but we would be wrong. 


Let me state the problem. We are given f(x, y) around a point P = (xo, ¥o). We are 
also given a direction u (a unit vector), There must be a natural definition of D, f— 
the derivative of { in the direction u. To compute this slope at P, we need a formula. 
Preferably the formula is based on éf/éx and éf/@y, which we already know. 

Note that the 45° direction has u = if /2 + ji, /2. The square root of 2 is going to 
enter the derivative. This shows that dividing Af by Ax is wrong. We should divide 


by the step length As. 


EXAMPLE 4 
direction from (1, 1), what 1s Az/As? 


Stay on the surface z= xy. When (x, ) moves a distance As in the 45° 


Solution The step is As times the unit vector u. Starting from x= y= Ff the step 
ends atx=y=1+ As!,/2. (The components of uAs are As}, /2.) Then z = xy is 


2=(1 + As/./2)? =1+ ./2As + HAs)?, which means Az = \/2As + HAs). 
The ratio Az/As approaches sa as As > 0. That is the slope in the 45° direction. 


DEFINITION The derivative of { in the direction u at the point P is D, f(P): 


Af 
D ALP) = dim. ~ = lim 


AsO As 


S 


f(P+ baal boar Ua} (1) 


The step from P = (xg, yo} has length As. It takes us to (xg + w,As, yo + u2As). We 
compute the change Af and divide by As, But formula (2) below saves time. 
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The x direction is u=(1, 0). Then uAs is (As, 0) and we recover 0f/0x: 
Af _ fixo + As, Yo) ~f(Xo Yo) of 
As As 6x" 


Similarly D, f= éf/éy, when u=(0, 1) is in the y direction. What is D,f when u= 
(0, —1)? That is the negative y direction, so D, f= — of/dy. 


approaches Di, 0, f= 


CALCULATING THE DIRECTIONAL DERIVATIVE 


D,f is the slope of the surface z = f(x, y) in the direction u. How do you compute it? 
From of/éx and of/éy, in two special directions, there is a quick way to find D,f in 
all directions. Remember that u is a unit vector. 


43E The directional derivative D,f in the direction u = (u,, u2) equals 


The reasoning goes back to the linear approximation of Af: 


of of of of 
Afm = Ax ayy F u, As ae 


Divide by As and let As approach zero. Formula (2) is the limit of Af/As, as the 
approximation becomes exact. A more careful argument guarantees this limit pro- 
vided f, and f, are continuous at the basepoint (x9, yo). 

Main point: Slopes in all directions are known from slopes in two directions. 


EXAMPLE 4 (repeated) f= xy and P=(1, 1) and u=(1/,/2, 1/,/2). Find D,f(P). 
The derivatives f, = y and f, = x equal 1 at P. The 45° derivative is 


Dy fUP) = fers + fra = 1(1/./2) + 1(1/,/2) = ,/2 as before. 


EXAMPLE 2 The linear function f= 3x + y+ 1 has slope D, f= 3u, + u2. 


The x direction is u=(1, 0), and D, f= 3. That is éf/dx. In the y direction D, f= 1. 
Two other directions are special—along the level lines of f(x, y) and perpendicular: 


Level direction: D,f is zero because f is constant 
Steepest direction: D,f is as large as possible (with u? + u3 = 1). 
To find those directions, look at D, f= 3u, + uz. The level direction has 3u, + u,=0. 
Then u is proportional to (1, — 3). Changing x by 1 and y by —3 produces no change 
in f=3x+yt1. 

In the steepest direction u is proportional to (3, 1). Note the partial derivatives 


f,=3 and f,=1. The dot product of (3, 1) and (1, —3) is zero—steepest direction 
is perpendicular to level direction. To make (3, 1) a unit vector, divide by ./10. 


Steepest climb: Dy f = 3(3/,/10) + 1(1/,/10) = 10/,/10 = ,/10 
Steepest descent: Reverse to u=(— 3/,/10, =I /,/10) and D,f=—.,/10. 


The contour lines around a mountain follow D, f= 0. The creeks are perpendicular. 
On a plane like f=3x+y+1, those directions stay the same at all points 
(Figure 13.12). On a mountain the steepest direction changes as the slopes change. 
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gradient direction 
u = (3/V10, 1/10) 
D, f = N10 


steepest descent 
D,f = -N10 


level direction 
3u,;+u,=0 


Slope V10 


Fig. 13.42 Steepest direction is along the gradient. Level direction is perpendicular. 


THE GRADIENT VECTOR 


Look again at f,u, +f,u2, which is the directional derivative D,f. This is the dot 
product of two vectors. One vector is u=(u,, U2), which sets the direction. The other 
vector is (f,,f,), which comes from the function. This second vector is the gradient. 


A A 
DEFINITION The gradient of f(x, y) is the vector whose components are a and gf : 
j oy 


grad f= Vf= x i+ wl (aaa g k in three dimensions) 
ox dy 02 
The space-saving symbol V is read as “grad.” In Chapter 15 it becomes “del.” 

For the linear function 3x + y + 1, the gradient is the constant vector (3, 1). It is 
the way to climb the plane. For the nonlinear function x? + xy, the gradient is the 
non-constant vector (2x + y, x). Notice that grad f shares the two derivatives in N = 
(f.,fy, —1). But the gradient is not the normal vector. N is in three dimensions, 
pointing away from the surface z = f(x, y). The gradient vector is in the xy plane! The 
gradient tells which way on the surface is up, but it does that from down in the base. 

The level curve is also in the xy plane, perpendicular to the gradient. The contour 
map is a projection on the base plane of what the hiker sees on the mountain. The 
vector grad f tells the direction of climb, and its length |grad f| gives the steepness. 


43F The directional derivative is D,f=(grad f)-u. The level direction is per- 
pendicular to grad f, since D, f= 0. The slope D,f is largest when u is parallel to 


grad f. That maximum slope is the length |grad f| = _/f2 +f me 


r u= grad f the slope is (gradf)-u 


é _ grad f |? _ 
\grad f| 


\grad f| 


The example f= 3x+ y+ 1 had grad f= (3, 1). Its steepest slope was in the direc- 
tion u= (3, 1)/,/10. The maximum slope was ,/10. That is |grad f| = ./9 + 1. 


\grad f|. 


Important point: The maximum of (grad f)-+u is the length \grad f|. In nonlinear 
examples, the gradient and steepest direction and slope will vary. But look at one 
particular point in Figure 13.13. Near that point, and near any point, the linear 
picture takes over. 

On the graph of f, the special vectors are the level direction L = (f,, —f,, 0) and 
the uphill direction U =(f,,f,,.f2 +f?) and the normal N =(f,,f,, —1). Problem 18 
checks that those are perpendicular. 
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EXAMPLE 3 The gradient of f(x, y)=(14-—x?- y*)/3_ is Vf=(—2x/3, —2yp/3). 


On the surface, the normal vector is N = (— 2x/3, — 2y/3, ~1). At the point (1, 2, 3), 
this perpendicular is N = (— 2/3, — 4/3, —1}. At the point (1, 2) down in the base, 
the gradient is (— 2/3, — 4/3). The length of grad fis the slope ./20/3. 

Probably a hiker does not go straight up. A “grade” of ./20/3 is fairly steep (almost 
150%). To estimate the slope in other directions, measure the distance along the path 
between two contour lines. If Af= ! in a distance As = 3 the slope is about 1/3. This 
calculation is not exact until the limit of Af/As, which is D,f. 


>=(14—7?- )°y3 


tangent 
to level curve 


level 


Fig. 13.43 N perpendicular to surface and grad f perpendicular to level line {in the base). 


EXAMPLE 4 The gradient of f(x, y, z)= xy + yz + xz has three components. 


The pattern extends from f(x, y} to f(x, y, z). The gradient is now the three-dimensional 
vector (f.,f,,J/;). For this function grad fis {y + z, x +z, x+y). To draw the graph 
of w= f(x, y,z) would require a four-dimensional picture, with axes in the xyzw 
directions. 

Notice the dimensions. The graph is a 3-dimensional “surface” in 4-dimensional 
space. The gradient is down below in the 3-dimensional base. The level sets of fcome 
from xy + yz + zx = c—they are 2-dimensional. The gradient is perpendicular to that 
level set (still down in 3 dimensions). The gradient is not N! The normal vector is 
(f..f,.f- —1), perpendicular to the surface up in 4-dimensional space. 


EXAMPLES Find grad z when 2(x, y} is given implicitly: F(x, y, z)= x? + y? -— 27 =0. 


In this case we find z= +./x?+y?. The derivatives are +.x/,/x?+y? and 
+ y/./x? + y*, whieh go into grad z. But the point is this: To find that gradient faster, 


differentiate F(x, y, z) as it stands. Then divide by F,: 
F,dx + Fydy + F,dz=0 ar dz = (— F,dx — Fy dy)/F,. (3) 


The gradient is (—F,./F,, — F,/F,). Those derivatives are evaluated at (x9, Yo}. The 
computation does not need the explicit function z = f(x, y): 


F=x?+y?-2? = F,=2x, Fy=2y,F,= —2z = grad z=(x/z, y/2). 


To go uphill on the cone, move in the direction (x/z, y/z). That gradient direction 
gocs radially outward. The stccpncss of the cone is the length of the gradient vector: 


lgrad z| = ./(x/z)* + (y/z)* = t because z? = x? + y? on the cone. 
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DERIVATIVES ALONG CURVED PATHS 


On a straight path the derivative of fis D.f=(grad f)-u. What is the derivative on 
a curved path? The path direction u is the tangent vector T. So replace u by T, which 
gives the “direction” of the curve. 

The path is given by the position vector R(t) = x(t)i+ y(tj. The velocity is y= 
(dx/dt)i + (dy/dt)j. The tangent vector is T = ¥/|¥|. Notice the choice—to move at any 
speed (with y) or to go at unit speed (with T). There is the same choice for the 
derivative of f(x, y) along this curve: 

df Ofdx éfdy 

rate of change a (grad f)+¥ carrie By dt (4) 

af efdx  ofdy 

slope re (grad f)>T ox ds + 2y ds (5) 

The first involves time. If we move faster, df/dt increases. The second involves distance. 

If we move a distance ds, at any speed, the function changes by df. So the slope in 

that direction is df/ds. Chapter | introduced velocity as df/dt and slope as dy/dx and 
mixed them up. Finally we see the difference. 

Uniform motion on a straight line has R = Ry + vt. The velocity v is constant. The 
direction T = u=y/|y¥| is also constant. The directional derivative is (grad f)-u, but 
the rate of change is {grad f)-v. 


Equations (4) and (5) iook like chain rules. They are chain rules. The next section 
extends df/dt = (df/dx){dx/dt) to more variables, proving (4) and (5). Here we focus 
on the meaning: df/ds is the derivative of f in the direction u=T along the curve. 


EXAMPLE? Find df/dt and df/ds for f=, The curve is x = ¢?, y= ¢ in Figure 13.14a. 


Solution The velocity along the curve is y= 2ti+ j. At the typical point ¢= 1 it is 
y¥=2i+j. The unit tangent is T= v/,/5. The gradient is a unit vector iy/2 +j//2 
pointing outward, when f{x, y) is the distance r from the center. The dot product 
with vis df/dt = 3/,/2. The dot product with T is df/ds = 3/./10. 

When we slow down to speed | (with T), the changes 1n f(x, y) slow down too. 


EXAMPLE 8 Find df/ds for f= xy along the circular path x =cos¢, y=sin ¢. 


First take a direct approach. On the circle, xy equals {cos f}(sin £). Its derivative comes 
from the product rule: df/dt = cos*t — sin*z. Normally this is different from df/ds, 
because the time ¢ need not equal the arc length s. There is a speed factor ds/dt to 
divide by—but here the speed is 1. {A circle of length s = 2x is completed at f = 22.) 
Thus the slope df/ds along the roller-coaster in Figure 13.14 is cos*t — sin?1, 


T= OQ yy grad f= (yx) y 


f=xy>0 


{grad D|= |AD/As t= 1 


Fig. 13.44 The distance f=r changes along the curve. The slope of the roller-coaster is 
(grad f)+T. The distance D from (xq, }'9) has grad D = unit vector. 


Read-through questions 


D,f gives the rate of change of __a 
It can be computed from the two derivatives __¢ 
special directions 
_e .Thisisa_¢ 
g__, which is called the oh 
ax + by, the gradient is grad f=__ | 
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The second approach uses the vectors grad f and T. The gradient of f= xy is 
(y, x) = (sin t, cos t). The unit tangent vector to the path is T = (—sin t, cos t). Their 
dot product is the same df/ds: 


slope along path = (grad f)*T = — sin*t + cos’t. 
GRADIENTS WITHOUT COORDINATES 


This section ends with a little “philosophy.” What is the coordinate-free definition of 
the gradient? Up to now, grad f= (f,, f,) depended totally on the choice of x and y 
axes. But the steepness of a surface is independent of the axes. Those are added later, 
to help us compute. 

The steepness df/ds involves only f and the direction, nothing else. The gradient 
should be a “‘tensor’’"—its meaning does not depend on the coordinate system. The 
gradient has different formulas in different systems (xy or r@ or ...), but the direction 
and length of grad f are determined by df/ds—without any axes: 


The direction of grad f is the one in which df/ds is largest. 
The fength |grad f| is that largest slope. 


The key equation is (change in {) ~ (gradient of f) -(change in position). That is another 
way to write Af fAx + f,Ay. It is the multivariable form—we used two variables— 
of the basic linear approximation Ay ~ (dy/dx)Ax. 


EXAMPLE? D(x, y)=distance from (x, y) to (x9, yo). Without derivatives prove 
|grad D| = 1. The graph of D(x, y) is a cone with slope 1 and sharp point (xo, yo). 


FArst question In which direction does the distance D(x, y) increase fastest? 
Answer Going directly away from (x9, yg). Therefore this is the direction of grad D. 


Second question How quickly does D increase in that steepest direction? 
Answer A step of length As increases D by As. Therefore |grad D| = As/As = 1. 


Conclusion grad D is a unit vector. The derivatives of D in Problem 48 are 
(x—x9)/D and (y—y,)/D. The sum of their squares is 1, because (x — x)? + 
(y— yo)’ equals D’. 


43.4 EXERCISES 
The gradient of fix, y, z}is__s 


in the direction __b 

in the 
d__. In terms of u,,u, the formula is 
product of u with the vector 
. For the linear function f= 
and the directional 


of change of fis df/dt = 
is T, the slope of fis dffds =__v 
dff/ds is the same as _ 
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. This 1s different from the 
gradient on the surface F(x, y,z)=0, which is —(F,/F,i+ 
_t . Traveling with velocity v on a curved path, the rate 
u__. When the tangent direction . 
. Ina straight direction u,° 


derivative is Df=_ Joo k 


The gradient V/=(f,,f,) is not a vector in__!__ dimen- 
sions, it is a vector in the __m__. It is perpendicular to the 
n__ lines. It points in the direction of _o climb. Its 
magnitude |gradf| is __p . For f=x?+y? the gradient 
points__q __ and the slope in that steepest directionis_ r__. 


Compute grad f, then D, f= (grad f)-v, then D, fat P. 


1 fx, y)= x? - y? u=(,/3/2, 1/2) P=(1, 0) 


2 flx,y=3x44y+7 w= (3/5, 4/5) P=(0, 2/2) 
3 fix, y}= e* cos y u=(0, ft) P=(0, 2/2) 
4 fix, yy=y"® u=(0, —1)} P=(i, -)) 
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5 fix, y) = distance to (0, 3) a=(1, 9) P =({1, }) 
Find grad f =(f,,f,,f,) for the fanctions 6—8 from physics. 
6 1) fx? + Pr +22 (point source at the origin} 

7 In(x? + y?) (line source along z axis) 

8 fx ~ 1? + y? +27 1/, x + 1? + y? + 2? (dipole) 


9 For f= 3x? + 2y? find the steepest direction and the level 
direction at (1, 2). Compute D, fin those directions. 


10 Example 2 claimed that f=3x+ y+ 1 has steepest slope 
10. Maximize D, f= 3u, + uz = 3u, +./1—47. 
11 True or false, when f(x, y) is any smooth function: 
(a) There is a direction u at P in which Dyf=0. 
(b) There is a direction u in which D, f= grad f. 
(c) There is a direction u in which Dy f= 1. 
{d) The gradient of f(x)g{x) equals g grad f+/ grad g. 
12 What is the gradient of f(x)? (One component only.) What 
are the two possible directions u and the derivatives Df? 


What is the normal vector N to the curve y=/{x)? (Two 
components.) 


In 13-16 find the direction u in which f increases fastest at P = 
(1, 2). How fast? 


13 f(x, yk=ox+by 14 f{x, y)=smaller of 2x and y 
15 fix, yy=e*” 16 fix, y= ./5— x? — y? (careful) 


17 (Looking ahead) At a point where f(x, y) is a maximum, 
what is grad f? Describe the level curve containing the maxi- 
mum point (x, y). 


18 (a) Check by dot products that the normal and uphill and 
level directions on the graph are perpendicular: N= 


(Sy. —1), U =(f..f,57 +f?),L =(f, —fi, 0). 


{b) N is to the tangent plane, U and L are 
to the tangent plane. 
(c) The gradient is the xy projection of and also 


of , The projection of L points along the 


19 Compute the N, U, L vectors for f= 1—x-+ y and draw 
them at a poit on the flat surface. 


20 Compute N, U, L for x? + y?—z? =0 and draw them at 
a typical point on the cone. 


With gravity in the negative z direction, in what direction —U 
will water flow down the roofs 21-24? 


21 z= 2x (flat roof} 22 2=4x—3y (flat roof} 


23 z=./1—x?—y* (sphere) 24 z= — /x? + y? (cone) 
25 Choose two functions f[x, y) that depend only on x + 2y. 
Their gradients at (1, 1) are in the direction . Their 
level curves are 
26 The level curve of f= y/x through (1, 1) ts . The 
direction of the gradient must be . Check grad f. 
27 Grad fis perpendicular to 2i+j with length 1, and grad g 
is parallel to 2i+ j with length 5. Find grad f| grad g, f, and g. 
28 True or false: 

(a} If we know grad f, we know f- 

(b) The line x = y = — z is perpendicular to the plane z = 

xty. 

(c) The gradient of z = x + y lies along that line. 


29 Write down the level direction u for 0 = tan~ '(y/x) at the 
point (3, 4). Then compute grad @ and check D,@ = 0. 


30 On a circle around the origin, distance is As = rA0. Then 
d0/ds=tj/r. Verify by computing grad@ and T and 
(grad §)-T. 

31 At the point (2, 1,6) on the mountain z=9-—x—y’, 
which way is up? On the roof z=x+2y+2, which way is 
down? The roof is to the mountain. 


32 Around the point (1, —2) the temperature T=e~*"~”” has 


AT Ax + Ay. In what direction u does 
it get hot fastest? 


33 Figure A shows level curves of z = f(x, y). 
(a) Estimate the direction and length of grad fat P, Q, R. 
(b) Locate two points where grad fis parallel to i+ j. 
(c) Where is |grad f| largest? Where is it smallest? 
(d) What is your estimate of z,,,, on this figure? 


(ce) On the straight line from P to R, describe z and esti- 
mate its maximum. 
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34 A quadratic function ax? + by? + ¢x+ dy has the gradi- 
ents shown in Figure B. Estimate a,5,c,d and sketch two 
level curves. 


35 The level curves of f(x, y) are circles around (1, 1). The 
curve f=c has radius 2c. What is f{? What is grad f at (0, 0)? 


% Suppose grad fis tangent to the hyperbolas xp = constant 
in Figure C. Draw three level curves of f(x, y). Is |grad f| larger 
at P or Q? Is |grad f| constant along the hyperbolas? Choose 
a function that could be f: x? + y?, x? — p?, xy, x?p?. 


37 Repeat Problem 36, if grad fis perpendicular to the hyper- 
bolas in Figure C. 


38 Iff=0, 1, 2 at the points (0, 1), (1, 0}, (2, 1), estimate grad f 
by assuming f= Ax + By +C. 


39 What functions have the following gradients? 
(a}(2x+y,x)  (b)(e""*, —e*} (©) (y, —%) (careful) 


40 Draw level curves of f(x, p) if grad f=(y, x). 


In 41-46 find the velocity vy and the tangent vector T. Then 
compute the rate of change df/dt=grad f+-v and the slope 
dfjds = grad f+T. 


41 f=x?+y? x=t pat? 
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42 f=x x=cos2t y=sin 2¢ 

43 f=x’?—y? X=Xot2t ypoygt3t 

44 faxy x=+l y=3 

45 fain xyz x=e y=e7! z=e! 
MG f= 2x? + 3y? 427 x=t pot? z=? 


47 (a) Find @f/ds and df/dt for the roller-coaster f= xp along 
the path x =cos 2, y=sin 2t. (b) Change to f= x? + py? and 
explain why the slope is zero. 


48 The distance D from (x,y) to (1,2) has D?= 
(x — 1}? +(y — 2)". Show that @D/éx = (x — 1)/D and @D/éy = 
(y — 2)/D and |grad D| = 1. The graph of D(x, phisa 

with its vertex at 


49 If f= and grad f= (2, 3) at the point (4, 5), find the tan- 
gent plane at (4, 5). If fis a linear function, find f(x, y). 
S0 Define the derivative of f(x, y} in the direction u = (u,, u2} 


at the point P =(x9, yo). What is Af (approximately)? What 
is D,f (exactly)? 


5] The slope of f along a level curve is df/ds = =, 
This says that grad fis perpendicular to the vector 
in the level direction. 


Caiculus goes back and forth between solving problems and getting ready for harder 


problems. The first is “application,” the second looks like “theory. 


” If we minimize f 


to save time or money or energy, that is an application. If we don’t take derivatives 
to find the minimum—maybe because fis a function of other functions, and we don’t 
have a chain rule—then it is time for more theory. The chain rule is a fundamental 
working tool, because f(g({x)) appears all the time in applications. So do f(g(x, y)) and 
J((8), (4) and worse. We have to know their derivatives. Otherwise calculus can’t 


continue with the applications. 


You may instinctively say: Don’t bother with the theory, just teach me the formulas. 
That is not possible. You now regard the derivative of sin 2x as a trivial problem, 
unworthy of an answer. That was not always so. Before the chain rule, the slopes of 


sin 2x and sin x? and sin?x? 


were hard to compute from Af/Ax. We are now at the 


same point for f(x, y). We know the meaning of df/0x, but iff=rtan @ and x =rcos @ 
and y=rsin 8, we need a way to compute df/dx. A little theory is unavoidable, if the 
problem-solving part of calculus is to keep going. 


To repeat: The chain rule applies to a function of a function. In one variable that 
was fig(x)). With two variables there are more possibilities: 


L. fiz) with z= g(x, y) 


2. fix, y) with x = x(t), y= Wa) 
3. fix, y) with x = x{t, u), y= 


Find df/éx and of/dy 
Find df/dt 
yt,u) Find of/ét and éf/éu 
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All derivatives are assumed continuous. More exactly, the input derivatives like 
ég/dx and dx/dt and dx/éu are continuous. Then the output derivatives like of/dx 
and df/dt and df/du will be continuous from the chain rule. We avoid points like 
r=0 in polar coordinates—where Gr/0x = x/r has a division by zero. 


A Typical Problem Start with a function of x and y, for example x times y. Thus 
Six, y) = xy. Change x to r cos @ and y to r sin 8. The function becomes (7 cos 4) times 
(r sin 6). We want its derivatives with respect to r and 6. First we have to decide on 
its name. 

To be correct, we should not reuse the letter f. The new function can be F: 


fix, yy=xy f(r cos @, r sin 8) = (r cos Ar sin 4) = Fir, 6). 


Why not call it f(r, 9)? Because strictly speaking that is r times @! If we follow the 
tules, then f(x, y) is xy and f(r, 6) should be r@. The new function F does the right 
thing—it multiplies (r cos @)(r sin 6). But in many cases, the rules get bent and the 
letter F is changed back to f- 

This crime has already occurred, The end of the last page ought to say OF /dt. 
Instead the printer put 0f/dt. The purpose of the chain rule is to find derivatives in 
the new variables ¢ and u (or r and 8). In our example we want the derivative of F 
with respect to r. Here is the chain rule: 

OF Gf dx = aféy . . 
—=S +See + = 
or ax ar’ dyer (y)(cos 6} + (x)(sin 8) = 2r sin @ cos 9 
I substituted rsin @ and rcos@ for y and x. You immediately check the answer: 
F(r, 8) =r? cos @ sin 8 does lead to GF/ér = 27 cos 9 sin 8. The derivative is correct. 
The only incorrect thing—but we do it anyway—is to write finstead of F. 
of ox of dy 


W —=—7 A =—— +S 
Question hat is 36° nswer It is ax 00 ovae” 


THE DERIVATIVES OF f(g(x, y)) 
Here g depends on x and y, and f depends on g. Suppose x moves by dx, while y 
stays constant. Then g moves by dg = (ég/@x)dx. When g changes, f also changes: 


df = (df/dg)dg. Now substitute for dg to make the chain: df= (df/dg){8g/0x)dx. This 
is the first rule: 


EXAMPLE 4 Every f(x + cy) satisfies the 1-way wave equation df/dy = c df/dx. 


The inside function is g= x + cy. The outside function can be anything, g? or sing 
or e?. The composite function is (x + cy)* or sin(x + cy) or e****. In each separate 
case we could check that éf/éy = c éf/dx. The chain rule produces this equation in 
all cases at once, from dg/dx = 1 and ég/dy=c: 


PO ag gag ee ee 


ax dg dx dg dy dgidy dg oy @x (2) 


This is important: éf/dy = ¢ éf/éx is our first example of a partial differential equation. 
The unknown f(x, y) has two variables. Two partial derivatives enter the equation. 


413.5 The Chain Rule 


Up to now we have worked with dy/dt and erdinary differential equations, The 
independent variable was time or space (and only one dimension in space). For partial 
differential equations the variables are time and space (possibly several dimensions 
in space). The great equations of mathematical physics—heat equation, wave equa- 
tion, Laplace’s equation—are partial differential equations. 

Notice how the chain rule applies to f= sin xy. Its x derivative is y cos xy. A patient 
reader would check that fis sing and g is xy and f, is f,¢,. Prohably you are not 
so patient—you know the derivative of sin xy. Therefore we pass quickly to the next 
chain rule. Its outside function depends on two inside functions, and each of those 
depends on ¢t. We want df/dt. 


THE DERIVATIVE OF f(x(f), y(?)) 


Before the formula, here is the idea. Suppose t changes by At. That affects x and y; 
they change by Ax and Ay. There is a domino effect on f; it changes by Af. Tracing 
backwards, 

of 


of dx dy 
S + — yo = . 
‘fms Ax a Ay and Ax» aot and Ay nt 


Substitute the last two into the first, connecting Af to At. Then let At — 0: 


This is close to the one-variable rule dz/dx = (dz/dy\(dy/dx). There we could “cancel” 
dy. (We actually canceled Ay in (Az/Ay){(Ay/Ax), and then approached the limit.) 
Now At affects Af in two ways, through x and through y. The chain rule has two 
terms. If we cancel in (df/0x){dx/dt) we only get one of the terms! 

We mention again that the true name for f(x(t), Xt) is F(t) not f(t). For f(x, y, z) 
the rule has three terms: fx, + fy, +22, is f, {or better dF/dt). 


EXAMPLE 2 How quickly does the temperature change when you drive to Florida? 


Suppose the Midwest is at 30°F and Florida is at 80°F. Going 1000 miles south 
increases the temperature f(x, y) by 50°, or .05 degrees per mile. Driving straight south 
at 70 miles per hour, the rate of increase is (.05)(70) = 3.5 degrees per hour. Note how 
({degrees/mile) times {miles/hour) equals (degrees/hour). That is the ordinary chain rule 
(df/dx)(dx/dt) = (df/dt)}—there is no y variable going south. 

If the road goes southeast, the temperature is f= 30+ .05x+ .0ly. Now x(t) is 
distance south and y{#) is distance east. What is df/dt if the speed is still 70? 
soution f= Ler, FAy _ 95 . 04 


dt @xdt dydt J2 


In reality there is another term. The temperature also depends directly on t, because 
of night and day. The factor cos(2nt/24) has a period of 24 hours, and it brings an 
extra term into the chain rule: 


3 x 3 degrees/hour. 


A . a ofdx ody of 
== _ H+ YH |, 

Far fix, y, t) the chain rule is ae ae By (4) 

This is the total derivative df/dt, from all causes. Changes in x, y, ¢ all affect f£ The 


partial derivative df/ét is only one part of df/dt. (Note that dt/dt = 1.) If night and 
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day add 12 cos(2mt/24) to f, the extra term is éf/ét = — x sin(2nt/24). At nightfall that 
is -—x degrees per hour. You have to drive faster than 70 mph to get warm. 


SECOND DERIVATIVES 


What is d7f/dt?? We need the derivative of (4), which is painful. It is like acceleration 
in Chapter 12, with many terms. So start with movement in a straight line. 

Suppose x = x9 + tcos @ and y= yy) + t sin 6. We are moving at the fixed angle 6, 
with speed 1. The derivatives are x, = cos 6 and y, = sin 6 and cos”6 + sin?@ = 1. Then 
df/dt is immediate from the chain rule: 


=F. t+ hy, =f, cos O+f, sin 8. (5) 
For the second derivative f,,, apply this rule to f,. Then f,, is 
(f,). cos 8+ (f,), sin 6=(f,, cos 0+ f,, sin 8) cos 8+ (f,, cos 0+ f,, sin 6) sin 8. 
Collect terms: Su Sex 00870 + 2f,, cos 6 sin 0+ f,, sin?6. {6) 


In polar coordinates change t to r. When we move in the r direction, @ is fixed. 
Equation (6) gives f,, from f,,., f.,, fy. Second derivatives on curved paths (with new 
terms from the curving) are saved for the exercises. 


EXAMPLE 3 If /.., f-y»fyy are all continuous and bounded by M, find a bound on f,. 
This is the second derivative along any line. 

Solution Equation (6) gives | f,,| << M cos?6 + M sin 20+ M sin?@ < 2M. This upper 
bound 2M was needed in equation 13.3.14, for the error in linear approximation. 


THE DERIVATIVES OF f(x(#, u), C4, u)) 


Suppose there are two inside functions x and y, each depending on ¢ and u. When t 
moves, x and y both move: dx = x,dt and dy = y,dt. Then dx and dy force a change 
in f: df= f,dx + fdy. The chain rule for éf/ér is no surprise: 


This rule has /6t instead of d/dt, because of the extra variable u. The symbols remind 
us that uw is constant. Similarly t is constant while u moves, and there is a second 
chain rule for df/éu: fu=f.x,t+S,¥u: 


EXAMPLE 4 [n polar coordinates find f, and fgg. Start from f{x, y) = f(r cos 8, r sin 6). 


The chain rule uses the 6 derivatives of x and y: 


of _ fox Fay _ (F\_.. of 
é6 6x a6 ss ay 00 (2) r sin 8) + (2) cos 8). (8) 


The second 6 derivative is harder, because (8) has four terms that depend on @. Apply 
the chain rule to the first term @f/dx. It is a function of x and y, and x and y are 
functions of 6: 


O(P\ Of AP\ex, OPV. ng 
alse)" alge noe) Pt ae ae 
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The @ derivative of ef/éy is similar. So apply the product rule to (8): 
foo= Uax(—r sin 0) +f,,(r cos 8)](—r sin 6) +f,(—r cos 6) 
+ [f.<-r sin 8) + f,(r cos @)](r cos 8) + f\(- r sin @). (9) 


This formula is not attractive. In mathematics, a messy formula is almost always a 
signal of asking the wrong question. In fact the combination f,, + f,, is much more 
special than the separate derivatives. We might hope the same for f,, + fog, but dimen- 
sionally that is impossible—since r is a length and @ is an angle. The dimensions of 
fe and f,, are matched by f,, and f,/r and fgg/r?. We could even hope that 


1 
Lae hyy = Sen = bet 5 Sow (10) 


This equation is true. Add (5)+(6)+{9) with ¢ changed to r. Laplace’s equation 
Sux +S, = 0 is now expressed in polar coordinates: f,, + fr + fog/r? = 0. 


A PARADOX 


Before leaving polar coordinates there is one more question, It goes back to ér/éx, 
which was practically the first example of partial derivatives: 


cr 
SoS ao ke hy Rx ey Sap (11) 
7 OX 


My problem is this. We know that x is r cos @. So x/r on the right side is cos @. On 
the other hand r is x/cos @. So ér/éx is also 1/cos 0. How can ér/@x lead to cos 0 one 
way and |/cos ? the other way? 

I will admit that this cost me a sleepless night. There must be an explanation— 
we cannot end with cos 8 = 1/cos 6. This paradox brings a new respect for partial 
derivatives. May I tell you what I finaliy noticed? You could cover up the next 
paragraph and think about the puzzle first. 

The key to partial derivatives is to ask: Which variable is held constant? In 
equation (11), pis constant. But when r = x/cos 6 gave ér/@x = 1/cos 0, 8 was constant. 
In both cases we change x and look at the effect on r. Fhe movement is on a horizontal 
line (constant y) or on a radial line (constant @). Figure 13.15 shows the difference. 


Remark This example shows that ¢r/éx is different from 1/(éx/dr). The neat formula 
(cr/éx)(Ex/er) = 1 is not generally true. May I tell you what takes its place? We have 
to include (ér/éy)(¢y/er). With two variables xy and two variables r@, we need 2 by 
2 matrices! Section 14.4 gives the details: 


eriex éeridy || @xfér éx/ee | 0 
COiéx Gbs/éy || Cyfér  Ay/ed Oo 1] 


dr = dvicos 8 


dy 


ie dr = dy cos 6 


iv xtdr Xx x+dx 


Fig. 13.45 dr=dx cos @ when y is constant, dr = dxjcos ? when @ is constant. 
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NON-INDEPENDENT VARIABLES 


This paradox points to a serious problem. In computing partial derivatives of f(x, y, z), 
we assumed that x, y, z were independent. Up to now, x could move while y and z 
were fixed. In physics and chemistry and economics that may not be possible. If there 
is a relation between x, y, z, then x can't move by itself. 


EXAMPLE 5 The gas law PV =nRT relates pressure to volume and temperature. 
P, V, T are not independent. What is the meaning of @V/@P? Does it equal 1/(6P/@V)? 


Those questions have no answers, until we say what is held constant. In the paradox, 
érj/éx had one meaning for fixed y and another meaning for fixed 6. To indicate what 
is held constant, use an extra subscript (not denoting a derivative): 


(ér/dx), = cos 6 (dr/éx}, = lfcos 8. (42) 


(ofjéP), has constant volume and (éf/@P), has constant temperature. The usual 
of/aP has both V and T constant. But then the gas law won’t let us change P. 


EXAMPLE 6 Let f= 3x+2y+z. Compute of/dx on the plane z= 4x + y. 
Solution 4 Think of x and y as independent. Replace z by 4x + y: 
f=3xt2y+(4xt+y) so (6f/éx), =7. 
Solution 2 Keep x and y independent. Deal with z by the chain rule: 
(6f/éx), = Of (dx + (Of/2z)(Gz/dx) = 3 + (1){4) = 7. 
Solution 3 (different) Make x and z independent. Then y =z — 4x: 
(6f/0x), = Of/dx + (Af/ey)(y/Ex) = 3 + (2)(— 4) = — 5. 


Without a subscript, df/é@x means: Take the x derivative the usual way. The answer 
is Of/éx = 3, when y and z don’t move. But on the plane z = 4x + y, one of them must 
move! 3 is only part of the total answer, which is (df/dx), = 7 or (Of/@x), = — 5. 

Here is the geometrical meaning. We are on the plane z = 4x + y. The derivative 
{af/éx), moves x but not y. To stay on the plane, dz is 4dx. The change in f= 
3x+2y+z is df= 3dx+0+ dz=7dx. 


EXAMPLE 7 On the world line x? + y? + 2? = 2? find (af/dy),,, for f= xyzt. 


The subscripts x, z mean that x and z are fixed. The chain rule skips df/éx and 
of/daz: 


(Af/dy),.. = Offay + (Gf/de)(t/Oy) = xzt + (xyz}(y/t). Why y/t? 


EXAMPLE 8 From the law PV=T, compute the product (6P/@V)7(@V/0T)p(8T/OP),. 
Any intelligent person cancels @V’s, @T’s, and @P’s to get 1. The night answer is —1: 
(@P/eV), = — T/V? (VOT) p = 1/P (OT/2P)y = V. 

The product is — 7/PV. This is —1 not +1! The chain rule is tricky (Problem 42). 


EXAMPLE 9 Implicit differentiation was used in Chapter 4. The chain rule explains it: 
Ef F(x, y) = 0 then F,.+ Fyy, =0 so dy/dx = — F,/F,. (13) 
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13.5 EXERCISES 


Read-through questions 


The chain rule applies to a function ofa _a@__. The x deriva- 

tive of f(e(x, yp is Affeéx=__b __. The y derivative is éf/éy = 
c_. The example f=(x+y)" has g=_4_. Because 

Ggidx = ég/éy we know that _@ = +t. This _9 


differentia] cquation is satisfied by any function of x + y. 


Along a path, the derivative of f{x(t), y(t) is dfidt = __h 
The derivative of f(x(t), v(t), 2(t}) is _ |. If f= xy then the 
chain rule gives dfidt=__ i. That is the same as the __k 
rule! When x = x4,¢ and y =u,¢ the path is 1. The chain 
rule for f(x, y) gives dfjdt=_m__. Thatis the 0 deriva- 
tive Dif. 


The chain rule for f{x(t, 4), y(t, u)) is 6ffér=__o . We 
don't write dfjdt because __P _. Ifx =r cos J and y=r sin @, 
the variables t,u change to _q__. In this case @f/ér= 

rand ¢f/¢?=_ 5 _. That connects the derivatives in 

t and _u coordinates. The difference between 
érjéx = x/r and érjéx = licos @ is because _ ¥ is constant 
in the first and _ ¥___ is constant in the second. 


With a relation like xyz = 1, the three variables are __* 
independent. The derivatives (é//éx), and (f/éx)}, and (éf/éx) 
mean_¥ and 2 and _A_. For f=x?+ 5?+42? with 
xyz = 1, we compute (éf/éx), from the chain rule _B _. In 
that rule éz/¢x=_ © from the relation xyz =1, 


Find f, and /, in Problems 1-4. What equation connects them? 
L fix, y) = sin(x + cy) 2 fix, ¥) = (ax + by)!” 
3 f(x, yy= er 4 fix. y)=In(x + 7y) 
5 Find both terms in the ¢ derivative of (e(x(#). y(t))°. 


6 Iff(x, y)=xy and x = u(t) and yp = x(t), what is df/dt? Prob- 
ably all other rules for derivatives follow from the chain rule. 


7 The step function f{x) 1s zero for x <0 and one for x > Q, 
Graph f(x) and g(x) =f(x + 2) and A(x) =/f{x + 4). If fix + 22) 
Tepresents a wall of water (a tidal wave}, which way is it 
moving and how fast? 


8 The wave equation is f,, =c7/,,. (a} Show that (x + ct)” is 
a solution. (b} Find C different from ¢ so that (x + C1)" is also 
a soiution, 


9 If f= sin(x — t}, draw two lines in the xr plane along which 
jJ=0. Between those lines sketch a sine wave. Skiing on top 
of the sine wave, what is your spced dx/d?? 


10 If you float at x =0 in Problem 9, do you go up first or 
down Iirst? At time t=4 what is your height and upward 
velocity? 


11 Laplace's equation is fi. +fyy = 0. Show from the chain 
rule that any function j(x + fy) satisfies this equation if i? = 
— 1. Check that f= (x + iy)? and its real part and 
its imaginary part all satisfy Laplace’s equation. 


12 Equation (10) gave the polar form f,, + f/r +fee/r? =0 of 


Laplace’s equation. (a) Check that f= r?e?" and its real part 


r* cos 26 and its imaginary part r? sin 26 all satisfy Laplace’s 
equation. (b) Show from the chain rule that any function ftre®} 
satisfies this equation if i? = — 1. 


In Problems 13-18 find df/di from the chain rule (3). 

IB fHx?+y2 x=t, y=t? 

14 f= ty, x=4, yar? 

15 f=xy,x=1- ty=h+/t 

16 f=x/y, x =e, y=2e' 

17 f=lIn(x + y), x=e ye’ 

18 f=x*,x=ty=! 

19 If a cone grows in height by dh/dt = 1 and in radtus by 


dr/dt = 2, starting from zero, how fast ts its volume growing 
att=3? 


20 If a rocket has speed dx/dt =6 down range and dy/dt = 
2: upward, how fast is it moving away from the launch point 
at (0, 0)? How fast is the angle @ changing, if tan @ = y/x? 

21 If a train approaches a crossing at 60 mph and a car 
approaches (at right angles) at 45 mph, how fast are they 
coming together? (a) Assume they are both 90 miles from the 
crossing. {b) Assume they are going to hit. 

22 In Example 2 does the temperature increase faster if you 
drive due south at 70 mph or southeast at 80 mph? 

23 On the line x = yf. y =#2t, 5 = gt, What combination of 
fet, fi; gives dfidt? This is the directional derivative in 3D, 
24 On the same line x = u,¢, p=u 26, z= ut, find a formula 
for d7fidr?. Apply it to f= xyz. 

25 For fix, y,th=x+y+et find éf/ét and df/dt when x = 2: 
and y = 3. Explain the difference. 


26 If z=(x+y)? then x= Jz — y. Does (@z/@x)(@x/éz) = 1? 
27 Suppose x,=t and y, =2f, not constant as in (5-6). For 
f(x, y) find f, and f,. The answer involves f., fy. fax. Sey-Syy: 


28 Suppose x,=fand y,=t?. For f=(x + y)* find f, and then 
Jy from the chain rule. 

29 Derive of /ér = (6f/Gx) cos 6+ (Gf/2y) sin 8 from the chain 
rule. Why do we take @x/ér as cos 9 and not 1/cos #? 


30 Compute f,, for f(x, y)=(ax + by +c)'® Ifx=rand y= 
t compute f,,. True or false: (¢f/éx)(@x/ét) = éf/ér. 
31 Show that @r/éx? = y?/r3 in two ways: 

(1) Find the x derivative of ér/éx = x/./x? + y? 

(2) Find the x derivative of ér/@x = x/r by the chain rule, 
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32 Reversing x and y in Problem 31 gives r,,=x?/r?. But 
show that r,, = — xy/r?. 


33 Ifsinz= x + y find (éz/@x), in two ways: 
(i) Waite z =sin7 ‘(x + y) and compute its derivative. 
(2} Take x derivatives of sinz=x+ y. Verify that these 
answers, explicit and implicit, are equal. 
34 By direct computation find f, and f{, and f,, for 
f= J/x7+y’. 
35 Find a formula for 07//érd6@ in terms of the x and y deriva- 
tives of flx, y). 


36 Suppose z = f(x, y) is solved for x to give x = g(y, z). Is it 
true that 42/@x = 1/(ax/éz)? Test on examples. 


37 Suppose z = e” and therefore x = (In z)/y. Is it true or not 
that (6z/@x} = 1/(@x/éz)? 
38 If x= x(t, u,v) and y= y(t, uw, v) and z= 2(t, u, »), find the ¢ 
derivative of f(x, y, z). 
39 The t derivative of f(x(¢, 4), y(t, u)) is in equation (7). What 
is fi? 
40 (a) For f=x?+y7+2z? compute af/éx (no subscript, 
x, y, z all independent). 
{b) When there is a further relation z = x” + y?, use it to 
remove z and compute (df/éx},. 
{c) Compute (6f/éx}, using the chain rule (éf/dx)+ 
(6f/8z)(éz/Ax). 
(d) Why doesn’t that chain rule contain (6f/éy)(Gy/x)? 
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41 For f= ax + by on the plane z = 3x + Sy, find (@f/ax), and 
(af/ax), and (df/é2),. 


42 The gas law in physics is PV =nRT or a more general 
telation F(P, V, T)=0. Show that the three derivatives in 
Example 8 still multiply to give —1. First find (@P/éV); from 
OF /AV + (aF/6P\8P/éV)z = 0. 


43 If Problem 42 changes to four variables related by 
F(x, y, z, f)=0, what is the corresponding product of four 
derivatives? 


44 Suppose x =t+uand y=tu. Find the t and u derivatives 
of f(x, y). Check when f(x, y) = x? — 2y. 


48 (a) For f=r" sin?4 find f, and f,. 
(b) For f= x? + y? find f, and fy. 


46 On the curve sin x+ sin y=Q, find dy/dx and d?y/dx? by 
implicit differentiation. 


47 {horrible} Suppose f,, +f,, =9. Ifx=u+v and y=u—v 
and f(x, y) = g(u, v), find g, and g,. Show that g,, + 2,, =9. 


48 A function has constant returns to scale if ficx, cy)= 
cfix, y). When x and y are doubled so are f=./x?+y? 
and f=./xy. In economics, input/output is constant. In 
mathematics fis homogeneous of degree one. 

Prove that x of/éx + y Of/dy =flx, y), by computing the c 
derivative at c=]. Test this equation on the two examples 
and find a third example. 


49 Trae or false: The directional derivative of fir, @} in the 
direction of ug is df/é@, 


The outstanding equation of differential calculus is also the simplest: df/dx = 0. The 
slope is zero and the tangent line is horizontal. Most likely we are at the top or 
bottom of the graph—a maximum of a minimum. This is the point that the engineer 
or manager or scientist or investor is looking for—maximum stress or production 
or velocity or profit. With more variables in f(x, y) and f(x, y, z), the problem becomes 
more realistic. The question still is: Hew te locate the maximum and minimum? 

The answer is in the partial derivatives. When the graph is level, they are zero. 
Deriving the equations f,=0 and f,=0 is pure mathematics and pure pleasure. 
Applying them is the serious part. We watch out for saddle points, and also for a 
minimum at a boundary point—this section takes extra time. Remember the steps 
for f(x) in one-variable calculus: 


1. The leading candidates are stationary points (where df/dx = 0}. 
2. The other candidates are rough points (no derivative) and endpoints {a or 6). 
3. Maximum ys, minimum is decided by the sign of the second derivative. 


In two dimensions, a stationary point requires 6f/éx = 0 and df/dy = 0. The tangent 
line becomes a tangent plane. The endpoints a and 6 are replaced by a boundary 
curve. In practice boundaries contain about 40% of the minima and 80% of the work. 
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Finally there are three second derivatives f,., f,y, and f,,. They tell how the graph 
bends away from the tangent plane—up at a minimum, down at a maximum, both 
ways at a saddle point. This will be determined by comparing (f,x)(f,y) with (f,,)’. 


STATIONARY POINT — HORIZONTAL TANGENT — ZERO DERIVATIVES 


Suppose f has a minimum at the point (x9, yo). This may be an absolute minimum or 
only a local minimum. In both cases f(x9, Yo) <f(x, y) near the point. For an absolute 
minimum, this inequality holds wherever f is defined. For a local minimum, the 
inequality can fail far away from (xo, yo). The bottom of your foot is an absolute 
minimum, the end of your finger is a local minimum. 

We assume for now that (x9, yo) is an interior point of the domain of f. At a 
boundary point, we cannot expect a horizontal tangent and zero derivatives. 

Main conclusion: At a minimum or maximum (absolute or local) a nonzero deriva- 
tive is impossible. The tangent plane would tilt. In some direction f would decrease. 
Note that the minimum point is (Xo, yo), and the minimum value is f(xo, Vo). 


43) If derivatives exist at an interior minimum or maximum, they are zero: 
of/ex=0 and odf/éy=0 (together this is grad f= 0). (1) 


For a function f(x, y, z) of three variables, add the third equation df/éz = 0. 


The reasoning goes back to the one-variable case. That is because we look along the 
lines x = Xo and y= yo. The minimum of f(x, y) is at the point where the lines meet. 
So this is also the minimum along each line separately. 

Moving in the x direction along y = yo, we find éf/éx = 0. Moving in the y direction, 
of/éy =0 at the same point. The slope in every direction is zero. In other words 
grad f= 0. 

Graphically, (xo, yo) is the low point of the surface z = f(x, y). Both cross sections 
in Figure 13.16 touch bottom. The phrase “‘if derivatives exist” rules out the vertex 
of a cone, which is a rough point. The absolute value f= |x| has a minimum without 
df/dx = 0, and so does the distance f= r. The rough point is (0, 0). 


1 
fixed at — 

Pe oe y fixed at = 
+ 
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----/" 


- “kb x fixed at 


Vile if 


Ie § 
saa 4%, Yo) = Gs >) 
Fig. 13.16 of/6x=0 and of/éy =0 at the minimum. Quadratic f has linear derivatives. 
EXAMPLE 4 Minimize the quadratic f(x, y)=x?+xy+y?—x-—y#l. 


To locate the minimum (or maximum), set f, = 0 and f, = 0: 


f,=2x+y-1=0 and f,=x+2y—1=0. (2) 
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Notice what’s important: There are two equations for two unknowns x and y. Since f 
is quadratic, the equations are linear. Their solution is xg = 4, yo = 4 (the stationary 
point). This is actually a minimum, but to prove that you need to read further. 

The constant 1 affects the minimum value f= 4—~but not the minimum point. The 
graph shifts up by 1. The linear terms — x — y affect f, and f,. They move the minimum 
away from (0, 0). The quadratic part x? + xy + y* makes the surface curve upwards. 
Without that curving part, a plane has its minimum and maximum at boundary 
points. 


EXAMPLE 2 (Steiner's problen\ Find the point that ts nearest to three given points. 


This example is worth your attention. We are locating an airport close to three cities. 
Or we are choosing a house close to three jobs, The probicm is to get as near as 
possible to the corners of a triangle. The best point depends on the meaning of “near.” 

The distance to the first corner (x,, 3) is d; = ./(x—x,)* + (vy — y;)*. The dis- 
tances to the other corners (x, yz) and (x3, 3) are d, and d;. Depending on whether 
cost equals (distance) or (distance)* or (distance)’, our problem will be: 


Minimize d,+d,t+d, ov dj+di+d% oreven d?+d§+dh. 
The second problem is the easiest, when d7 and d3 and d3 are quadratics: 
fx =e P EU AP He 2)? + (9 — 2? + P+ ya)? 


Offéx = ax—x,+x—xgtx—x3]=0 8 Off@y=2Ay-—y, + y—y2ty— yz] =. 


Solving éf/éx = 0 gives x = 3(x, + x. + x3). Then éfiéy = 0 gives y= 4(y, + yo + 3a). 
The best point is the centroid of the triangle (Figure 13.17a). It is the nearest point 
to the corners when the cost is (distance)”. Note how squaring makes the derivatives 
linear. Least squares dominates an enormous part of applied mathematics. 


VHX) + ty +4 1/3 
YSyp + Ys + ¥31/3 Us ye? e2 


\ di +d, +d, min 


uy 


(Sige) 
Fig. 13.47. The centroid minimizes d? + d3 + d3. The Steiner point minimizes ¢, +d, +d. 


The real “Steiner problem” is to minimize f(x, y) = d, + d, + d,. Weare laying down 
roads from the corners, with cost proportional to length. The equations f, = 0 and 
f,=9 look complicated because of square roots. But the nearest point in 
Figure 13.17b has a remarkable property. which you will appreciate. 

Calculus takes derivatives of d? =(x—x,)?+({y'— y,)?. The x derivative leaves 
2d ,(@d,/@x) = 2(x — x,). Divide both sides by 2¢,: 


Cd,  x-Xx, ed, YTS 


XY7X b em, 
and =-—+ so prad d, = } ot}, 3 
md -(FEGR) a 
This gradient is a unit vector. The sum of (x — x,)?/d? and () — ¥,)?:d? is d?/d? = 1. 
This was already in Section 13.4: Distance increases with slope | away from the 
center. The gradient of d, (call it u,) is a unit vector from the center point (x,, 3+). 


ex d, cy dy 


13.6 Maxima, Minima, and Saddle Points 


Similarly the gradients of d, and d, are unit vectors u, and u;. They point directly 
away from the other corners of the triangle. The total cost is f(x, y)= d, +d, + ds, 
so we add the gradients. The equations f,=0 and f,=0 combine into the vector 
equation 

grad f=u, + u,+u,=0 at the minimum. 


The three unit vectors add to zero! Moving away from one corner brings us closer to 
another. The nearest point to the three corners is where those movements cancel. 
This is the meaning of “grad f= 0 at the minimum.” 

It is unusual for three unit vectors to add to zero—this can only happen in one 
way. The three directions must form angles of 120°. The best point has this property, 
which is repeated in Figure 13.18a. The unit vectors cancel each other. At the “Steiner 
point,” the roads to the corners make 120° angles. This optimal point solves the 
problem, except for one more possibility. 


120° 


(Xp. 3) 


Aig. 13.48 Gradients u, +u,+u;=0 for 120° angles. Corner wins at wide angle. Four 
corners. In this case two branchpoints are better—still 120°. 


The other possibility is a minimum at a rough point. The graph of the distance 
function @,(x, y) is a cone. It has a sharp point at the center (x,, y,). All three corners 
of the triangle are rough points for d, + d, + d3, $0 all of them are possible minimizers. 

Suppose the angle at a corner exceeds 120°. Then there is no Steiner point. Inside 
the triangle, the angle would become even wider. The best point must be a rough 
point—one of the corners. The winner is the corner with the wide angle. In the figure 
that means d, = 0. Then the sum ¢, + d, comes from the two shortest edges. 


Summary The solution is at a 120° point or a wide-angle corner. That is the theory. 
The real problem is to compute the Steiner point—which | hope you will do. 


Remark 1 Steiner’s problem for four points is surprising. We don’t minimize 
d,+d,+d,+d,—there is a better problem. Connect the four points with roads, 
minimizing their total length, and allow the roads to branch. A typical solution is in 
Figure 13.18c. The angles at the branch points are 120°. There are at most two branch 
points (two less than the number of corners). 


Remark 2 For other powers p, the cost is (d,)? + (d,)? + (d3)’. The x derivative is 
Gf [8x = p(d,P~ (x — x1) + plz)?“ *{x — x2) + pds)? 7(x — xs). {4) 


The key equations are still 4//6x = 0 and éf/éy = 0. Solving them requires a computer 
and an algorithm. To share the work fairly, I will supply the algorithm (Newton’s 
method) if you supply the computer. Seriously, this is a terrific example. It is typical 
of real problems—we know éf/éx and éf/cy but not the point where they are zero. 
You can calculate that nearest point, which changes as p changes. You can also 
discover new mathematics, about how that point moves. I will collect all replies I 
receive to Problems 38 and 39. 
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MINIMUM OR MAXIMUM ON THE BOUNDARY 


Steiner’s problem had no boundaries. The roads could go anywhere. But most appli- 
cations have restrictions on x and y, like x 2Oor y <0 or x*+ y? > 1. The minimum 
with these restrictions ts probably higher than the absolute minimum. There are three 
possibilities: 


(1} stationary point f, = 0, f, =0 {2) rough point (3) boundary point 


That third possibility requires us to maximize or mmuimize f(x, y) along the boundary. 


EXAMPLE 3 Minimize f(x, y)=x?+xy+y*-—x—y +] in the half-plane x > 0. 
The minimum in Example | was 3. It occurred at xg = 3, ¥o = 4. This point is still 
allowed. It satisfies the restriction x 20. So the minimum is not moved. 

EXAMPLE 4 Minimize the same f(x, y) restricted to the lower half-plane y <0, 


Now the absolute minimum at (4, 4) is not allowed. There are no rough points. We 
look for a minimum on the boundary line »=0 in Figure 13.19a. Set y= 0, so f 
depends oniy on x. Then choose the best x: 


fix, 0) =x? +0-x-0+1 and =f, =2x-1=0. 


The minimum is at x = 3 and y= 0, where f= 7. This is up from §. 


foxtoxel 
when y =0 


Leia 


not allowed 


ve, coe 
+ fin = 


Fig. 143.49 The boundaries y = 0 and x? +)? = 1 contain the minimum points. 


EXAMPLES Minimize the same f(x, y) on or outside the circle x* + y* = 1. 


One possibility is f, = 0 and f,=Q. But this is at (4. 3), inside the circle. The other 
possibility is a minimum at a boundary point, oa the circle. 

To follow this boundary we can set y= ./1 — x*. The function f gets complicated, 
and df/dx is worse. There is a way to avoid squarc roots: Set y= cost and y=sin?. 
Then f= x? + xy + 3? ~x-— y+ 1 isa function of the angle f: 


fit}=1+ cost sin tr—cost—sint+ 1 
dfidt = cos*t — sin?r + sin tf — cos tf = (cos ¢— sin f\(cos t+ sin £— 1). 


Now dfdt = 0 locates a minimum or maximum along the boundary. The first factor 
(cos f — sin 2) is zero when x = }. The second factor is zero when eos 1+ sin? = 1, or 
x + y= 1. Those points on the circle are the candidates. Problem 24 sorts them out, 
and Section (3.7 finds the minimum in a new way using “Lagrange multipliers.” 


13.6 Maxima, Minima, and Saddle Points 


Minimization on a boundary is a serious problem—it gets difficult quickly—and 
multipliers are ultimately the best solution. 


MAXIMUM VS. MINIMUM VS. SADDLE POINT 


How to separate the maximum from the minimum? When possible, try all candidates 
and decide. Compute f at every stationary point and other critical point (maybe also 
out at infinity), and compare. Calculus offers another approach, based on second 
derivatives. 

With one variable the second derivative test was simple: f,, >0 at a minimum, 
fc, = 9 at an inflection point, f,, <0 at a maximum. This is a local test, which may 
not give a global answer. But it decides whether the slope is increasing (bottom of 
the graph) or decreasing (top of the graph). We now find a similar test for f(x, y). 

The new test involves all three second derivatives. It applies where f,=0 and 
f, = 0. The tangent plane is horizontal. We ask whether the graph of f goes above or 
below that plane. The tests f,, > 0 and f,,>0 guarantee a minimum in the x and y 
directions, but there are other directions. 


EXAMPLE 6 f(x, y)=x? + 10xy + y? has f.,=2, fry = 10, fy = 2 (minimum or not?) 


All second derivatives are positive—but wait and see. The stationary point is (0, 0), 
where 6f/éx and éf/éy are both zero. Our function is the sum of x? + y?, which goes 
upward, and 10xy which has a saddle. The second derivatives must decide whether 
x? + y? or 10xy is stronger. 

Along the x axis, where y = 0 and f= x’, our point is at the bottom. The minimum 
in the x direction is at (0, 0). Similarly for the y direction. But (0, 0) is not a minimum 
point for the whole function, because of 10xy. 

Try x= 1, y= — 1. Then f= 1-— 10+ 1, which is negative. The graph goes below 
the xy plane in that direction. The stationary point at x = y=0 is a saddle point. 


a<Q ac>b? 


Fig. 13.20 Minimum, maximum, saddle point based on the signs of a and ac — b?. 


EXAMPLE 7 f(x, y)=x?+xy+y? has f,,=2, fry = 1, fy, = 2 (minimum or not?) 


The second derivatives 2, 1, 2 are again positive. The graph curves up in the x and y 
directions. But there is a big difference from Example 6: f,, is reduced from 10 to 1. 
It is the size of f,, (not its sign!) that makes the difference. The extra terms —x — y+ 4 
in Example 1 moved the stationary point to (4, 4). The second derivatives are still 
2, 1, 2, and they pass the test for a minimum: 


43K At (0, 0) the quadratic function f(x, y) = ax? + 2bxy + cy? has a 


a>0 a<0 


minimum if maximum if saddle point if ac < b?. 
ac > b? ac > b? 


509 


510 


13 Partial Derivatives 
For a direct proof, split f(x, y) into two parts by “completing the square:” 


ac—b? , 


b 2 
ax? + 2bxy + cy? =alx+-y] + 
a a 


That algebra can be checked (notice the 25). It is the conclusion that’s important: 


if a>0 and ac> b?, both parts are positive: minimum at (0, 0) 
if a<0 and ac> b*, both parts are negative: maximum at (0, 0) 


if ac < b*, the parts have opposite signs: saddle point at (0, 0). 


Since the test involves the square of b, its sign has no importance. Example 6 had 
b=5 and a saddle point. Example 7 had b=4 and a minimum. Reversing to 
—x?—xy— y? yields a maximum. So does — x? + xy— y?. 

Now comes the final step, from ax? + 2bxy + cy? to a general function f(x, y). For 
all functions, quadratics or not, it is the second order terms that we test. 


EXAMPLE 8 f(x, y)=e*—x-—cos y has a stationary point at x = 0, y=0. 


The first derivatives are e* — 1 and sin y, both zero. The second derivatives are f.,. = 
e*=1 and f,, =cos y= 1 and f,, =0. We only use the derivatives at the stationary 
point. The first derivatives are zero, so the second order terms come to the front in 
the series for e* — x — cos y: 


(ltx+4x?+--)—x—(1—4y? + ---)=4x? + 4y? + higher order terms. (7) 


There is a minimum at the origin. The quadratic part 4x? + $y? goes upward. The x* 
and y* terms are too small to protest. Eventually those terms get large, but near a 
stationary point it is the quadratic that counts. We didn’t need the whole series, 
because from f,, =f,, = 1 and f,, =0 we knew it would start with $x? + $y?. 


43L The test in 13K applies to the second derivatives a=f,,, b=f,,,¢ =fyy 


of any f(x, y) at any stationary point. At all points the test decides whether the 
graph is concave up, concave down, or “indefinite.” 


EXAMPLE 9 f(x, y) = e** has f, = ye*” and f, = xe*’. The stationary point is (0, 0). 


The second derivatives at that point are a=/f,,=0, b=f,,= 1, and c=f,, = 0. The 
test b* > ac makes this a saddle point. Look at the infinite series: 


eW=1L+xyt4ixty? te, 


No linear term because f, =f, = 0: The origin is a stationary point. No x? or y term 
(only xy): The stationary point is a saddle point. 

At x=2, y= —2 we find f,,f,y > (f,,)?. The graph is concave up at that point— 
but it’s not a minimum since the first derivatives are not zero. 

The series begins with the constant term—not important. Then come the linear 
terms—extremely important. Those terms are decided by first derivatives, and they 
give the tangent plane. It is only at stationary points—when the linear part disappears 
and the tangent plane is horizontal—that second derivatives take over. Around any 
basepoint, these constant-linear-quadratic terms are the start of the Taylor series. 
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THE TAYLOR SERIES 


We now put together the whole infinite series. It is a “Taylor series’ —which means 
it is a power series that matches all derivatives of f (at the basepoint). For one 
variable, the powers were x" when the basepoint was 0. For two variables, the 
powers are x” times y” when the basepoint is (0, 0). Chapter 10 multiplied the nth 
derivative d"f/dx" by x"/n! Now every mixed derivative (0/0x)"(0/dy)"f(x, y) is computed 
at the basepoint (subscript 9). 

We multiply those numbers by x"y"/n!m! to match each derivative of f(x, y): 


43M When the basepoint is (0, 0), the Taylor series is a double sum 2X a,,,x"y™. 
The term a,,,x"y" has the same mixed derivative at (0, 0) as f(x, y). The series is 


af ef) .(f 
f(0, n+x(2) + z) + (2 i) + ut >(53) 


xy (ers 
4 ee nlm nim! (44). 


The derivatives of this series agree with the derivatives of f(x, y) at the basepoint. 


The first three terms are the linear approximation to f(x, y). They give the tangent 
plane at the basepoint. The x? term has n= 2 and m=0, so n!m! = 2. The xy term 
has n= m= 1, and n!m! = 1. The quadratic part (ax? + 2bxy + cy”) is in control when 
the linear part is zero. 


EXAMPLE 10 All derivatives of e**” ~as one at the origin. The Taylor series is 


pees 

evaltxtytttayt hee =r 

This happens to have ac = b?, the special case that was omitted in 13M and 43N. 

It is the two-dimensional version of an inflection point. The second derivatives fail to 

decide the concavity. When f,.f,, =(fx,), the decision is passed up to the higher 

derivatives. But in ordinary practice, the Taylor series is stopped after the quadratics. 

If the basepoint moves to (x9, Yo), the powers become (x — Xg)"(y — Yo)"—and all 
derivatives are computed at this new basepoint. 


Final question: How would you compute a minimum numerically? One good way is 
to solve f,=0 and f,=0. These are the functions g and h of Newton’s method 
(Section 13.3). At the current point (x,, y,), the derivatives of g=/, and h=/f, give 
linear equations for the steps Ax and Ay. Then the next point x, ,, = x, + Ax, y,4, = 
y, + Ay comes from those steps. The input is (x,, y,), the output is the new point, 
and the linear equations are 


(g,)Ax 7 (g,)Ay Sas &(Xns Yn) é? fx )Ax =u (fs Ay =f (Kes Yn) 
(h,)Ax + (hyJAy = — A(Xqs Yn) (fey)Ax + (fy AY = —fy%n> In): 


When the second derivatives of f are available, use Newton’s method. 

When the problem is too complicated to go beyond first derivatives, here is an 
alternative—steepest descent. The goal is to move down the graph of f(x, y), like a 
boulder rolling down a mountain. The steepest direction at any point is given by the 
gradient, with a minus sign to go down instead of up. So move in the direction Ax = 
—s Of/dx and Ay= — s of/éy. 


(5) 
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The question is: How far to move? Like a boulder, a steep start may not aim 
directly toward the minimum. The stepsize s is monitored, to end the step when the 
function f starts upward again (Problem 54). At the end of each step, compute first 
derivatives and start again in the new steepest direction. 


13.6 EXERCISES 


Read-through questions 


A minimum occurs ata 9 __ point (where f, =f, =90) or a 

6 ~point (no derivative) or a _¢ point. Since f= 
x*?—xy+2y has f,=_4 and f,=_@_, the stationary 
point is x=_f  , y=_g__. This is not a minimum, 
because f decreases when __h 


The minimum of d?=(x—x,)? +(j—¥,)* occurs at the 
rough point __! _. The graph of d is a __J_ sand gradd 
isa __k& vector that points _! . The graph of f= |x) 
touches bottom along the lines _™ . Those are “rough 
lines" because the derivative 2 _. The maximum of d and 
f Must occur on the _ 9 __ of the allowed region because it 
doesn’t occur __P 


When the boundary curve is x = x(t), ¥ = )X¢), the derivative 
of f(x, ») along the boundary is _@_ (chain rule). If f= 
x° + 2)? and the boundary is x =cos f, y =sin 4s, then dfidt = 

r__. It 1s zero at the poiots__s_ _. The maximum is at 

t_ and the minimum is at __¥__. Inside the circle f has 
an absolute ininimum at _ ¥ 


To separate maximum from minimum from __¥ _. com- 
pute the __™* derivatives ata _ ¥ point. The tests for a 
minimum are __2 _. The tests for a maximum are _A__. In 
caseac<_ 8 orfixfyy<_©& . we havea 0. At all 
points these tests decide between concave up and _E and 
“indefinite.” For f= 8x? — 6xy + y”, the originisa__F _. The 
signs of fat (1, O} and (1, 3) are _ & 


The Taylor sertes for f(x, ¥) begins with the six terms __H 
The coefficient of x"y" is __|__. To find a stationary point 
numerically, use__J or __K 


Find all stationary points ( /, =, = 0) in 1-16. Separate mini- 
mum from maximum from saddle point. Test 43K applies to 
a = fess b = fieys ¢ =fyy- 


1 x? + 2xy 4 3y? 2 xy-xty 

3 xi + 4xypt 3y? -—6x—12p 4 Pay tay 

§ x?y?—x 6 xe" —e* 

7 —x* + Ixy — 3y? R (xt yP +(x + 2y- 6" 
9x 4y? 427-42 10 (x + yx + 2y- 6) 
WH (x—sP 12 (1+ x2) + ¥7) 


13 (x+y)? —(x + 23) 14 sin x —cos y 


1S xP ty? 3x7 43)? 16 8xy— x7 y4# 


17 A rectangle has sides on the x and ¥ axes and a corner on 
the line x + 3y = 12. Find its maximum area. 


18 A box has a corner at (0, 0, 0) and all edges parallel to the 
axes. If the opposite corner (x,y¥,z} is on the plane 
3x +2) +2= 1, what position gives maximum volume? Show 
first that the problem maximizes xv — 3x7¥— 2xy". 


19 (Straight line fit, Section 11.4) Find x and y to minimize 
the error 

E=(x+yP +(x t+ 2y— 5p 4 (x 4+ 3p —4)?. 
Show that this gives a minimum not a saddle point. 


20 (Least squares) What numbers x, \' come closest to satisfy- 

ing the three equations x--=1, 2x+y=—I1,x+2y=1? 

Square and add the errors, (x—y— 1)? + + 
. Then minimize. 


21 Minimize f= x? + xy +3? —x—y restricted by 
{a)x <0 (b) +2 1 (c) x <O and p>]. 


22 Minimize f= x7 + y? + 2x + 4¥ in the regions 
(a) all x. 1 (b) 7 20 (c) x20, 130 


23 Maximize and minimize f=x +. /3y on the circle x= 
cost. y=sing, 


24 Example 5 followed f= x? + xy + y?-—x—y+1] around 
the circle x? ~ y? = 1. The four stationary points have x = y 
or x+y=1. Compute f at those points and locate the 
minimum. 


25 {a) Maximize f= ax + by on the circle x? + y? = 1. 
(b) Minimize x? + y? on the line ax + by = 1. 


26 For fix, y)=4x* ~ xy +4y*, what are the equations /, = 
0 and f, = 0? What are their solutions? What is fin? 


27 Choose ¢ > 0 so that f=x? 4 x+y? has a saddle point 
at (0, 0). Note that f> 0 on the lines x = 0 and y =Q and } = 
x and y= — x, so checking four directions does not confirm 
a minimum. 


Problems 28-42 minimize the Steiner distance f=, + d, +d; 
and related functions. A computer is needed for 33 and 36-39, 


28 Draw the triangle with corners at (0, 0), (1, 1}, and (1, —1). 
By symmetry the Steiner point will be on the x axis. Write 
down the distances d,,d).d, to (x,0) and find the x that 
minimizes d, + d,+d,. Check the 120° angles. 
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29 Suppose three unit vectors add to zero. Prove that the 
angles between them must be 120°. 


3% In three dimensions, Steiner minimizes the total distance 
flx ».2)=4,+4,+4,;+4, from four points. Show that 
grad d, is still a unit vector (in which direction?) At what 
angles do four unit vectors add to zero? 


31 With four points in a plane, the Steiner problem allows 
branches (Figure 13.18c). Find the shortest network connect- 
ing the corners of a rectangle, if the side lengths are (a) 1 and 
2 (b) I and f (two solutions for a square) (c} I and 0.1. 


32 Show that a Steiner point (120° angles) can never be out- 
side the triangle. 


33 Write a program to minimize f(x, y}=d,+d,+4 3 by 
Newton’s method in equation (5). Fix two corners at (0, 0), 
(3, 0), vary the third from (I, ft} to (2, 1) to (3, 1) to (4, t), and 
compute Steiner points. 


34 Suppose one side of the triangle goes from (—1, 0) to (1, 0). 
Above that side are points from which the lines to (1, 0) and 
(1, 0) meet at a 120° angle. Those points lie on a circular arc— 
draw it and find its center and its radius. 


35 Continuing Problem 34, there are circular arcs for ali three 
sides of the triangle. On the arcs, every point sees one side of 
the triangle at a 120° angle. Where is the Steiner point? 
(Sketch threc sides with their arcs.} 


% Invent an algorithm to converge to the Steiner point based 
on Probiem 35. Test it on the triangles of Problem 33. 


37 Write a code to minimize f=d7+d4+d$ by solving f, = 
and f,=0. Use Newton’s method in equation (5). 


38 Extend the code to allow all powers p> 1, not only p= 
4. Follow the minimizing point from the centroid at p= 2 to 
the Steiner point at p= 1 (try p= 1.8, 1.6, 1.4, 1.2). 


39 Follow the minimizing point with your code as p increases: 
p=2, p=4, p=8, p= 16. Guess the limit at p = oo and test 
whether it is equally distant from the three corners. 


40 At p=oo we are making the largest of the distances 
d,,d,,d3 as smal] as possible. The best point for a 1, t, J/2 
right triangle is : 


41 Suppose the road from corner 1 is wider than the others, 
and the total cost is fx, y)=./2 d; +d, +dy. Find the gradi- 
ent of f and the angles at which the best roads meet. 


42 Solve Steiner’s problem for two points. Where is d, + 4, 
a minimum? Solve also for three points if only the three 
corners are allowed. 


Find all derivatives at (0, 0). Construct the Taylor series: 
43 fix, yy=(x+ yy 44 fix, y) = xe 
45 fix, y)=ln(1 — xy) 


Find f,, f,, Sex Sey» fyy at the basepoint. Write the quadratic 
approximation to f(x, »y) — the Taylor series through second- 
order terms: 


46 f=e*” at (0, 0) 47 foe? at (1, 1) 
48 f=sin x cos y at (0,0) 49 f=x7+y at 1, —D 


50 The Taylor series around (x, y) is also written with steps 
hand k: fix+h yt+h=flax,y)t+h +k + 
Sh? +hk +. Fill in those four blanks. 


§1 Find lines along which f(x, y) is constant (these functions 
have f,,f,y =f 2, or ac = b*): 
fa)f=x?—4xy+4y? — (d) f=ere? 


52 For fix, y, z) the first three terms after /(0, 0, 0) in the ae 
lor series are . The next six terms are 


§3 (a) For the error f—/f, in linear approximation, the Taylor 
series at (0, 0) starts with the quadratic terms ; 
(b} The graph of f goes up from its tangent plane {and 
fof it . Then fis concave upward. 
(c} For (0, 0) to be a minimum we also need 


54 The gradient of x?+2y? at the point (1, 1} is (2, 4). 
Steepest descent is along the line x = 1 — 2s, y = 1 — 4s (minus 
sign to go downward). Minimize x? + 2y? with respect to the 
stepsize s. That locates the next point , where 
steepest descent begins again. 


55 Newton's method minimizes x? + 2y? in one step. Starting 
at (xp, ¥o) =(I, 1}, find Ax and Ay from equation (5). 


56 If f,. +, =0, show that f(x, y) cannot have an interior 
maximum ot minimum (only saddle points). 


57 The value of x theorems and y exercises is f= x*y (maybe). 
The most that a student or author can deal with ts 4x + y= 
12. Substitute y = 12 — 4x and maximize {. Show that the line 
4x + y= 12 is tangent to the level curve x*y =fi,,;- 


58 The desirability of x houses and y yachts is f(x, y} The 
constraint px + gy =k limits the money available. The cost of 
a house is , the cost of a yacht is . Substi- 
tute y=(k—px)/q into f(x, y) = F(x) and use the chain rule 
for dF/dx, Show that the slope —f,/f, at the best x 1s — p/q. 
59 At the farthest point in a basebail fieid, explain why the 


fence is perpendicular to the line from home plate. Assume 
it is not a rough point (corner) or endpoint (foul line). 
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This section faces up to a practical problem. We often minimize one function f(x, y) 
while another function g(x, y) is fixed. There is a constraint on x and y, given by 
g(x, y) =k. This restricts the material availabie or the funds available or the energy 
available. With this constraint, the problem is to do the best possible (fia, OF {nin 

At the absolute minimum of f(x, y), the requirement g{x, y) = k is probably violated. 
In that case the minimum point is not allowed. We cannot use f, = 0 and f, = 0— 
those equations don’t account for g. 


Step 1 Find equations for the constrained minimum ot constrained maximum. They 
will involve f, and f, and also g, and g,, which give local information about f and g. 
To see the equations, look at two examples. 


EXAMPLE 4 Minimize f= x? + y? subject to the constraint g = 2x + p=k. 


Trial runs The constraint allows x = 0, y= k, where f= k?. Also (3k, 0) satisfies the 
constraint, and f= fk? is smaller. Also x = y = 4k gives f= $k? (best so far). 


Idea of solution Look at the level curves of f(x, y) in Figure 13.21. They are circles 
x? + y* =c. When ¢ is small, the circles do not touch the line 2x + y =k. There are 
no points that satisfy the constraint, when c is too small. Now increase c. 
Eventually the growing circles x? + y? =c will just touch the line x + 2y=k. The 
point where they touch is the winner. It gives the smallest value of ¢ that can be 
achieved on the line. The touching point is (x,;,, ¥min), amd the value of ¢ is fiin. 


What equation describes that point? When the circle touches the line, they are 
tangent. They have the same slope. The perpendiculars to the circle and the line go in 
the same direction, That is the key fact, which you see in Figure 13.21a. The direction 
perpendicular to f= c is given by grad f= (/f,, f,). The direction perpendicular to g = 
k is given by grad g=(g,,g,). The key equation says that those two vectors are 
parallel. One gradient vector is a multiple of the other gradient vector, with a multi- 
plier 4 (called lambda) that is unknown: 


43N At the minimum of f(x, y) subject to g(x, y)=k, the gradient of f is 
parallel to the gradient of g—with an unknown number 4 as the multiplier: 


grad f=Agradg so 


Step2 There are now three unknowns x, y, A. There are also three equations: 


af/éx = 4ag/éx is 2x = 2A 
af/dy = Aég/dy is 2y=4 (2) 
a(x, Hk is 2xty=k. 


In the third equation, substitute 24 for 2x and 44 for y. Then 2x + y equals $4 
equals k. Knowing 4 = $k, go back to the first two equations for x, y, and fiin! 


2 1, 1 2 Le 5 1 
=i=ok, y=sh=sk, fain=( ck) +[ 2k) = Sk? = ck. 
xm hm ek ym adm oh Soin (2) (3+) 5" 5 
The winning point (Xin: Ymin) is ($k, 4k). It mimimizes the “distance squared,” 
f=x? + y? =4k?, from the origin to the line. 


43.7 Constraints and Lagrange Mulipliers 


Question What is the meaning of the Lagrange multiplier 1? 


Mysterious answer The derivative of 4k? is 4k, which equals 1. The muttiplier 
A is the derivative of f,,,, with respect to k. Move the line by Ak, and f,,, changes by 
about AAk. Thus the Lagrange multiplier measures the sensitivity to k. 

Pronounce his name “Lagronge” or better ‘“‘Lagrongh”’ as if you are French. 


Fig. 13.24 Circles f=c tangent to line g =k and ellipse g = 4: parallel gradients. 


EXAMPLE 2 Maximize and minimize f= x?+ y? on the ellipse g = (x —1)?+ 4y?=4. 


dea and equatians The circles x? + y? = c grow until they touch the ellipse. The 
touching point is (Xpia> ¥min) and that smallest value of c is f,,,. As the circles grow 
they cut through the ellipse. Finally there is a point (x,,.;, ¥max) Where the last circle 
touches. That largest value of ¢ is f,,;- 

The minimum and maximum are described by the same rule: the circle is tangent 
to the ellipse (Figure 13.21b). The perpendiculars go in the same direction. Therefore 
(f., /,) is a multiple of (g,,g,), and the unknown multiplier is 4: 


Sn = Ags: 2x = A2(x — 1) 
J, = gy: 2y = ABy (3) 
gak: (x — 1)? + 4y? = 4. 


Solution The second equation allows two possibilities: y= 0 or 2 = 3. Following up 
y=0, the last equation gives (x — 1)? =4. Thus x=3 or x= — 1. Then the first 
equation gives A= 3/2 or 4= 1/2. The values of f are x?+ y?=37+0?=9 and 
x?+y=(-1P +07 =1. 

Now follow A= 1/4. The first equation yields x = — 1/3. Then the last equation 
requires y* = 5/9. Since x? = 1/9 we find x? + y? = 6/9 = 2/3. This is fiia- 


Conclusion The equations (3) have four solutions, at which the circle and ellipse 
are tangent. The four points are (3, 0), (—1, 0), (- 1/3, \/5/3), and (1/3, — \/5/3). The 
four values of fare 9, 1, %, 4. 


Summary The three equations are f, = Ag, and f, = Ag, and g = k. The unknowns 
are x, y, and A. There is no absolute system for solving the equations (unless they are 
linear; then use elimination or Cramer’s Rule). Often the first two equations yield x 
and y in terms of J, and substituting into g = k gives an equation for A. 

At the minimum, the level curve f(x, y)}=c is tangent to the constraint curve 
gtx, y) =k. If that constraint curve is given parametrically by x(t) and y(2}, then 
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minimizing f(x{t), y(t)} uses the chain rule: 
ffx, Fay 
dt @xdt dydt 
This is the calculus proof that grad f is perpendicular to the curve. Thus grad f is 


parallel to grad g. This means (f,, f,) = A(g., By). 
We have lost f, = 0 and f,= 0. But a new function L has three zero derivatives: 


=0 or (grad f)-(tangent to curve) =0. 


430 The Lagrange function is L(x, y, A) =f(x, y}— A(e(x, y)— k). Its three 
derivatives are L, = L, = L, = 0 at the solution: 


OL 
me t=O. (4) 


Note that ¢L/d4 = 0 automatically produces g = k. The constraint is “built in’ to L. 
Lagrange has included a term A(g — k), which is destined to be zero— but its derivatives 
are absolutely needed in the equations! At the solution, g=k and L=f and 
AaL/é@k =A. 

What is important is f, = Ag, and f, = A4g,, coming from L, = L, = 0. In words: The 
constraint g= k forces dg = g,dx + gydy = 0. This restricts the movements dx and dy. 
They must keep to the curve. The equations say that df= f.dx + f,dy is equal to Adg. 
Thus df is zero in the allowed direction—which is the key point. 


MAXIMUM AND MINIMUM WITH TWO CONSTRAINTS 


The whole subject of min(max)imization is called optimization. Its applications to 
business decisions make up operations research. The special case of linear functions 
is always important—in this part of mathematics it is called fimear programming. A 
book about those subjects won’t fit inside a calculus book, but we can take one more 
step—to allow a second constraint. 

The function to minimize or maximize is now f(x, y,z). The constraints are 
a(x, y, 2) = k, and A(x, y, z) = k,. The multipliers are 2, and 7,. We need at ieast three 
variables x, y, z because two constraints would completely determine x and y. 


A3P To minimize f(x, y, z) subject to g(x, y, z)= k, and A(x, y, z)= kz, solve five 
equations for x, y, z, 4,, Az. Combine g =k, and h=k, with 

ih C7} ih 
ag é f_, i) Ff _, oe ch 


Hg. Seg gS =A, e+e. 
2 ax’ dy “lay “7 ay’ éz +62 7a 


of 


ox 18x 


a | (5) 


Figure 13.224 shows the geometry behind these equations. For convenience f is 
x? + y? + z?, so we are minimizing distance (squared). The constraints g= x + y+ z= 
9 and h=x + 2y + 3z = 20 are linear—their graphs are planes. The constraints keep 
(x, y, z) on both planes—and therefore on the line where they meet. We are finding 
the squared distance from (0, 0, 0) to a line. 

What equation do we solve? The level surfaces x? + y? + z? =c are spheres. They 
grow as c increases. The first sphere to touch the line is tangent to it. That touching 
point gives the solution (the smallest c). Alf three vectors grad f, grad g, grad h are 
perpendicular to the line: 


line tangent to sphere = grad f perpendicular to line 


line in both planes => grad g and grad 4 perpendicular to line. 
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Thus grad f, grad g, grad h are in the same plane—perpendicular to the line. With 
three vectors in a plane, grad f is a combination of grad g and grad h: 


(fz d; i= Ai(Bx» Bys g,)+ Athy, hy, h,). (6} 
This is the key equation (5). It applies to curved surfaces as wel] as planes. 


EXAMPLE 3 Minimize x? + y? + z* when x +y+z=9 and x+ 2y+ 3z= 20. 


in Figure 13.22b, the normals to those planes are grad g=(1, 1,1) and gradh = 
(1, 2, 3). The gradient of f= x? + y? + z? is (2x, 2y, 2z). The equations (5)—(6) are 


2x=A,tAg, Zy=A,+24,, 2z=A, + 3A,. 
Substitute these x, y, z into the other two equations g= x + y+ z=9 and A= 20: 


i Be een Wee ee ees dita Ay + 3da_ 


After multiplying by 2, these simplify to 34, + 64, = 18 and 64, + 144, =40. The 
solutions are 4, = 2 and A, = 2. Now the previous equations give (x, y, z) = (2, 3, 4). 

The Lagrange function with two constraints is L(x, y,z,4,,4:)= 
f—4,(g — k,)— 4,(h — k2). Its five derivatives are zero—those ate our five equations. 
Lagrange has increased the number of unknowns from 3 to 5, by adding 4, and A. 
The best point (2, 3, 4) gives fi, = 29. The 2’s give f/¢k—the sensitivity to changes 
in 9 and 20. 


9 and +2 3 20. 


grad f = (4, 6, 9) (4, 6,9) 
=A, grad g +d, grad h 


plane & = 20 


line 


tangent x plane g =9 
to sphere 


intersection 
of planes 


Fig. 13.22 Perpendicular vector grad fis a combination 4, grad g+ A, grad A. 


INEQUALITY CONSTRAINTS 


In practice, applications involve inequalities as well as equations. The constraints 
might be g < k and A > 0, The first means: It is not required to use the whole resource 
k, but you cannot use more. The second means: 4 measures a quantity that cannot 
be negative. At the minimum point, the multipliers must satisfy the same inequalities: 
A, <Oand A, 20. There are inequalities on the A’s when there are inequalities tn the 
constraints. 

Brief reasoning: With g < & the minimum can be on or inside the constraint curve. 
Inside the curve, where g < k, we are free to move in all directions. The constraint is 
not really constraining. This brings back f, = 0 and f,=0 and 4=0—an ordinary 
minimum. On the curve, where g = & constrains the minimum from going lower, we 
have 4 <0. We don’t know in advance which to expect. 
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5x + 6v=c y 
¢ too small Q=(4,0) 
Fmin = 20 
eR 
Fig. 43.23 Linear programming: f and g are linear, inequalities cut off x and y. 
43.7 EXERCISES 

Read-through questions 
A restriction g(x, y}=& ts called a _ a. The minimizing Jmin iS _¢ _ to the constraint curve g=k. The number 4 
equations for f{x, y) subject tog=k are __b_. The number turns out to be the derivative of __g with respectto oh 
A is the Lagrange _ ¢ _. Geometrically, gradfis_ 4 __ to The Lagrange function is L=__i__ and the three equations 


grad g at the minimum. That is because the __e@_ curve f= forx,y,4are__j _and_*« and | 
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For 100 constraints g; < k;, there are 100 2’s. Some 4’s are zero (when g; < k,) and 
some are nonzero (when g; = k;). It is those 2'°° possibilities that make optimization 
interesting. In near programming with two variables, the constraints are x > 0, y > 0: 


EXAMPLE 4 Minimize f= 5x + 6y with g=x+y=4 and h=x>Qand H=y2>0. 


The constraint g=4 is an equation, # and H yield inequalities. Each has its own 
Lagrange multiplier—and the inequalities require 4, > 0 and 4, > 0. The derivatives 
of f, g,h, H are no problem to compute: 


By 81a, 1425 TAG yields S=2,+A, 

ee . 2 ah dH m 
Fe eta tas = yields 6= A, +5. 

dy dy y 


Those equations make A; larger than 4,. Therefore 4, > 0, which means that the 
constraint on H must be an equation. (Inequality for the multiplier means equality 
for the constraint.) In other words H=y=0. Then x+ y=4 leads to x= 4. The 
solution ts at (Xmins ¥min) = (4, 0), where f,,;, = 20. 

At this minimum, 4 = x = 4 is above zero. The multiplier for the constraint A > 0 
must be 2, = 0. Then the first equation gives 4, = 5. As always, the multiplier mea- 
sures sensitivity. When g=4 is increased by Ak, the cost f,;, = 20 is increased by 
SAk. In economics A, =5 is called a shadow price—it is the cost of increasing the 
constraint. 

Behind this example is a nice problem in geometry. The constraint curve x + y=4 
is a line. The inequalities x > 0 and y 20 leave a piece of that line—from P to @ in 
Figure 13.23. The level curves f= 5x + 6y=c move out as c increases, until they 
touch the line. The first touching point is Q = (4, 0), which is the solution. It is always 
an endpoint—or a corner of the triangle POR. It gives the smallest cost f,i,, which 
is c = 20. 


constraint 
=xr+ z= 
xty=4 gHxty+z=1 


Sx + 6y = 20 (Problem 25) 


13.7 Constraints and Lagrange Mulliipiiers 


To minimize f= x? — y subject to g=x—y =O, the three 
equations for x, y,Aare_m_. Thesolutionis____. In this 
example the curve f(x, y)=fayn= 9 is ap which is 

a__ to the line g = 0 at (x,,15+ Penind: 


With two constraints g{x, y, z) =k, and A(x, y, z) = k, there 
are multipliers. The five unknowns arc __8__. The five 
equationsare__t_. The level surface f=f,,,,is__u__ to the 
curve where g=k, and h=k,. Then gradfis__v___ to this 
curve, andsoaregradgand_ w  .Thus__x__isacombina- 
tion of grad gz and _y¥__. With nine variables and six con- 
straints, there will be__2 _ multipliers and eventually __A 
equations. Ifa constraintisan__§ ¢<k, then its multiplier 
must satisfy 2 <0 at a minimum. 


1 Example { minimized f= x? + y? subject to 2x+y=k. 
Solye the constraint equation for y =k — 2x, substitute into 
f, and minimize this function of x. The minimum is at (x, y) = 

, where f= : 


Note: This direct approach reduces to one unknown x, 
Lagrange increases to x, y, 4. But Lagrange is better when the 
first step of solving for y is difficult or impossible. 


Minimize and maximize (x, y) in 2-6. Find x, y, and 4. 
2 f=x’y with g=x? +y?=1 


1 1 
a ey Ee 


4 f=3x+y with g=x?+9y?=1 
§ f=x?+y? with g=x% + y6 =2. 


6 f=xt+y with g=x! 7) =k. With x =capital and y= 
labor, g is a Cobb-Douglas function in economics. Draw two 
of its level curves. 


7 Find the point on the circle x? + y? = 13 where f= 2x — 3y 
is a maximum. Explain the answer. 


8 Maximize ax + by + cz subject to x? + y? + 27 =k7. Write 
your answer as the Schwarz inequality for dot products: 
(a, b, c)+(x, y,z}<_ 


9 Find the plane z=ax+by+c that best fits the points 
(x, y, 2) _ (0, 0, 1), d, 0, 0), di, I, 2), (0, i, 2). The answer a, b,c 
minimizes the sum of (z — ax — by —c)}’ at the four points. 


10 The base of a Uiangle is the top of a rectangle (5 sides, 
combined area = 1). What dimensions minimize the distance 
around? 


11 Draw the hyperbola xy= —1 touching the circle g= 
x? 4+ y? =2. The minimum of f= xy on the circle is reached 
at the points . The equations f, = i and las dg, 
are satisfied at those points with A= 


12 Find the maximum of f= xy on the circle g = x? + y? =2 
by solving f, = 4g, and f, = 4g, and substituting x and y into 
f. Draw the level curve f= f,,, that touches the circle. 


919 


13 Draw the level curves of f= x? + p? with a closed curve C 
across them to represent g{x, y)=k. Mark a point where C 
crosses a level curve. Why is that point not a minimum of f 
on C? Mark a point where C is tangent to a level curve. Is 
that the minimum of f on C? 


14 On the circle g=x?4 y?=1, Example 5 of 13.6 mini- 
mized f= xy — x — y. (a) Set up the three Lagrange equations 
for x, y, 4. (b) The first two equations give x = y= 
(c) There is another solution for the spectal value A= ee 
when the equations become . This is easy to miss 
but it gives fai, = — 1 at the point 
Problems 15-18 develop the theory of Lagrange multipliers. 
15 (Sensitivity) Certainly L=f—A(g—k) has @bj/ék=2. 
Since L = fii. and g = k at the minimum point, this seems to 
prove the key formula df,,;,/dk = 42. But xX,in, Ymins 4 200 fos, 
all change with k. We need the total derivative of L(x, y, 4, k): 
dL @Ldx  éLdy OL da éL dk 
dk Oxdk dydk OLdk* Okhdk 
Equation (i) at the minimum point should now yield the 
sensitivity formula df_;,/dk = 2. 
16 (Theory behind 4) When g(x, y}=k is solved for y, it 
gives a curve y= R(x). Then minimizing f(x, y) along this 
curve yields 
¥ ff afdR % og dR 
ax ay ax ax * ay dx 

Those come from the tule: dffdx =O at the mini- 
mum and dg/dx =0 along the curve because g = 
Multiplying the second equation by A= (af/ay)/(de/ey) and 
subtracting from the first gives =0. Also af/éy = 
Adg/dy. These are the equations (I) for x, y, 4. 
17 (Example of failure) 4=Jf,/g, breaks down if g, = 0 at the 
minimum point. 

(a)g=x?—y?=0 does not allow negative y because 

(b) When g=0 the minimum of f= x? + y is at the point 

{c} At that point f,= which is 
impossible. 


(d) Draw the pointed curve g=0 to see why it is not tan- 
gent to a level curve of f. 


Ag, becomes 


18 (No maximum) Find a point on the line g=x+y=1 
where f(x, y}= 2x + y is greater tban 100 (or 1000}. Write out 
grad f= A grad g to see that there is no solution. 


19 Find the minimum of f=x?+2y?+2z? if (x,y,z) is 
restricted to the planes g=x+y+z=O and h=x—z=1. 


20 (a) Find by Lagrange multipliers the volume V = xyz of 
the largest box with sides adding up to x+y+z=k, (b) 
Check that 4 = dV,,,,/dk. (c) United Airlines accepts baggage 
with x + y+z= 108", If it changes to t11”, approximately 
how much (by 2) and exactly how much does V,,, increase? 
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21 The planes x =0 and y=0 intersect in the line x= y =O, 
which is the z axis. Write down a vector perpendicular to the 
plane x =0 and a vector perpendicular to the plane y=0. 
Find A, times the first vector plus 4, times the second. This 
combination is perpendicular to the line 


22 Minimize f= x? + y? + z? on the plane ax + by +ez = d— 

one constraint and one multiplier. Compare f,;, with the 

distance formula |d|/,/a” + b* +c? in Section 11.2. 

23 At the absolute minimum of f(x, y}, the derivatives 
are zero. If this point happens to fall on the curve 


e(x, y) =k then the equations f, = dg, and f, = 4g, hold with 
A= , 


Problems 24-33 allow inequality constraints, optional but good. 


24 Find the minimum of f= 3x + Sy with the constraints g = 
x+2y=4 and h=x>0 and H = y 20, using equations like 
(7). Which multiplier is zero? 


25 Figure 13.23 shows the constraint plane g=x+y+z=1 
chopped off by the inequalities x >0, y 20, z 20. What are 
the three “endpoints” of this triangle? Find the minimum and 
maximum of f= 4x —2y + 5z on the triangle, by testing f at 
the endpoints. 


26 With an inequality constraint g <&, the multiplier at the 
minimum satisfies 4 < 0. If k is increased, f,.,, goes down (since 
A= dfmpi/dk). Explain the reasoning: By increasing k, (more) 
(fewer) points satisfy the constraints. Therefore (more) (fewer) 
points are available to minimize { Therefore f,i, goes (up) 
(down). 


27 With an inequality constraint g<k, the multiplier at a 
maximum point satisfies A > 0. Change the reasoning in 26. 


28 When the constraint h>k is a strict inequality h>k at 
the minimum, the multiplier is 2=0. Explain the reasoning: 
For a small increase in k, the same minimizer is still available 
(since h>k leaves room to move). Therefore f,;, is 
{changed)(unchanged), and i = df,,,/dk is ; 


29 Minimize f= x* + y? subject to the inequality constraint 
x+y<4. The minimum is obviously at , where f, 
and f, are zero. The multiplier is 4= . A small 
change from 4 will leave f.;, = so the sensitivity 
Ofniaf@k still equals 2. 


30 Minimize f= x? + y’ subject to the inequality constraint 
x+y24, Now the minimum ts at and the multi- 
plier is 2 = and fain = . Asmall change to 
4+ dk changes f,, by what multiple of dk? 


31 Minimize f= 5x + 6y withe =x+ y=4andh=x 2>Qand 
H=y<0 Now 4,<0 and the sign change destroys 
Example 4. Show that equation (7) has no solution, and 
choose x, y to make Sx + 6y < — 1000. 


32 Minimize f = 2x + 3y + 4z subject tog =x+y+z=1and 
x, y,Z20. These constraints have multipliers 4, >0, 4; 20, 
Aq 20. The equations are 2=A, + Az, , and 4= 
A; + Aq. Explain why A; >0 and 4,>0 and fain = 2. 


33 A wire 40” long is used to enclose one or two squares 
(side x and side y). Maximize the total area x* + y* subject to 
x20, y2O, 4x + 4y = 40, 
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CHAPTER 14 


Multiple Integrals 


Ee §8614.1 Double Integrals 


This chapter shows how to integrate functions of two or more variables. First, a 
double integral is defined as the limit of sums. Second, we find a fast way to compute 
it. The key idea is to replace a double integral by two ordinary “‘single’’ integrals. 

The double integral ff Six, y)dy dx starts with f f(x, y)dy. For each fixed x we integ- 
rate with respect to y. The answer depends on x. Now integrate again, this time with 
respect to x. The limits of integration need care and attention! Frequently those limits 
on y and x are the hardest part. 

Why bother with sums and limits in the first place? Two reasons. There has to be 
a definition and a computation to fall back on, when the single integrals are difficult 
or impossible. And also—this we emphasize—muitiple integrals represent more than 
area and volume. Those words and the pictures that go with them are the easiest to 
understand. You can almost see the volume as a “‘sum of slices” or a “double sum of 
thin sticks.” The true applications are mostly to other things, but the central idea is 
always the same: Add up small pieces and take limits. 

We begin with the area of R and the volume of V, by double integrals. 


A LIMIT OF SUMS 


The graph of z = f(x, y) is a curved surface above the xy plane. At the point (x, y) in 
the plane, the height of the surface is z. (The surface is above the xy plane only when 
z is positive. Volumes below the plane come with minus signs, like areas below the 
x axis.) We begin by choosing a positive function—for example z= 1+ x? + y’. 

The base of our solid is a region R in the xy plane. That region will be chopped 
into small rectangles (sides Ax and Ay). When R itself is the rectangle 0< x <1, 
0<y <2, the small pieces fit perfectly. For a triangle or a circle, the rectangles miss 
part of R. But they do fit in the limit, and any region with a piecewise smooth 
boundary will be acceptable. 


Question What is the volume above R and below the graph of z= f(x, y)? 
Answer It is a double integral—rhe integral of f(x, y) over R. To reach it we begin 
with a sum, as suggested by Figure 14.1. 521 
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point (x, ¥;) 
x area AA 


Fig. 14.4. Base R cut into small pieces AA. Solid V cut into thin sticks AV = 2AA. 


For single integrals, the interval [a, b] is divided into short pieces of length Ax. 
For double integrals, R is divided into small] rectangles of area AA = (Ax)(Ay). Above 
the ith rectangle is a “thin stick” with small volume. That volume is the base area 
AA times the height above it—except that this height z = f(x, y} varies from point to 
point. Therefore we select a point (x;, y;) in the ith rectangle, and compute the volume 
from the height above that point: 


volume of one stick =f(x;, AA volume of all sticks = pie (i ¥IAA. 


This is the crucial step for any integral—to see it as a sum of small pieces. 

Now take limits: Ax + 0 and Ay + 0. The height z = f(x, ) is nearly constant over 
each rectangle. (We assume that fis a continuous function.) The sum approaches a 
limit, which depends only on the base R and the surface above it. The limit is the 
volume of the solid, and it is the double integral of f(x, y) over R: 


i fix. dA = lim ¥ f(x; JAA. (1) 
R rr 

To repeat: The limit is the same for afl choices of the rectangles and the points (x;, y,). 
The rectangles will not fit exactly into R, if that base area is curved. The heights are 
not exact, if the surface z = f(x, y) is also curved, But the errors on the sides and top, 
where the pieces don’t fit and the heights are wrong, approach zero. Those errors are 
the volume of the “icing” around the solid. which gets thinner as Ax + 0 and Ay 3 0. 
A carcful proof takes more space than we are willing to give. But the properties of 
the integral need and deserve attention: 


1, Linearity: [[(f+ g)dA =Jf fdA + ff eda 

2. Constant comes outside: {f ¢f{x, yd =c If f(x, pd A 

3. R splits into § and T{not overlapping): [J fdA = f[ fda + [[ fda. 
R AY T 


In 1 the volume under f+ g has two parts. The “‘thin sticks” of height f+ g split into 
thin sticks under fand under g. In 2 the whole volume is stretched upward by c. In 
3 the volumes are side by side. As with single integrals, these properties help in 
computations. 

By writing dA, we allow shapes other than rectangles. Polar coordinates have an 
extra factor r in dA =r dr d@. By writing dx dy, we choose rectangular coordinates 
and prepare for the splitting that comes now, 
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SPLITTING A DOUBLE INTEGRAL INTO TWO SINGLE INTEGRALS 


The double integral {{ f(x, y)dy dx will now be reduced to single integrals in y and 
then x. (Or vice versa. Our first integral could equally well be { f(x, y)dx.) Chapter 8 
described the same idea for solids of revolution. First came the area of a slice, which 
is a single integral. Then came a second integral to add up the slices. For solids 
formed by revolving a curve, all slices are circular disks—now we expect other shapes. 

Figure 14.2 shows a slice of area A(x). It cuts through the solid at a fixed value of 
x. The cut starts at y=c on one side of R, and ends at y=d on the other side. This 
particular example goes from y= 0 to y= 2 (R is a rectangle). The area of a slice is 
the y integral of f(x, y). Remember that x is fixed and y goes from c to d: 


d 
A(x) = area of slice = | S\x, y)dy (the answer is a function of x). 


c 


2 y> y=2 8 
EXAMPLE 4 4=| (1 tate yidy=| yt sty 4 =2+2x?+-. 
si 5 lao 3 
This is the reverse of a partial derivative! The integral of x?dy, with x constant, is 
xy. This “partial integral” is actually called an inner integral. After substituting the 
limits y = 2 and y= 0 and subtracting, we have the area A(x) = 2 + 2x? + $. Now the 


outer integral adds slices to find the volume { A(x) dx. The answer is a number: 


3 8 2 8.7 2, %._ 16 
= + 2x? + — \dx = +—+-x| =2+=+-=—. 
volume | ; (2 2x 5) 2x 3 3 a 2 aS 


Sas 


, ig 

Fig. 14.2 A slice of V at a fixed x has area A(x) =f f(x, y)dy. 

To complete this example, check the volume when the x integral comes first: 
2 2 l 3 2 iis 4 2 

(1 +.x* y")ax =| ¥+ =x?'+ 3*x =-+y 

0 3 3 


x=0 


+ (4 4 Lar... &. 81 
i = —+ 2 id =}-y+— 3 =-+-=>—, 
outer integral {6 y ) y E 37 a a ae 


The fact that double integrals can be split into single integrals is Fubini’s Theorem. 


1 


inner integral = | 


x= 


44A If f(x, y) is continuous on the rectangle R, then 


b d d b 
[freon ['[romo]ee=[[ fae} 0 
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The inner integrals are the cross-sectional areas A(x) and a(y) of the slices. The outer 
integrals add up the volumes A(x)dx and a(y)dy. Notice the reversing of limits. 

Normally the brackets in (2) are omitted. When the y integral is first, dy is written 
inside dx. The limits on y are inside too. | strongly recommend that you compute the 
inner integral on one line and the outer integral on a separate line. 


EXAMPLE 2 Find the volume below the plane z=x-—2y and above the base 
triangle R. 


The triangle R has sides on the x and y axes and the line x + y= 1. The strips in the 
y direction have varying lengths. (So do the strips in the x direction.) This is the main 
point of the example—the base is not a rectangle. The upper limit on the inner 
integral changes as x changes. The top of the triangle is at y= 1 — x. 
Figure 14.3 shows the strips. The region should always be drawn (except for 
rectangles). Without a figure the limits are hard to find. A sketch of R makes it easy: 
y goes from c= 0 to d= 1 — x. Then x goes from a=0 to b= 1. 


The inner integral has variable limits and the outer integral has constant limits: 


y=1—-x ee 
inner: | (x — 2y)dy = [ xy—y? Po5 = x(1 — x) —(1— x)? = -—1+ 3x — 2x? 


y=0 


1 1 
outer: | (I +3x—2e?hx=| —x+ sx >| = ciGupe aa Be hs 
x=0 


The volume is negative. Most of the solid is below the xy plane. To check the answer 
—%, do the x integral first: x goes from 0 to 1 — y. Then y goes from 0 to 1. 


= 


ae 1 1 1 5 
; . —_ =| — i —_ 7 _ Minas _— = + — 2 
inner: | ‘ (x — 2y)dx E any 5 y)* — 21 —y)y 5 3y 57 


x= 0 


A 5 es) oe 

: eee + —y? ldy=]— eat ag = tel =—-——-+-=--, 

ane {_.G y 5° jy E =| 2 2°6 6 
d 


Same answer, very probably right. The next example computes ff 1 dx dy = area of R. 


1 ) oa 1 l-y 
EXAMPLE 3 The area of R is | | dy dx and also | | dx dy. 
x=0 Jy=0 y=0 Jx=0 


The first has vertical strips. The inner integral equals 1 — x. Then the outer integral 
(of 1 — x) has limits 0 and 1, and the area is }. It is like an indefinite integral inside 
a definite integral. 


y=0 x=id | 


y= 2 


Fig. 14.3 Thin sticks above and below (Example 2). Reversed order (Examples 3 and 4). 
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2 2x 
EXAMPLE 4 Reverse the order of integration in | | xidy dx. 


x=O0 Jy=x? 


Solution Draw a figure! The inner integral goes from the parabola y = x? up to 
the straight line y = 2x. This gives vertical strips. The strips sit side by side between 
x =Qand x= 2. They stop where 2x equals x”, and the line meets the parabola. 

The problem is to put the x integral first. It goes along horizontal strips. On each 
line y = constant, we need the entry value of x and the exit value of x. From the figure, 
x goes from $y to /y. Those are the inner limits. Pay attention also to the outer 
limits, because they now apply to y. The region starts at y=0 and ends at y=4. 
No change in the integrand x*—that is the height of the solid: 


4 .¥ 
| | xrdx dy. (3) 
xady 


wend 


2 2x 
| | xtdy dx is reversed to 
x=0 Jy=x? 


EXAMPLES Find the voiume bounded by the planes x=0, p=0, z=0, and 
2xt+yt+z2=4. 


Solution The solid is a tetrahedron (four sides). [t goes from z = 0 (the xy plane) up 
to the plane 2x + y+ z= 4. On that plane z = 4 — 2x — y. This is the height function 
Sx, y) to be integrated. 

Figure 14.4 shows the base R. To find its sides, set z= 0. The sides of R are the 
lines x = 0 and y=0 and 2x+ y= 4. Taking vertical strips, dy is inner: 


4-2x 1 j4-2x 1 

inner: | (42x yay=| 2p — 39" = =(4-2xy 
eh a |, 8 
= 4-27? _ 4? _ 16 
Df s(4~2x)4dx =| - = ae 
ware {5 Bae | 2-352 5 232 3 


Question What is the meaning of the inner integral (4 — 2x}? (ana also = 


1 
Answer The first is A(x), the area of the slice. e is the solid volume. 


Question What if the inner integral | f(x, y}dy has limits that depend on y? 
Answer It can’t. Those limits must be wrong. Find them again. 


¥ 


arca AA 


density p = + 


Fig. 44.4 Tetrahedron in Example 5, semicircle in Example 6, triangle in Example 7. 


EXAMPLE 6 Find the mass in a semicircle 0 < y < ./ 1 — x? if the density is p = y. 


This is a new application of double integrais. The total mass is a sum of small masses 
(p times AA) in rectangles of area AA. The rectangles don’t fit perfeetly inside the 
semicirete R, and the density is not constant in each rectangle—but those problems 
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disappear in the Jimit. We are left with a double integral: 


total mass M = {) pdA= {| px, y) dx dy, (4) 
R R 


Set p = y. Figure 14.4 shows the limits on x and y (try both dy dx and dx dy): 


1 ,T-x 1 <T-y? 
mass M -| | y dy dx andalo M= | y dx dy. 
x7 -UJy70 y=0 J -v1-»? 
The first inner integral is $7. Substituting the limits gives }(1 -- x?). The outer integral 
of $(1 — x’) yields the total mass M = 3. 
The second inner integral is xy. Substituting the limits on x gives . Then 
the outer integral is — 4(1 — y?)*"?. Substituting » = | and y =0 yields M = 


Remark This same calculation also produces the moment around the x axis, when 
the density is g = 1. The factor } is the distance to the x axis. The moment is M, = 
ii} y dA = 4. Dividing by the area of the semicircle (which is 2/2) locates the centroid: 
x =0 by symmetry and 


¥ = height of centroid = ———— = —_= —. 
. 8 area mi2 3x 


z 


(5) 
This is the “average height” of points inside the semicircle, found earlier in 8.5, 


EXAMPLE 7 Integrate |” [¢_; cos x*dx dy avoiding the impossible { cos x? dx. 


This is a famous cxample where reversing the order makes the calculation possible. 
The base R is the triangle in Figure 14.4 (note that x goes from y to 1). én the opposite 
order y goes from 0 to x. Then fj cos x*dy = x cos x? contains the factor x that we 
need: 


1 
: if 1 Py 
outer integral: { x cos x7dx = [4 sin x? |, = 3 sin 1. 
a 


14.4 EXERCISES 


(inner) limits on x are __u__. Now the strep is 


outer integral is _w _. When the density is p(x, ¥), the total 


The double integral [[, f(x, y)d4 gives the volume between R 
and _a_.. The base is first cut into small __b of area AA. 
The volume above the ith piece ts approximately _c__. The 
limit ofthesum __d___is the volume integral. Three properties 
of double integrals are _e (linearity) and _ tf — and 


gs. 


If Ris the rectangle 0< x <4,.4<) <6, the mtegral |{x dA 
can be computed two ways. One is [|x dy dx, when the 
inner integral ish J$= i. The outer integral gives 

aps =_k__. When the x integral comes first it equals 
fxdx=_1 JO=_m Then the y integral equals 


n__. This is the volume between 9 (describe V’}. 


The area of R is f{/_p dy dx. When R is the triangle 
between x =0, 3 = 2x, and y= 1, the inner limits on y¥ are 
q__. This is the length ofa __t___ strip. The (outer) limits 


onxare__$ __. Theareats__t __. In the opposite order, the 


mass in the region R is [f__x__. The moments are M, = 
_y and M,=_2_. The centroid has x = M,/M. 


Compute the double integrals 1-4 by two integrations. 


rt 2 


1 2 
x?dx dy and | | yidx dy 
wF TO yard yO Jx at 


Sou eu (*2e ¢ 
2 | | 2xy dx dy and | dx dyixy 
yr2gx-t yo2da-1 


Crd Prd 


2 2 
3 | | sin(x + y) dx dy and [ | dy dx:(x + ¥P 
o Jo 1 


+ 


o 


my pz “1 P3 ; 
4 | ye'dx dy and | | dy dxivy3 42x44 
13 | ¢ 
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In 5-10, draw the region and compate the area. 


1 fx 
6 | { dy dx 
oO jx? 
1 f1 1 lvl 
o( { ax ay 10 | { dx dy 
-1i Jy? -1 Jx=y 


In 11-16 reverse the order of integration (and find the new 
limits) in 5-10 respectively. 


In 17-24 find the limits on {j dy dx and {[.dx dy. Draw R and 
compute its area, 

17 R=triangle inside the lines x = 0, y= 1, y = 2x. 

18 R= triangle inside the lines x = —1, y= 0, x + y=0. 

19 R=triangle inside the lines y= x, y= — x, y= 3. 

20 R =triangle inside the lines y= x, y = 2x, y = 4. 

21 R= triangle with vertices (0, 0}, (4, 4}, (4, 8). 

22 R =triangle with vertices (0, 0), (—2, —1), {1, —2). 

23 R= triangle with vertices (0, 0), (2, 0), (1, b). Here b> 0. 


*24 R =triangle with vertices (0, 0), {a, 5), (c, d). The sides are 
y=bx/a, y=dx/c, and y=b+(x —a)(d— d)/(c— a). Find 
A=ffdydx when 0<a<c,0<d<b, 


b fa 
25 Evaluate [ { a fjaxdy da dy. 
0 yo 


& fa 
26 Evaluate | | 6f/Ox dx dy. 


oo 


In 27-28, divide the unit square RX into triangles $ and T and 
verify {|r faa = js fda +fjr fda. 


27 fix, y)=2x—3y+1 28 fix, y) = xe” — ye* 


29 The area under y = f(x) is a single integral from a to b or 
a double integral (find the limits): 


[, f(x) dx - {fi dy dx, 


44.2 Change to Better Coordinates 


3 Find the limits and the area under y = t — x?: 
{c ~— x?) dx and if dx dy (reversed from 29). 


31 A city inside the circle x? + y? = 100 has population den- 
sity p(x, y) = 10100 — x? — y?}. Integrate to find its pop- 
ulation. 


32 Find the volume bounded by the planes x =0, y=O, 
z=0, and ax +5y+cz=1. 


In 33-34 the rectangle with corners (1, 1), (1, 3), (2, 1), (2, 3) has 
density p(x, y)= x7. The moments are M,={/xodA and 
M,= fj yp dA. 


33 Find the mass. 34 Find the center of mass. 
In 35—36 the region is a circular wedge of radius 1 between the 
lines y = x and y= — x. 


35 Find the area. 36 Find the centroid (x, ¥). 


37 Write a program to compute (35 flx, pdx dy by the mid- 
point rule (midpoints of n? small squares). Which flx, y) are 
integrated exactly by your program? 


38 Apply the midpoint code to integrate x? and xy and y?. 
The errors decrease like what power of Ax = Ay =1/n? 


Use the program to compute the yolume under f(x, y) in 39-42. 
Check by integrating exactly or doubling 7. 


39 fix, yy 3x4 4y45 40 fix, y}= 1//x? + y? 

41 fix, y)=x’ 42 fix, y)=e* sin zy 

43 In which order is {f x’dx dy = [ff x’dy dx easier to integ- 
rate over the square O< x <1,0< » < 1? By reversing order, 


integrate (x—1)/Inx from 0 to 1—tts antiderivative is 
unknown. 


44 Explain in your own words the definition of the 
double integral of f(x, y) over the region R. 


45 $° yA A might not approach ff y @A if we only know that 
AA —0. In the square O< x, y <1, take rectangles of sides 
Ax and 1 (not Ax and Ay). If (x;, y,) ts a point in the rectangle 
where y,=1, then } y,AA= . But f[yda= 


You don’t go far with double integrals before wanting to change variables. Many 
regions simply do not fit with the x and y axes. Two examples are in Figure 14.5, 
a tilted square and a ring. Those are excellent shapes—ia the right coordinates. 
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We have to be able to answer basic questions like these: 


Find the area [fas and moment If. aA and moment of inertia {hs dA. 


The problem is: What is dA? We are leaving the xy variables where dA = dx dy. 

The reason for changing is this: The limits of integration in the y direction are 
miserable. 1 don't know them and I don’t want to know them. For every x we would 
need the entry point P of the line x = constant, and the exit point QO. The heights of 
P and @Q are the limits on [dy, the inner integral. The geometry of the square and 
ring are totally missed, if we stick rigidly to x and y. 


(—sin &, cos &} 


(cos o, sin @&) 


Fig. 14.5 Unit square turned through angle x. Ring with radii 4 and 5. 


Which coordinates are better? Any sensibic person agrees that the areca of the tilted 
square is 1. “Just turn it and the area is obvious,” But that sensible person may not 
know the moment or the center of gravity or the moment of inertia. So we actually 
have to do the turning. 

The new coordinates u and ev are in Figure 14.6a. The limits of integration on z are 
O and 1. So are the limits on uw. But when you change variables, you dow t just change 
limits. Two other changes come with new variables: 


1. The small area dA = dx dy becomes dA = du dv. 
2. The integral of x becomes the integral of 


Substituting « = x in a singic integral. we make the same changes. Limits x = 0 and 
x= 4 become u = 0 and u= 2. Since x is u?, dx is 2u du. The purpose of the change 
is to find an antiderivative. For double integrals, the usual purpose is to improve the 
limits—but we have to accept the whole package. 

To turn the square, there are formulas connecting x and y to wand ¢. The geometry 
is clear—rotate axes by x—but it has to be converted into algebra: 


u=xXCosS sty sin x : X=ucosz—rsing 
and in reverse ; (1) 
r= —-Nsin zt y cos 2 y=usin ¢+e cos #. 


Figure 14.6 shows the rotation. As points move, the whole square turns. A good way 
to remember equation (1) is to follow the corners as they become (1, 0) and (0, 1). 
The change from |f x dA to || du dv is partly decided by equation (1). It 
gives x as a function of u and r. We also need dA. For a pure rotation the frst guess 
is correct: The area dx dy equals the area du dc, For most changes of variable this is 
faise. The general formula for dA comes after the examples. 
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v= vcost+ysing 


D=-Vsind + ¥ cosa 


v= cosd. ¥ = sing goes tou = 1,0=0 


Fig. 14.6 Change of coordinates- -axes turned by 2. For rotation dA is du dv. 


EXAMPLE 4 Find jf 4A and {jx dA and x and also ff x? dA for the tilted square. 
Solution The area of the square is fj} du de = 1. Notice the good limits. Then 


xdA=fi[h(u cos x— ve sin adu dv = 4 cos x—4 sina. (2) 
0/0 


This is the moment around the y axis. The factors } come from 3u? and 3x7. The x 


coordinate of the center of gravity is 


ape 
f= i dA? | dA =(3 cos x— 5 Sin 2/1. 


Similarly the integral of y leads to ¥. The answer is no mystery—the point (x, }') is 
at the center of the square! Substituting x = uv cos  — e sin x made x dA look worse, 
but the limits 0 and | are much better. 

The moment of inertia /, around the y axis is also simplified: 


peor Paes cos?x cosxsing sin? 
vrdA= (u cos 4—e sin a)?du de = : + . OQ) 
vo Jo 3 2 3 


You know this next fact but I will write it anyway: The answers don't contain u or v. 
Those are dummy variables like x and ». The answers do contain x, because the 
square has turned. (The area is fixed at 1.) The moment of inertia I, = [{ y? dA is the 
same as equation (3) but with all plus signs. 


Question The sum /, + /, simplifies to $ {a constant). Why no dependence on x? 
Answer 1, + 1, equals Jy. This moment of inertia around (0, 0) is unchanged by 
rotation. We are turning the square around one of its corners. 


CHANGE TO POLAR COORDINATES 


The next change is tor and}. A smail area becomes dA = r dr dé (definitely not dr dé). 
Area always comes from multiplying two lengths, and d@ is not a length. Figure 14.7 
shows the crucial region—a “polar rectangle” cut out by rays and circles. Its area 
AA is found in two ways, both leading to r dr dé: 


(Approximate) The straight sides have length Ar. The circular arcs are 
close to rA@. The angles are 90°. So AA ts close to (Ar)(rAQ@). 
(Exact) A wedge has area r7A0. The difference between wedges is AA: 
! Ary ! Ar? 
AA= 3(r+ 3] AQ - 3(r- 3) AU =r Ar AQ. 


2 
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The exact method places r dead center (see figure). The approximation says: Forget 
the change in rA@ as you move outward. Keep only the first-order terms. 

A third method is coming, which requires no picture and no geometry. Calculus 
always has a third method! The change of variables x =r cos 6, y=rsin @ will go 
into a general formula for dA, and out will come the area r dr dé. 


do 


45 
Fig. 14.7 Ring and polar rectangle in xy and r@, with stretching factor r = 4.5. 


EXAMPLE 2 Find the area and center of gravity of the ring. Also find [{x7dA. 


Solution The limits on r are 4 and 5. The limits on @ are 0 and 27. Polar coordinates 
are perfect for a ring. Compared with limits like x = ./25—y?, the change to rdrd0 
is a small price to pay: 

2x 


2 5 
area= { [rdr d0= 2n{ 47? |f = 15? — 74? = 9n. 
04 


The @ integral is 2x (full circle). Actually the ring is a giant polar rectangle. We could 
have used the exact formula r Ar AO, with A@ = 22 and Ar = 5 — 4. When the radius 
r is centered at 4.5, the product r Ar A@ is (4.5)(1)(2x) = 9x as above. 

Since the ring is symmetric around (0, 0), the integral of x dA must be zero: 


2x5 5 ‘ 2n 
J xdA= J Jer cos @)r dr dd = [4°] [ sin a |; =, 
Notice r cos 6 from x—the other r is from dA. The moment of inertia is 
2n 


2x5 
ff x2dA = ff r2cos?0 r dr do =[ 4r* |? { cos%@ d0 = 2(5* — 4*)n. 
R 04 


This 6 integral is x not 2x, because the average of cos?@ is $ not 1. 
For reference here are the moments of inertia when the density is p(x, y): 


1,=ffx?pdA  1,=Jfy?pdA Ig =Sfr?p dA =polar moment=1,+1,. (4) 


EXAMPLE 3 Find masses and moments for semicircular plates: p = 1 and p=1—r. 


Solution The semicircles in Figure 14.8 have r=1. The angle goes from 0 to x 
(the upper half-circle). Polar coordinates are best. The mass is the integral of the 
density p: 


1 ri 


M= {r dr d0=(4)(x) and M= J y(t —r)r dr dO = (3)(z). 


44.2 Change to Better Coordinates Bo 


The first mass 2/2 equals the area (because p= 1). The second mass 7/6 is smaller 
(because p < 1). Integrating p = 1 is the same as finding a volume when the height is 
z= 1 (part of a cylinder). Integrating p = 1 — r is the same as finding a volume when 
the height is z= 1 —r (part of a cone). Volumes of cones have the extra factor 4. 


The center of gravity involves the moment M,=f{yp dA. The distance from the 
x axis is y, the mass of a small piece is p dA, integrate to add mass times distance. 
Polar coordinates are still best, with y=r sin 6. Again p= 1 and p=1-—r: 


xl xr il 
{fydA=f [rsin@rdrdd=% = ffyi-rdA= J {rsin 01 —r)rdr dé =}. 
00 
The height of the center of gravity is j= M,/M = moment divided by mass: 


when p=1-r. 


-2 -1 l 2 
Fig. 14.8 Semicircles with density piled above them. Fig. 14.9 Bell-shaped curve. 


Question Compare j for p= 1 and p= other positive constants and p= 1 —r. 
Answer Any constant p gives y = 4/3z. Since 1 — ris dense at r = 0, j drops to 1/z. 


Question How is j = 4/37 related to the “average” of y in the semicircle? 
Answer They are identical. This is the point of j. Divide the integral by the area: 


The average value of a function is {40s y)dA | {| dA. (5) 


The integral of fis divided by the integral of 1 (the area). In one dimension \s v(x) dx 
was divided by lp 1 dx (the length b—a). That gave the average value of v(x) in 
Section 5.6. Equation (5) is the same idea for f(x, y). 


EXAMPLE 4 Compute a={ e-dx=,/n from v= eax | e -’'dy=n. 
oP =a —-@ 
A is the area under a “bell-shaped curve”—see Figure 14.9. This is the most important 
definite integral in the study of probability. It is difficult because a factor 2x is not 
present. Integrating 2xe~** gives —e*’, but integrating e *’ is impossible—except 
approximately by a computer. How can we hope to show that A is exactly Jn 
The trick is to go from an area integral A to a volume integral A”. This is unusual 


(and hard to like), but the end justifies the means: 


ro) wo 2n al 
a | | ere raydc=| | e-"r dr dé. (6) 
x=-o Jy=-@ 6=0 Jr=0 


The double integrals cover the whole plane. The r? comes from x? + y?, and the 
key factor r appears in polar coordinates. It is now possible to substitute u=r?. 
The r integral is ${f e~“du=4. The @ integral is 2x. The double integral is (3)(2z). 
Therefore A? = x and the single integral is A = Jn. 
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EXAMPLES Apply Example 4 to the “normal distribution” p(x) = e~*"!?/./2n. 


Section 8.4 discussed probability. It emphasized the importance of this particular p(x). 
At that time we could not verify that { p(x)dx = 1. Now we can: 


= iedds |” e-™? -—-|" -P gy = 
x /2y yields val es dx te e *dy=1. (7) 


-% RJ-a@ 


Question Why include the 2’s in p(x)? The integral of ei ln also equals 1. 
Answer With the 2’s the “variance” is | x*p(x) dx = 1. This is a convenient number. 


CHANGE TO OTHER COORDINATES 


A third method was promised, to find r dr d@ without a picture and without geometry. 
The method works directly from x =rcos @ and y=rsin @. It also finds the 1! in 
du dv, alter a rotation of axes. Most important, this new method finds the factor J in 
the area dA = J du dv, for any change of variables. The change is from xy to uv. 

For single integrals, the “stretching factor” J between the original dx and the new 
du is (not surprisingly) the ratio dx/du. Where we have dx, we write (dx/dujdu. Where 
we have (du/dx)dx, we write du. That was the idea of substitutions—the main way 
to simplify integrals. 

For double integrals the stretching factor appears in the area: dx dy becomes 
|J| du dv. The old and new variables are related by x = x(u, v) and y = y(u, v). The point 
with coordinates u and » comes from the point with coordinates x and y. A whole 
region S, full of points in the uv plane, comes from the region R full of corresponding 
points in the xy plane. A small piece with area |J| du dv comes from a small piece with 
area dx dy. The formula for J is a two-dimensional version of dx/du. 


44B The stretching factor for area is the 2 by 2 Jacobian determinant J(u, v): 
Ox/du ax/dv 


j= : 
dy/du dy/v| Gu dv dv du (8) 


An integral over R in the xy plane becomes an integral over S in the up plane: 


ff, (x, y) dx dy = ff, S(x(u, v), Wu, v)) [J] du dv. (9) 


The determinant J is often written o(x, y)/é{u, v), as a reminder that this stretching 
factor is like dx/du. We require J #0. That keeps the stretching and shrinking under 
control. 

You naturally ask: Why take the absolute value |J| in equation (9)? Good 
question—it wasn’t done for single integrals. The reason is in the limits of integration. 
The single integral {} dx is [; * (—du) after changing x to —u. We keep the minus sign 
and allow single integrals to run backward. Double integrals could too, but normally 
they go left to right and down to up. We use the absolute value |J| and run forward. 


EXAMPLE Polar coordinates have x = ucosv=rcos@ and py=usinv=rsiné. 


éx/ér éx/é cos6 —rsin@ 


With no geometry: J= = =r, (10) 


sin @ r cos @ 


Gy/ér @y/é@ 
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EXAMPLE 7 Find J for the linear change to x = aut bv and y=cu+ dv, 
a b 
c d 


Cxidu éx/év 


=ad— be. (10 


Ordinary determinant! J=j) | 
Cy/cu éy/ér 


Why make this simple change, in which a, 4,c,d are all constant? It straightens 
parallclograms into squares (and rotates those squares). Figure 14.10 is typical. 
Common sense indicated J = 1 for pure rotation—no change in area. Now J = | 
comes from equations (1) and (11), because ad ~ be is cos*% + sin?. 
‘In practice, xy rectangles generaily go into uv rectangles. The sides can be curved 
(as in polar rectangles) but the angles are often 90°. The change is “orthogonal.” The 
next cxample has angles that are not 90°, and J still gives the answer. 


) ’ 
es (2u + v) v H 
Re 
fee (ue + 20) 2 


(3,3) 


3 


x “ ane 


Fig. 14.40 Change from xy to av has J = 4. Fig. 14.41 Curved areas are also 
dA =[ldu de. 
EXAMPLE 8 = Find the area of R in Figure 14.10. Also compute [f e*dx dy. 
R 


Solution The figure shows x = 4u+ 4c and y= 4u+4v. The determinant is 
2/3 1/3 
1/3 2/3 


exfeu Ox/ée 


Cy/eu Opfev 
The area of the xy parallelogram becomes an integral over the wv square: 
33 
ff dx dy=ff|Jldude =f [4 dudo=4-3-3=3. 
R S 00 


The square has area 9, the parallelogram has area 3. I don’t know if J=4 is a 
stretching factor or a shrinking factor. The other integral {| e*dx dy is 


ee 2uj3~vi3 ly d 3 Quid : vid a | 3 2 
I: ix 3 u =| 56 [| [3-36 — 1){e — 1). 


Main point: The change to u and v makes the limits easy (just 0 and 3). 


Why is the stretching factor J a determinant? With straight sides, this goes back to 
Section 11.3 on vectors. The area of a parallelogram is a determinant. Here the sides 
are curved, but that only produces (du)? and (dv)?, which we ignore. 

A change du gives one side of Figure 14.11—it is (@x/dui-+ éey/eujjdu. Side 2 is 
(éx/cvi+ @y/évjide. The curving comes from second derivatives. The area (the cross 
product of the sides) is |J |du dv. 
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Final remark [can’t resist looking at the change in the reverse direction. Now the 
rectangle is in xy and the parallelogram is in we. In all formulas, exchange x for u 
and y for v: 

Gu/dx éufdy 


Gvjax dviéy 


Ss (12) 


This is exactly like du/dx = 1/(dx/du). It is the derivative of the inverse function. 
The product of slopes is 1—stretch out, shrink back. From xy to uv we have 2 by 2 
matrices, and the identity matrix / takes the place of 1: 


ax du Ox/Gu  Ox/ev || Gufdx du/dy 1 0 
—-—-=1 becomes = ' (13) 
dy/du dy/v || dvfdx  Gufdy 0 1 


The first row times the first column is (@x/Gu)(@u/@x) + (dx/év)(dv/dx) = dx/dx = 1, 
The first row times the second column is (@x/Gu)}(Gu/dy) + (8x/ev)(dv/Oy) = dx/dy = 9. 
The matrices are inverses of each other. The determinants of a matrix and its inverse 
obey our rule: old J times new J = 1. Those J’s cannot be zero, just as dx/du and 
du/dx were not zero. (Inverse functions increase steadily or decrease steadily.) 

In two dimensions, an area dx dy goes to J du ds and comes back to dx dy. 


44.2 EXERCISES 


Read-through questions 


We change variables to improve the _ 9 __ of integration. 

The disk x?+y?<9 becomes the rectangle O<r<_b_, 

0<0<_¢__. The inner limits on [[dydxarey=+ 4 

In polar coordinates this area integral becomes {f__e = 
t 


A polar rectangle has sides dr and _ 9 __. Two sides are 
not __/_ but the angles are still __|  . The area between 
the circies r= 1 and r=3 and the rays 0@=0 and 0=2/4 is 

i__. The integral {jx dy dx changes to [{f__k _. This is 


the_ | around the _™__ axis. Then xis the ratio 


This is the x coordinate of the _ ©, and it is the _ PB 
value of x. 


In a rotation through 2, the point that reaches (u, v) starts 
atx=ucosa—vrsina, y=_ 4. A rectangle in the wt plane 
comesfroma__'_inxy. The areasare__# so the stretch- 
ing factor is J=__t__., This is the determinant of the matrix 

u_ containing cos and sing. The moment of inertia 
fj x?dx dy changes to f[ _¥ du do. 


For single integrals dx changes to __¥ _ du. For double 
integrals dx dy changes to Jdudv with J=_ x. The 
stretching factor J is the determinant of the 2 by 2 matrix 

y__. The functions x(u, v} and yu, v) connect an xy region 
R toa wv region S, and ffgdxdy=[f,_ 2 =areaof_A 
For polar coordinatesx=_ 8 ,y=_C_.Forx=u, y= 
u + 4p the 2 by 2 determinant is J/=_ 2. A square in the 
uv plane comes froma __€ ___ in xy. In the opposite direction 
the change has u=x and v=}(y—x)andanewJ=_F_. 
This J is constant because this change of variables is __© 


In 1-12 R is a pie-shaped wedge: 0 <r <1 and 7/4 <0 < 3n/4. 


1 What is the area of R? Check by integration in polar 
coordinates. 


2 Find limits on ff dy dx to yield the area of R, and integ- 
rate. Extra credit: Find limits on {f dx dy. 


3 Equation (1) with «= 7/4 rotates R into the uv region S = 
. Find limits on {f du de. 


4 Compute the centroid height j of R by changing [f y dx dy 
to polar coordinates. Divide by the area of R. 


5 The region R has x = 0 because . After rotation 
through z = 7/4, the centroid (x, §) of R becomes the centroid 
of S. 


6 Find the centroid of any wedge O<¢ r<a,0<0 <b, 


7 Suppose R* is the wedge R moved up so that the sharp 
point is atx =0, y=1, 
(a} Find limits on ff dy dx to integrate over R*. 
(b} With x* =x and y* = y—1, the xp region R* corres- 
ponds to what region in the x*y* plane? 
{c) After that change dx dy equals dx*dy*. 


8 Find limits on [{r dr d0 to integrate over R* in Problem 7. 


9 The right coordinates for R* are r* and #*, with x= 
r* cos 0* and y =r" sin &* + 1. 

{a) Show that J=r* so dA =r*dr*d0*. 

(b) Find limits on {jr*dr*d0* to integrate over R*. 
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10 Ifthe centroid of R ts (0, ¥), the centroid of R* is : 
The centroid of the circle with radius 3 and center (1, 2) is 
. The centroid of the upper half of that circle is 


11 The moments of inertia I,, f,, Zo of the original wedge R 
are 


12 The moments of inertia I,, J,, Io of the shifted wedge R* 
are - 


Problems 13-16 change four-sided regions to squares. 


13 Rhas straight sides y = 2x, x = 1, y= 1+ 2x, x =0. Locate 
its four corners and draw R. Find its area by geometry. 


14 Choose a,b,c,d so that the change x=au+bv, y= 
cu + dv takes the previous R into S, the unit square O <u <I, 
0<v <1. From the stretching factor J = ad — bc find the area 
of R. 

15 The region R has straight sides x =0, x=1, y=0, y= 
2x +3. Choose a,b,¢ so that x =u and y=au+be+cuv 
change R to the unit square S. 


16 A nonlinear term uv was needed in Problem 15. Which 
regions R could change to the square § with a linear x= 
au + bv, y=cu+ dv? 


Draw the xy region R that corresponds in 17-22 to the uv 
square S with corners (0, 0), (1, 0), (0, 1), (1, 1). Locate the 
corners of R and then its sides (like a jigsaw puzzle). 


17 x=2u+v, p=ut2v 
18 x =3u+2v, youtv 
19 x=e74t" yo et? 
20 x=, p=v? — wv? 
2ox=u, y=ol +47) 


22 x=ucosv, y=usin v (only three corners) 


23 In Problems 17 and 19, compute J from equation (8). Then 
find the area of R from [fs|J|du dv. 


24 In 18 and 20, find J = d(x, y)/Au, v) and the area of R. 


25 If R lies between x =0 and x =1 under the graph of y= 
Six)> 6, then x=u, y = vf{u) takes R to the unit square S. 
Locate the corners of R and the point corresponding to 
u=4, v= 1. Compute J to prove what we know: 


area of R=) fix)dx = J} fp J du dv. 


26 From r=./x?+ y? and @=tan™ (y/x), compute dr/dx, 
ér/ay, 00/ax, d0/dy, and the determinant J = dr, A/A{x, y). 
How is this J related to the factor r = Ax, y)/Ar, 6) that enters 
rdr d@? 


27 Example 4 integrated e~** from 0 to oo (answer /n). Also 
B=fte-*dx leads to B? = (je-*'dx fhe “dy. Change this 
double integral over the unit square to r and @— and find 
the limits on r that make exact integration impossible. 


28 Integrate by parts to prove that the standard normal 
distribution p(x) =e" *"?/, /2n has a? =["_ x?p(x}dx = 1. 

29 Find the average distance from a point on a circle to the 
points inside. Suggestion: Let (0,0) be the point and let 
O<r<2acos 0,0<06 <2 be the arcle (radius a), The distance 
is r, So the average distance is F = {f ff 

30 Draw the region R: 0<x <1, 0< y< oo and describe it 


with polar coordinates {limits on r and 6). Integrate 
Sax? + y2)" *dx dy in polar coordinates. 


31 Using polar coordinates, find the volume under z= 
x? + y* above the unit disk x? + y? <1. 


32 The end of Example | stated the moment of inertia 
Jf ya A. Check that integration. 


33 In the square -1 <x <2, —2<y<1, where could you 
distribute a unit mass (with {fp dxdy = 1) to maximize 
{a) ffx’p dA (b) {fy p dA (c) [fr?p dA? 
34 True or false, with a reason: 
(a) If the uv region S corresponds to the xy region R, then 
area of S =area of R. 
(b) ffx dA <ffxdA 
(c) The average value of f(x, y) is [f f(x, yl@A 
(d) f° xe dx =0 
(e} A polar rectangle has the same area as a straight-sided 
region with the same corners. 


35 Find the mass of the tilted square in Example 1 if the 
density is p = xy. 


36 Find the mass of the ring in Example 2 if the density is 
p =x? + y’. This is the same as which moment of inertia with 
which density? 


37 Find the polar moment of inertia {, of the ring in 
Example 2 if the density is p = x? + y”. 


38 Give the following statement an appropriate name: 
Sle fx, y\d@A = f{P) times {area of R), where P is a point in R. 
Which point P makes this correct for f=x and f= y? 


39 Find the xy coordinates of the top point in Figure 14.6a 
and check that it goes to (u, v) = (I, 1). 
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Re § 143 Tipleinteqas 


At this point in the book, I feel I can speak to you directly. You can guess what triple 
integrals are like. Instead of a small interval or a small rectangle, there is a small box. 
Instead of length dx or area dx dy, the box has volume dV = dx dy dz. That is length 
times width times height. The goal is to put small boxes together (by integration). 
The main problem will be to discover the correct limits on x, y, z. 

We could dream up more and more complicated regions in three-dimensional 
space. But I don’t think you can see the method clearly without seeing the region 
clearly. In practice six shapes are the most important: 


box prism cylinder cone tetrahedron sphere. 


The box is easiest and the sphere may be the hardest (but no problem in spherical 
coordinates). Circular cylinders and cones fall in the middle, where xyz coordinates 
are possible but r@z are the best. I start with the box and prism and xyz. 


EXAMPLE 1 By triple integrals find the volume of a box and a prism (Figure 14.12). 
1 3 2 L 3=Sz 2 
fffav= J J J dxdydz and {ffav= § Jf Jf dxdydz 
box ==0 y=0 x=0 prism z=0 y=0 x=0 
The inner integral for both is { dx = 2. Lines in the x direction have length 2, cutting 
through the box and the prism. The middle integrals show the limits on y (since dy 
comes second): 
3 
J 2dy=6 and 


3-32 
J 2dy=6-6z. 
y=0 y=0 
After two integrations these are areas. The first area 6 is for a plane section through 
the box. The second area 6 — 6z is cut through the prism. The shaded rectangle goes 
from y = 0 to y = 3 — 3z—we needed and used the equation y + 3z = 3 for the bound- 
ary of the prism. At this point z is still constant! But the area depends on z, because 
the prism gets thinner going upwards. The base area is 6 — 6z = 6, the top area is 
6—6z=0. 
The outer integral multiplies those areas by dz, to give the volume of slices. They 
are horizontal slices because z came last. Integration adds up the slices to find the 


total volume: 


1 1 
box volume= { 6dz=6 prism volume= [{ (6~—6z)dz= [ 62 - 32? |} = 3. 
z=0 ==0 


The box volume 2-3-1 didn’t need calculus. The prism is half of the box, so its 
volume was sure to be 3—but it is satisfying to see how 6z — 3z? gives the answer. 
Our purpose is to see how a triple integral works. 


area 3/2 
volume 3/2 dx 


Fig. 14.42 Box with sides 2, 3, 1. The prism is half of the box: volume {(6—6z)dz or [3 dx. 


14.3 Triple integrals 


Question Find the prism volume in the order dz dy dx (six orders are possible). 


2 73 PB3-y/3 23 /3~-y 23 
Answer | | | ds dy ax= | | 252 har axe | 34x =3. 
o Ja JO 0 J0 0 


To find those limits on the z integral, follow a line in the z direction. It enters 
the prism at z= 0 and exits at the sloping face y + 3z = 3. That gives the upper limit 
z=(3 — y)/3. It is the height of a thin stick as in Section 14.1. This section writes out 
J dz for the height, but a quicker solution starts at the double integral. 

What is the number # in the last integral? It is the area of a vertical slice, cut by a 
plane x = constant. The outer integral adds up slices. 


[fa y, z) dV is computed from three single integrals | [| [| f dx |dy |dz. 


That step cannot be taken in silence—some basic calculus is involved. The triple 
integral is the limit of ©) f, AV, a sum over small boxes of volume AV. Here f, is any 
value of f(x, y, z) in the ith box. {In the limit, the boxes fit a curved region.) Now take 
those boxes in a certain order. Put them into lines in the x direction and put the lines 
of boxes into planes. The lines lead to the inner x integral, whose answer depends on 
y and z. The y integral combines the lines into planes. Finally the outer integral 
accounts for all planes and all boxes. 

Example 2 is important because it displays more possibilities than a box or prism. 


EXAMPLE 2 Find the volume of a tetrahedron (4-sided pyramid). Locate (x, ¥, 2). 


Solution A tetrahedron has four flat faces, all triangles. The fourth face in 
Figure 14.13 is on the plane x + y+ z= 1. A line in the x direction enters at x =0 
and exits at x= 1—y—z. (The length depends on y and z. The equation of the 
boundary plane gives x.) Then those lines are put into plane slices by the y integral: 


l-z fi-y-=z 1-z 1 
| | ax dy=| (1—y-2\dy=[| y-4y?~zy J) *=40- 2). 
y=0 Jx=0 y=0 

What is this number 3(1— z)?? It is the area at height z. The plane at that height 
slices out a right triangle, whose legs have length 1 — z. The area is correct, but look 
at the limits of integration. If x goes to 1— y — z, why does y ge to | — 2? Reason: 
We are assembling lines, not points. The figure shows a line at every y up to 1 — z. 


z 


y=l-z 
\, end of triangle 


x=l-y-z 
__ end of line 


. 
. 
Bae 
- 


Aig. 14.43 Lines end at plane x+y+z= 1. Triangles end at edge y+z= 1. The average 
height is z= {jz dV/{ff av. 
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Adding the slices gives the volume: J} 4(1 — z)?dz = (A(z — 1)°]} = 4. This agrees 
with 3(base times height), the volume of a pyramid. 

The height 2 of the centroid is “Zyyerage-"” We compute [{[zdV and divide by the 
volume. Each horizontal slice is multiplied by its height z, and the limits of integration 
don’t change: 


1 1-y fl-y-z 1 2(1 -—2z/ 1 
zdV= zdxdydz=| ———~dz=—. 
il} i i i y {, 2 24 


This is quick because z is constant in the x and y integrals. Each triangular slice 
contributes z times its area $(1 — z)* times dz. Then the z integral gives the moment 
1/24. To find the average height, divide 1/24 by the volume: 


> = heigh i _MMfeav_ 1/24 1 
z= height of centroid {(} av 64 


By symmetry x= 4 and y=4. The centroid is the point (7, 4, 3). Compare that with 
(4, 4), the centroid of the standard right triangle. Compare also with 3, the center of 
the unit interval. There must be a five-sided region in four dimensions centered at 
(hed 85): 

For area and volume we meet another pattern. Length of standard interval is 1, 
area of standard triangle is 5, volume of standard tetrahedron is ¢, hypervolume in 
four dimensions must be . The interval reaches the point x = {, the triangle 
reaches the line x + y= 1, the tetrahedron reaches the plane x + y + z= [. The four- 
dimensional region stops at the hyperplane = 1, 


EXAMPLE 3 Find the volume J|[{ dx dy dz inside the unit sphere x? + y? + 2? = 1. 


First question: What are the limits on x? If a needle goes through the sphere in the 
x direction, where does it enter and leave? Moving in the x direction, the numbers y 
and z stay constant. The inner integral deals only with x. The smailest and largest x 
are at the boundary where x? + y? + z?= 1. This equation does the work—we solve 
it for x. Look at the limits on the x integral: 


22 vy yiost 29 
volume of sphere = [ {jf dx dy dz=[[ 2/1—y?-z?dydz (1) 
49° ~vTRyF a7 ky 

The limits on y are ~ ./1 — z* and + ./1 — 2”. You can use algebra on the boundary 
equation x” + y? + z? = I. But notice that x is gone! We want the smallest and largest 
y, for each z. It helps very much to draw the plane at height z, slicing through the 
sphere in Figure 14.14. The slice is a circle of radius r= ./1— 2?. So the area is 
ar’, which must come from the y integral: 


{ 2/1—y? ~ 2? dy =area of slice = n{t — 2°). (2) 


I admit that I didn’t integrate. Is it cheating to use the formula zr?? I don’t think so. 
Mathematics is hard enough, and we don't have to work blindfolded. The goal is 
understanding. and if you know the area then use it. Of course the integral of 
,/1— y* — z? can be done if necessary —use Section 7.2. 


The triple integral is down to a single integral. We went from one needle to a cirele 


of ncedles and now to a sphere of needles. The volume is a sum of slices of area 
n{i — z?), The South Pole is at z= —1, the North Pole is at z= +1, and the integral 
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is the volume 47/3 inside the unit sphere: 


1 1 
Biel sata atenl oe et 
[_ ma z*)dz (= 37 YY, 37 ( 5) 3 (3) 


Question 1 A cone also has circular slices. How is the last integral changed? 
Answer The slices of a cone have radius 1 — z. Integrate (1 — z)? not ./1—z* 


Question 2. How does this compare with a circular cylinder (height |, radius 1)? 
Answer Now all slices have radius 1. Above z= 0, a cylinder has volume m and a 
half-sphere has volume $x and a cone has volume 472. 

For solids with equal surface area, the sphere has largest volume. 


Question 3 What is the average height Z in the cone and half-sphere and cylinder? 


Answer Z 


- J 2(slice area)dz ih 
J (slice area)dz 4 


and 


: 
7 


Col|w 
bat) 
=] 
a. 


dx=adu dy=bdv dz=cdw 


Fig. 14.44 | dx =length of needle, {{ dx dy =area of slice. Ellipsoid is a stretched sphere. 


EXAMPLE 4 Find the volume f{{ dx dy dz inside the ellipsoid x?/a? + y?/b? + z7/c? = 1. 


The limits on x are now +,/1 — y?/b? — z?/c?. The algebra looks terrible. The geom- 
etry is better—all slices are ellipses. A change of variable is absolutely the best. 

Introduce u=x/a and v=y/b and w=2z/c. Then the outer boundary becomes 
u? + yp? + w? = 1. In these new variables the shape is a sphere. The triple integral for 
a sphere is {{f du dv dw = 4x/3. But what volume dV in xyz space corresponds to a 
small box with sides du and dv and dw? 


Every uvw box comes from an xyz box. The box is stretched with no bending or 
twisting. Since u is x/a, the length dx is a du. Similarly dy = b dv and dz =c dw. The 
volume of the xyz box (Figure 14.14) is dx dy dz = (abc) du dv dw. The stretching factor 
J = abc is a constant, and the volume of the ellipsoid is 


bad limits better limits An 
{Vf dx dy dz= Vf (abc) du dv dw = — abe. (4) 
ellipsoid sphere 3 


You realize that this is special—other volumes are much more complicated. The 
sphere and ellipsoid are curved, but the small xyz boxes are straight. The next section 
introduces spherical coordinates, and we can finally write “‘good limits.” But then we 
need a different J. 
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14.3 EXERCISES 


Read-through questions 


Six important solid shapes are _@ 
fff dx dy dz adds the volume __& 
putation it becomes __d__ single integrals. The inner integral 
dx is the _e _ of a line through the solid. The variables 

t and _g are held constant. The double integra) 
ffdx dy is the _h of slice, with _1 held constant. 
Then the z integral adds up the volumes of __J 


The integral 
ofsmall_¢ _. Forcom- 


If the solid region V is bounded by the planes x =0, y =0, 
z=0, and x + 2y+3z=14, the limits on the inner x integral 
are __k__. The limits on y are__!__. The limits on z are 

m__. Inthe new variables u = x,» = 2y, w = 32, the equation 
of the outer boundary is____. The volume of the tetrahe- 
dron in uew space is__9 _. From dx = dy and dy = dv/2 and 
dz=_p_, the volume of an xyz box is dxdydz= 

q__ du dv dw. So the volume of Vis __ + 


To find the average height Zin Vwecompute__ 5 /_ + 
To find the total mass in V if the density is p = e? we compute 
the integra] _¥ __. To find the average density we compute 
v__/__w _. In the order [ff dz dx dy the limits on the inner 
integral can depend on__x__. The limits on the middle integ- 
ral can depend on __y¥_. The outer limits for the ellipsoid 
x74 2y? 4+ 327<8are 2 


1 For the solid region 0< x <y<z<{, find the limits in 
{{f dx dy dz and compute the volume. 


2 Reverse the order in Problem | to {ff dz dy dx and find 
the kimits of integration. The four faces of this tetrahedron 
are the planes x = 0 and y= x and 


3 This tetrahedron and five others like it fill the unit cube. 
Change the inequalities in Problem 1 to describe the other five. 


4 Find the centroid (x, §, Z) in Problem 1. 
Find the limits of integration in {{j dx dy dz and the yolume of 
solids 5-16, Draw a very rough picture. 

§ A cube with sides of length 2, centered at (0, 0, 0}. 

6 Half of that cube, the box above the xy plane. 

7 Part of the same cube, the prism above the plane z = y. 

8 Part of the same cube, above z = y and z=0. 

9 Part of the same cube, above z = x and below z = y. 


10 Part of the same cube, where x < y<z. What shape is 
this? 


11 The tetrahedron bounded by planes x =0, y=0, z=0, 
and x + y+ 2z=2. 


12 The tetrahedron with corners (0, 0, 0), (2, 0, 0}, (0, 4, 0), 
(0, 0, 4). First find the plane through the last three corners. 


13 The part of the tetrahedron in Problem 11 below z=4, 


14 The tetrahedron in Problem 12 with its top sliced off by 
the plane z= 1. 


15 The volume above z=0 below the cone ./x?+ y?=1—z. 
*16 The tetrahedron in Problem 12, after it falls across the 

x axis onto the xy plane. 

In 17-20 find the limits in j{f dx dy dz or [{{ dz dy dx. Compute 

the volume. 

17 A circular cylinder with height 6 and base x? + y?< 1. 


18 The part of that cylinder below the plane z = x. Watch the 
base. Draw a picture. 


19 The volume shared by the cube (Problem 5) and cylinder. 
20 The same cylinder lying along the x axis. 


21 A cube is inscribed in a sphere: radius 1, both centers at 
(0, 0, 0}. What is the volume of the cube? 


22 Find the volume and the centroid of the region bounded 
by x=0, y=0, z=0, and x/a+ p/b+z/c=1. 


23 Find the volume 
0<2<4-x7_-y’*. 


and centroid of the solid 


24 Based on the text, what is the volume inside 
x? 4 4y? +92? = 16? What is the “hypervolume” of the 
4-dimensional pyramid that stops atx+y+z+w= I? 


25 Find the partial derivatives Gf/dx, di/@y, 2° 1/y dz of 


26 Define the average value of f{x, y, z) in a solid V. 


27 Find the moment of inertia [ff (? dV of the cube |x| <1, 
yl < f, [z| << t when / is the distance to 
(a) the x axis (b) the edge y = 2 = ! (c) the diagonal x = y=z. 


28 Add upper limits to produce the volume of a unit cube 
from small cubes: V= ¥ ¥ ¥ (Ax =1. 
i=1 j=l k=1 


Bfax 2/Ax J 
*29 Find the limit as Ax — 0 of ¥ >S (Ay?. 
i=] j=1 k=1 


30 The midpoint rule for an integral over the unit cube 
chooses the center value f(4, 3, 4). Which functions f= x™y’z? 
are integrated correctly? 


31 The trapezoidal rule estimates [345 {¢ SOs y, 2) dx dy dz 
as } times the sum of f(x, y, z) at 8 corners, This correctly 
integrates x™y"2" for which m, a, p? 


32 Propose a 27-point “Simpson’s Rule” for integration over 
a cube. If many smal]! cubes fill a large box, why are there 
only 8 new points per cube? 
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= 14.4 Cylindrical and Spherical Coordinates [= 


Cylindrical coordinates are good for describing solids that are symmetric around an 
axis. The solid is three-dimensional, so there are three coordinates r, 0, z: 


r: out from the axis 0: around the axis z: along the axis. 


This is a mixture of polar coordinates r@ in a plane, plus z upward. You will not find 
r0z difficult to work with. Start with a cylinder centered on the z axis: 


solid cylinder: 0<r<1_ flat bottom and top:0<z<3_ half-cylinder: 0<0<1 


Integration over this half-cylinder is [3 {5 Jj? dr dO dz. These limits on r, 0, z are 
especially simple. Two other axially symmetric solids are almost as convenient: 


cone: integrate tor+z=1 sphere: integrate to r* + z? = R? 


I would not use cylindrical coordinates for a box. Or a tetrahedron. 

The integral needs one thing more—the volume dV. The movements dr and d@ 
and dz give a “curved box” in xyz space, drawn in Figure 14.15c. The base is a polar 
rectangle, with area r dr d0. The new part is the height dz. The volume of the curved 
box is r dr dO dz. Then r goes in the blank space in the triple integral—the stretching 
factor is J =r. There are six orders of integration (we give two): 


volume = | | | r dr dO =| | | r dr dz do. (1) 
2J0 uF O@JzdJr 


(r, 8, 2) 


8=n/2 
(y axis) 


6 =0 (¥ axis) 


Fig. 14.45 Cylindrical coordinates for a point and a half-cylinder. Small volume r dr d@ dz. 


EXAMPLE 4 (Volume of the half-cylinder). The integral of r dr from 0 to 1 is $. The 
0 integral is x and the z integral is 3. The volume is 32/2. 


EXAMPLE 2 The surface r = 1 — z encloses the cone in Figure 14.16. Find its volume. 


First solution Since r goes out to 1 — z, the integral of r dr is $(1 — z)?. The 0 integral 
is 27 (a full rotation). Stop there for a moment. 

We have reached [J r dr d = $(1 — z)? 2z. This is the area of a slice at height z. The 
slice is a circle, its radius is 1 — z, its area is x(1 — z)?. The z integral adds those slices 
to give 2/3. That is correct, but it is not the only way to compute the volume. 
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Second solution Do the z and @ integrals first. Since z goes up to 1 — r, and @ goes 
around to 2x, those integrals produce {fr dz d@ = r(1 — r)2x. Stop again—this must 
be the area of something. 

After the z and @ integrals we have a shell at radius r. The height is 1 — r (the outer 
shells are shorter). This height times 2zr gives the area around the shell. The choice 
betweeen shells and slices is exactly as in Chapter 8. Different orders of integration 
give different ways to cut up the solid. 

The volume of the shell is area times thickness dr. The volume of the complete 
cone is the integral of sheil volumes: i Ml — r)2n dr = 7/3. 


Third solution Do the r and z integrals first: [[rdrdz=%. Then the @ integral is 
J & d0, which gives % times 2z. This is the volume /3—but what is % d0? 

The third cone is cut into wedges. The volume of a wedge is 4 d@. It is quite common 
to do the @ integral last, especially when it just multiplies by 2z. It is not so common 
to think of wedges. 


Question Is the volume 4 d@ equal to an area % times a thickness d@? 
Answer No! The triangle in the third cone has area 4 not 4. Thickness is never d@. 


area Zur(1—r) 


Fig. 44.46 A cone cut three ways: slice at height z, shell at radius r, wedge at angle 0. 


This cone is typical of a sofid of revolution. The axis is in the z direction. The @ 
integral yields 2x, whether it comes first, second, or third. The r integral goes out to 
a radius f(z), which is 1 for the cylinder and 1 — z for the cone. The integral {f r dr d@ 
is n(f(z))’ = area of circular slice. This leaves the z integral f n{f(z))°dz. That is our 
old volume formula { x(/{x})’dx from Chapter 8, where the slices were cut through 
the x axis. 


EXAMPLE 3 The moment of inertia around the z axis is {ff redr d@ dz. The extra r? 
is (distance to axis)’. For the cone this triple integral is x/10. 


EXAMPLE 4 The moment around the z axis is {{{ r? dr d@ dz. For the cone this is 7/6. 
The average distance F is (moment)/(volume) = {1/6)/(1/3) = 4. 


EXAMPLE 5 A sphere of radius R has the boundary r? +z? =R?, in cylindrical 
coordinates. The outer limit on the r integral is _/R* — z*. That is not acceptable in 
difficult problems. To avoid it we now change to coordinates that are natural for a 
sphere. 
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SPHERICAL COORDINATES 


The Earth is a solid sphere (or near enough). On its surface we use two coordinates— 
latitude and longitude. To dig inward or fly outward, there is a third coordinate— 
the distance p from the center. This Greek letter rho replaces r to avoid confusion 
with cylindrical coordinates. Where r is measured from the z axis, g is measured from 
the origin. Thus r? = x? + y? and p? = x? + y? 4 z?, 

The angle @ is the same as before. It goes from 0 to 2z. It is the longitude, which 
increases as you travel east around the Equator. 

The angle ¢ is new. It equals 0 at the North Pole and 2 (not 22) at the South Pole. 
It is the polar angle, measured down from the z axis. The Equator has a latitude of 
0 but a polar angle of 2/2 (halfway down). Here are some typical shapes: 


solid sphere {or ball): O<p<R surface of sphere: p =R 
upper half-sphere: 0 < @ < 2/2 eastern half-sphere:0 <@ <x 


North Pole_ $= 0 


(p, >, 8) 


y 


“Oya psin¢cos 4 


y=psingsin® 


South Pole ¢=7 
Fig. 44.47 Spherical coordinates pp@. The volume dV = p? sin ¢ dp d¢ dé of a spherical box. 


The angle ¢ is constant on a cone from the origin. It cuts the surface in a circle 
(Figure 14.17b), but not a great circle. The angle @ is constant along a haif-circle 
from pole to pole. The distance p is constant on each inner sphere, starting at the 
center p = 0 and moving out to p=R. 

In spherical coordinates the volume integral is |{{ p? sin @ dp d¢ dé. To explain that 
surprising factor J = p? sin @, start with x = r cos @ and y =r sin 6. In spherical coor- 
dinates r is psin gd and z is pcos @—see the triangle in the figure. So substitute 
p sin ¢ for r: 

x=psin @ cos 8, y=p sin ¢@ sin 8, z= p cos ¢. {1} 


Remember those two steps, pf@ to réz to xyz. We check that x? + y? + z? = p?: 
p*(sin? cos*@ + sin? sin?@ + cos?) = p*(sin?d + cos?) = p?. 
The volume integral is explained by Figure 14.17c. That shows a “spherical box” 


with right angles and curved edges. Two edges are dg and pd@. The third edge is 
horizontal. The usual rd? becomes p sin ¢ d@. Multiplying those lengths gives dV. 


The volume of the box is dV = p” sin @ dp dé d@. This is a distance cubed, from p7dp. 
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EXAMPLE 6 A solid ball of radius R has known volume V = $2R°. Notice the limits: 


( { {’ p’sin @ dp dd dO = [3 pt [- ne Pa a = 4 R\(2)(2m). 


Question What is the volume above the cone in Figure 14.17? 
Answer The ¢ integral stops at [—cos $]/* =4. The volume is (4R7)(3){2z). 


EXAMPLE 7 The surface area of a sphere is A = 42R?. Forget the p integral: 


a=" if R?sin do d6 = R*[ —cos ¢ |S | 8 [5° = R*(2)(2n). 


After those examples from geometry, here is the real thing from science. | want to 
compute one of the most important triple integrais in physics—“the gravitational 
attraction of a solid sphere.” For some reason Isaac Newton had trouble with this 
integral, He refused to publish his masterpiece on astronomy until he had solved it. 
I think he didn’t use spherical coordinates—and the intcgral is not casy even now. 

The answer that Newton finally found is beautiful. The sphere acts as if all its mass 
were concentrated at the center, At an outside point (0,0, D), the force of gravity is 
proportional to t/D?. The force from a uniform solid sphere equals the force from a 
point mass, at every outside point P. That is exactly what Newton wanted and 
needed, to explain the solar system and to prove Kepler’s faws. 

Here is the difficulty. Some parts of the sphere are closer than D, some parts are 
farther away. The actual distance q, from the outside point P to a typical inside point, 
is shown in Figure 14.18. The average distance @ to all points in the sphere is not D. 
But what Newton needed was a different average, and by good luck or some divine 
calculus it works perfectly: The average of 1/q is 1/D. This gives the potential energy: 


potential at point P= [{f - Ly = Sele ISDS: 


sphere 4 D @) 


A small volume dV at the distance g contributes dV/q to the potential (Section 8.6, 
with density {). The integral adds the contributions from the whole sphere. Equation 
(2) says that the potential at r= D is not changed when the sphere is squeezed to 
the center. The potential equals the whole volume divided by the single distance D. 

Important point: The average of i/g is 1/D and not 1/9. The average of 3 and j is 
not 3. Smaller point: I wrote “sphere” where I should have written ‘ball.’ The 
sphere is solid: O< p< R,0<G <n, 0<4< 2z. 


What about the force? For the small volume it is proportional to dV/q? (this is the 
inverse square law). But force is a vector, pulling the outside point toward dV—not 
toward the center of the sphere. The figure shows the geometry and the symmetry. 
We want the z component of the force. (By symmetry the overall x and y components 
are zero.) The angle between the force vector and the z axis is 2, so for the z component 
we multiply by cos x. The total force comes from the integral that Newton discovered: 

force at point P= {{] COS 2 Ay, volume of sphere 


2 
sphere q’ D 


(3) 


1 will compute the integral (2) and leave you the privilege of solving (3). | mean that 
word seriously. If you have come this far, you deserve the pleasure of doing what at 
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outside @(0, 0, D) 


Fig. 14.48 Distance ¢ from outside point to inside point. Distances g and Q to surface. 


one time only Isaac Newton could do. Problem 26 offers a suggestion (just the law 
of cosines) but the integral is yours. 

The law of cosines also helps with (2). For the triangle in the figure it gives q? = 
D* —2pDcos + p*. Call this whole quantity u. We do the surface integral first 
(dd and dé with p fixed). Then g* =u and g= Ju and du=2pD sin ¢ dd: 


>* (* 5? sin @ dp dd _ (35 5-|2 ie 4 
J i q 2pD Ju D vi b-0 @) 


2x came from the @ integral. The integral of du/./u is Ju. Since cos = — | at the 
upper limit, u is D? + 2pD+ p”. The square root of u is D+ p. At the lower limit 
cos ¢= +1 and u= D? — 2pD + p’. This is another perfect square—its square root 
is D — gp. The surface integral (4} with fixed p is 


dA 2nxp Anp? 
[[4-*[0+0-o-n]- 9. (5) 


Last comes the p integral: [* 4np?dp/D=4nxR?/D. This proves formula (2): 
potential equals volume of the sphere divided by D. 


Note t Physicists are also happy about equation (5). The average of 1/q is 1/D not 
only over the solid sphere but over each spherical shell of area 4x7. The sheils can 
have different densities, as they do in the Earth, and still Newton is correct. This also 
applies to the force integral (3)—each separate shell acts as if its mass were concen- 
trated at the center. Then the final p integral yields this property for the solid sphere. 


Note 2 Physicists also know that force is minus the derivative of potential. The 
derivative of (2) with respect to D produces the force integra! (3). Problem 27 explains 
this shortcut to equation (3). 


EXAMPLE & Everywhere inside a hollow sphere the force of gravity is zero. 


When D is smaller than p, the Jower limit ./u in the integral (4) changes from D— p 
to p— D. That way the square root stays positive. This changes the answer in (5) to 
4zp*/p, so the potential no longer depends on D. The potential is constant inside the 
hollow shell. Since the force comes from its derivative, the force is zero. 

A more intuitive proof is in the second figure. The infinitesimal areas on the surface 
are proportional to q* and Q?. But the distances to those areas are q and Q, so the 
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forces involve 1/q? and 1/Q? (the inverse square law). Therefore the two areas exert 
equal and opposite forces on the inside point, and they cancel each other. The total 
force from the shell is zero. 

I believe this zero integral is the reason that the inside of a car is safe from lightning. 
Of course a car is not a sphere. But electric charge distributes itself to keep the surface 
at constant potential. The potential stays constant inside—therefore no force. The 
tires help to prevent conduction of current (and electrocution of driver). 

P.S. Don’t just step out of the car. Let a metal chain conduct the charge to the 
ground. Otherwise you could be the conductor. 


CHANGE OF COORDINATES——STRETCHING FACTOR J 


Once more we look to calculus for a formula. We need the volume of a small curved 
box in any uvw coordinate system. The r@z box and the p¢é@ box have right angles, 
and their volumes were read off from the geometry (stretching factors J =r and J = 
p* sin in Figures 14.15 and 14.17). Now we change from xyz to other coordinates 
upw—which are chosen to fit the problem. 

Going from xy to up, the area dA = J du dv was a 2 by 2 determinant. In three 
dimensions the determinant is 3 by 3. The matrix is always the “Jacobian matrix,” 
containing first derivatives. There were four derivatives from xy to uv, now there are 
nine from xyz to uvw. 


44C Suppose x, y,z are given in terms of u,v, w. Then a small box in wow 
space (sides du, dv, dw) comes from a volume dV = J du dv dw in xyz space: 


ox/du @xf/év ax/dw 
J=|dy/éu é@y/dv édy/éw]| = stretching factor 
Ozjdu dzjdv dz/aw 


Ax, y, Zz) 
Au, v, w) (6) 


The volume integral [ff dx dy dz becomes {{{|J| du dv dw, with limits on uvw. 


Remember that a 3 by 3 determinant is the sum of six terms (Section 11.5). One term 
in J is (@x/du}(dy/dv){2z/dw), along the main diagonal. This comes from pure stretch- 
ing, and the other five terms allow for rotation. The best way to exhibit the formula 
is for spherical coordinates—where the nine derivatives are easy but the determinant 
is not: 


EXAMPLE 9 Find the factor J for x= p sin d cos 8, y= psin ¢ sin 8, z= pcos ¢. 


sing@cos@ pcecos@cos@ —psin @ sin 


_ Ox, y, 2) 


J= = |sin @ sin @ cos @ sin @ sin @ cos @]. 
Hp, @, 8) ¢ pcos @ psin @ 


cos ¢ —p sin 0 


The determinant has six terms, but two are zero—because of the zero in the corner. 
The other four terms are p’sin@cos*@sin?@ and p’sin@cos*dcos’é and 
p’sin*¢ sin7@ and p’sin*¢ cos?@. Add the first two (note sin?@ + cos?#) and separately 
add the second two. Then add the sums to reach J = p’sin @. 

Geometry already gave this answer. For most upw vanables, use the determinant. 
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EXERCISES 


Read-through questions 


The three _a _ coordinates are r8z, The point at x= y= 
z=1 has r=_b_, @=_c_, z=_d_., The volume 
integral is {{f__e _. The solid region 1 <r <2, O<@<2n, 
O<z<disa_¢t _. Its volume is__g__. From the r and @ 
integrals the area ofa _h equals _t__. From the z and 
integrals the area ofa) —sequals_k__. In r@z coordt- 


nates the shapes of __| are convenient, while _m__ are 
not. 


The three__n__ coordinates are pp?, The point at x = y= 


z=lhasp=_o ,¢=_p_,9=_4 _. The angle ¢ is 
measured from __t__. 8 is measured from _s__. 9 is the 
distance to__t__, where r was the distance to__u__. If p¢é 
are known then x=_v_, p=_w_, z=_x_. The 


stretching factor Jisa3by3_y ,andvolumeis f{f_2_. 


The solid region] <p <2,0<¢<2,0S60< 2misa_a_. 
Its volume is__B__. From the ¢ and @ integrals the area of 
a_C__ at radius p equals ___9 __. Newton discovered that 
the outside gravitational attraction ofa __E _ is the same as 


for an equai mass jocated at _ fF 


Convert the xyz coordinates in 1-4 to r0z and po. 
1 (D, 0, 0) 2 (0, —D, 0) 
3 (0, 0, D)(watch 6) 4 (3, 4, 5) 


Convert the spherical coordinates in 5—7 to xyz and rz. 
5 p=4, d6=2/4, 0 = — 2/4 
6 p=2, dG=n/3, 0B=n/6 
7 p=1,¢=7, 0 = anything. 
§ Where does x =r and y=0? 


9 Find the polar angle ¢ for the point with cylindrical 
coordinates r@z. 


10 What are x{t), (2), 2(t} on the great circle from p=1, 
o@ =n/2, 0=0 with speed | to p=t, P6=x/4, 0= 7/2? 


From the limits of integration describe each region in 11-20 
and find its volume. The inner integral has the inner limits. 


tn inZ fst-w 
11 | | | r dz dr dO 
6=0 Jr=0 z=F 


2n 1 z-3 

13 | | | r dr dz d@ 
@=0 Jzr=0 Jr=0 

i | { r dO dr dz 
o Jo Jo 


Riz Pri2 1 
is | | | p* sin @ dp dp dd 
oO tv) 


t4) 


2n fxia 2 
16 { { { p’sin @ dp dp dO 
0 +. secp 


n fx fsing 
17 | | | p’sin @ dp dé dO 
ago 


2x fxd (3 
18 ( { { p*sin ¢ dp dé d0 
0 1 


9 | | | p’sin ¢ dp dé dé 
o Jo Jo 


1 1 1 
20 { { { p’sin @ dp dp dO 
14] 9 a 


21 Example 5 gave the volume integral for a sphere in rOz 
coordinates. What is the area of the circular slice at height z? 
What is the area of the cylindrical shell at radius r? Integrate 
over slices (dz) and over shells (dr) to reach 42R?/3. 


22 Describe the solid with O<p<t-—cos@ and find its 
volume. 


23 A cylindrical tree has radius a. A saw cuts horizontally, 
ending halfway in at the x axis. Then it cuts on a sloping 
plane (angle x with the horizonta!}, also ending at the x axis. 
What is the volume of the wedge that fal!s out? 


24 Find the mass of a planet of radius R, if its density at each 
radius p is db=(p + 1)/p. Notice the infinite density at the 
center, but finite mass M=[[f{ddV. Here p is radius, not 
density. 


25 For the cone out to r= I —z, the average distance from 
the z axis is F = 4. For the triangle out to r = | — z the average 
is 7 = 4. How can they be different when rotating the triangle 
produces the cone? 


Problems 26-32, on the attraction of a sphere, use Figure 14.18 
and the law of cosines g? = D? — 29D cos + p? =u, 


26 Newton's achievement Show that {j[(cos 2)dV/q? equals 
volume/D?. One hint only: Find cos z from a second law of 
cosines p* = D?—2qgDcos2+q?. The @ integral should 
involve 1/q and 1/q?. Equation (2) integrates 1/g, leaving 
SffaViq? still to do. 


27 Compute @q/dD in the first cosine law and show from 
Figure 14.18 that it equals cosa. Then the derivative of 
equation (2) with respect to D is a shortcut to Newton’s 
equation (3). 


28 The lines of length D and q meet at the angle «. Move the 
meeting point up by AD, Explain why the other line stretches 
by Ag = AD cos 2. So 0q/8D = cos z as before. 
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29 Show that the average distance is G=4R/3, from the 
North Pole (D = R) to points on the Earth’s surface (p = R). 
To compute: g = [[ qR*sin ¢ do d0/(area 4nR7). Use the same 
substitution wu. 


3 Show as in Problem 29 that the average distance is 
G=D+4+4p7/D, from the outside point (0, 0, D) to points on 
the shell of radius p. Then integrate [{{qdV and divide by 
4nR3/3 to find q for the solid sphere. 


31 in Figure 14.18b, it is not truc that the areas on the surface 
are exactly proportional to g? and Q?. Why not? What hap- 
pens to the second proof in Example 8? 


32 For two solid spheres attracting each other (sun and 
planet), can we concentrate both spheres into point masses at 
their centers? 


*33 Compute [[fcos ad¥/q°* to find the force of gravity at 
(0, 0, D) from a cylinder x? + y? <a”, 0<z<h. Show froma 
figure why q? =r? + (D —z}* and cosa =(D — z}/q. 


34 A finear change of variables has x =au+by+ew, y= 
du +ev+fw, and z=gu + hv + iw. Write down the six terms 
in the determinant J. Three terms have minus signs, 


35 A pure stretching has x = au, y = bv, and z = cw. Find the 
3 by 3 matrix and its determinant J. What is special about 
the xyz box in this case? 


36 (a) The matrix in Example 9 has three columns. Find the 
lengths of those three vectors (sum of squares, then square 
root). Compare with the edges of the box in Figure 14,17. 
(b) Take the dot product of every column in J with every 
other column. Zero dot products mean right angles in the 
box, So J is the product of the column lengths. 


37 Find the stretching factor J for cylindrical coordinates 
from the matrix of first derivatives. 


38 Follow Problem 36 for cylindrical coordinates—find the 
length of each column in J and compare with the box in 
Figure 14.15. 


39 Find the moment of inertia around the z axis of a spherical 
shelt (radius p, density 1). The distance from the axis to a 
point on the shell is r= . Substitute for r to find 


Hp) = (2" [Er p?sin } do dO. 


Divide by mr? (which is 4xp*} to compute the number J for 
a hollow ball in the rolling experiment of Section 8.5. 


40 The moment of inertia of a solid sphere (radius R, 
density 1) adds up the hollow spheres of Problem 39: 
1=[F i(p\dp = . Divide by mR? (which is ¢2R5) to 
find / in the rolling experiment. A solid ball rolls faster than 
a hollow ball because : 


4l Inside the Earth, the force of gravity is proportional to 
the distance p from the center. Reason: The inner ball of 
radius p has mass proportional to (assume constant 
density). The force is proportional to that mass divided by 

. The rest of the Earth (sphere with hole) exerts no 
force because 


42 Dig a tunnel through the center to Australia. Drop a ball 
in the tunnel at y=R; Australia is y= —R. The force of 
gravity is —cy by Problem 41. Newton’s law is my" = —cy. 
What does the ball do when it reaches Australia? 
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CHAPTER 15 


Vector Calculus 


Chapter 14 introduced double and triple integrals. We went from J dx to [f dx dy and 
{ff dx dy dz, All those integrals add up small pieces, and the limit gives area or volume 
or mass. What could be more natural than that? I regret to say, after the success of 
those multiple integrals, that something is missing. It is even more regrettable that 

_ we didn’t notice it. The missing piece is nothing less than the Fundamental Theorem 
of Calculus. 

The double integral ff dx dy equals the area. To compute it, we did not use an 
antiderivative of 1. At least not consciously. The method was almost trial and error, 
and the hard part was to find the limits of integration. This chapter goes deeper, to 
show how the step from a double integral to a single integral is really a new form of 
the Fundamental Theorem—-when it is done right. 


Two new ideas are needed early, one pleasant and one not. You will like vector 
fields. You may not think so highly of fine integrals. Those are ordinary single integrals 
like { (x)dx, but they go along curves instead of straight lines. The nice step dx 
becomes the confusing step ds. Where {dx equals the length of the interval, { ds is 
the length of the curve. The point is that regions are enclosed by curves, and we have 
to integrate along them. The Fundamental Theorem in its two-dimensional form 
(Green’s Theorem) connects a double integral over the region to a single integral along 
its boundary curve. 

The great applications are in science and engineering, where vector fields are so 
natural, But there are changes in the language. Instead of an antiderivative, we speak 
about a potential function. Instead of the derivative, we take the “divergence” and 
“curl.” Instead of area, we compute flux and circulation and work, Examples come 
first. 


ee 15.1 Vector Fields 


For an ordinary scalar function, the input is a number x and the output is a number 
f(x). For a vector field (or vector function), the input is a point (x, y) and the output 
is a two-dimensional vector F(x, y). There is a “field” of vectors, one at every point. 549 
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In three dimensions the input point is (x, y,z) and the output vector F has three 
components. 


DEFINITION Let R be a region in the xy plane. A vector field F assigns to every point 
(x, y) in R a vector F(x, y) with two components: 


F(x, y} = M(x, yi + Nx, yj. (1} 
This plane vector field involves two functions of two variables. They are the compo- 
nents M and N, which vary from point to point. A vector has fixed components, a 
vector fieid has varying components. 
A three-dimensional vector field has components M(x, y,z) and N(x, y,z) and 
P(x, y, z}. Then the vectors are F = Mi+ Nj+ Pk. 


EXAMPLE 4. The position vector at (x, y) is R= xi+ yj. Its components are M =x 
and N=y. The vectors grow larger as we leave the origin (Figure 15.la). Their 
direction is outward and their length is |R| = ./x? + y? =r. The vector R is boldface, 
the number r is lightface. 


EXAMPLE 2. The vector field R/r consists of unit vectors u,, pointing outward. We 
diyide R = xi+ }j by its length, at every point except the origin. The components 


of R/r are M =x/r and N =y/r. Figure 15.1 shows a third field R/r*, whose length 
is Ir. 


Fig. 45.4 The vector fields R and R:r and R;r? are radial. Lengths r and 1 and |/r. 


EXAMPLE 3. The spin field or rotation field or turning field goes around the origin 


instead of away from it. The field is S. Its components are M = — y and N=x: 
S = — yi + xj also has length |S| = \/(—y)? +. x? =r. Q) 


S is perpendicular to R—their dot product is zero: S*R =(— y}(x) + (x}(y) =0. The 
spin fields S/r and S/r* have lengths | and 1/r: 


Pe 
r 


= 
== 


x 
—-i+-j has “5 
r r? ? 


Fig. 45.2 The spin fields § and S/r and S/r? go around the origin. Lengths r and | and 1:r. 


15.1 Vector Flelde 


EXAMPLE 4 A gradient field starts with an ordinary function /(x, y). The components 
M and N are the partial derivatives af/@x and éf/éy. Then the field F is the gradient 
of f: 

F = grad f=Vf= efféxit Oey i. (3) 


This vector field grad f is everywhere perpendicular to the level curves f(x, y}=c. The 
length |grad f| tells how fast fis changing (in the direction it changes fastest). Invent 
a function like f=x?y, and you immediately have its gradient field F = 2xyi + x°j. 
To repeat, M is df/éx and N is éf/ay. 


For every vector field you should ask two questions: Is it a gradient field? If so, 
what is f? Here are answers for the radial fields and spin fields: 


45A The radial fields R and R/r and R/r? are all gradient fields. 


The spin fields § and S/r are not gradients of any f(x, y). 
The spin field S/r? is the gradient of the polar angle 6 = tan™ '(y/x). 


The derivatives of f = 4(x? + y*) are x and y. Thus R is a gradient field. The gradient 
of f=r is the unit vector R/r pointing outwards. Both fields are perpendicular to 
circles around the origin. Those are the level curves of f=4r? and f=r. 


Question Is every R/r’ a gradient field? 
Answer Yes, But among the spin fields, the only gradient is $/r?. 


A major goal of this chapter is to recognize gradient fields by a simple test. The 
rejection of S and S/r will be interesting. For some reason —yi+ xj is rejected and 
yi+xj is accepted. (It is the gradient of .) The acceptance of S/r? as the 
gradient of f= @ contains a surprise at the origin (Section 15.3). 

Gradient fields are called conservative. The function f is the potential function. 
These words, and the next examples, come from physics and engineering. 


EXAMPLE 5 The velocity field is V and the flow field is pV. 


Suppose fluid moves steadily down a pipe. Or a river lows smoothly (no waterfall). 
Or the air circulates in a fixed pattern. The velocity can be different at different points, 
but there is no change with time, The velocity vector V gives the direction of flow 
and speed of flow at every point. 

In reality the velocity field is V(x, y, z}, with three components M, N, P. Those are 
the velocities »;, 2, v3 in the x, y,z directions. The speed |¥V| is the length: |¥|* = 
of +03 +03. In a “plane flow” the k component is zero, and the velocity field is 
byitvyj= Mit Nj. 


gravity 


gravity . 


MS 


F=-R/r3 
Fig. 15.3. A steady velocity field V and two force fields F. 
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For a compact disc or a turning wheel, V is a spin field (Y= wS, w= angular 
velocity). A tornado might be closer to ¥ = S/r? (except for a dead spot at the center). 
An explosion could have V = R/r?. A quieter example is flow in and out of a lake 
with steady rain as a source term. 

The flow field p¥ is the density p times the velocity field. While V gives the rate of 
movement, p¥ gives the rate of movement of mass. A greater density means a greater 
rate |p¥| of “mass transport.” It is like the number of passengers on a bus times the 
speed of the bus. 


EXAMPLE 6 Force fields from gravity: F is downward in the classroom, F is radial 
in space. 


When gravity pulis downward, it has only one nonzero component: F = — mgk. This 
assumes that vectors to the center of the Earth are parallel—almost true in a class- 
room, Then F is the gradient of —mgz (note df/éz = — mg). 

In physics the usual potential is not — mgz but +mgz. The force field is minus grad f 
also in electrical engineering. Electrons fiow from high potential to low potential. 
The mathematics is the same, but the sign is reversed. 

In space, the force is radial inwards: F = — mMGR/r*. Its magnitude is propor- 
tional to i/r? (Newton’s inverse square law). The masses are m and M, and the 
gravitational constant is G = 6.672 x 10” 1! —with distance in meters, mass in kilo- 
grams, and time in seconds. The dimensions of G are (force)(distance)*/(mass)*. This 
is different from the acceleration g = 9.8m/sec”, which already accounts for the mass 
and radius of the Earth. 

Like all radial fields, gravity is a gradient field. It comes from a potential f: 

mMG of mMGx mMGy mMGz 


of 7 of _ 
oF sue ox r ae by r and oz r (4) 


EXAMPLE 7 (a short example) Current in a wite produces a magnetic field B. It is 
the spin field S/r? around the wire, times the strength of the current. 


STREAMLINES AND LINES OF FORCE 


Drawing a vector field is not always easy. Even the spin field looks messy when the 
vectors are too long (they go in circles and fall across each other). The circles give a 
clearer picture than the vectors. In any field, the vectors are tangent to “field lines”— 
which in the spin case are circles. 


DEFINITION $C is a field line or integral curve if the vectors F(x, y) are tangent to C. 
The slope dy/dx of the curve C equals the slope N/M of the vector F = Mi+ Nj: 


dy N(x, y) 


dx M(x, y) 

We are still drawing the field of vectors, but now they are infinitesimally short. 

They are connected into curves! What is lost is their length, because S and S/r and 

S/r? all have the same field lines (circles). For the position field R and gravity field 

R/r>, the field lines are rays from the origin. In this case the “curves” are actually 
straight. 


(- - : for the spin feld) (6) 


EXAMPLE 8 Show that the field lines for the velocity field V = yi + xj are hyperbolas. 


dy N 
bat Ah SO? a ydy=x dx => 4y? — $x? = constant. 


15.1 Vector Fields 


Ag. 15.4 Velocity fields are tangent to streamlines. Gradient fields also have equipotentials. 


At every point these hyperbolas line up with the velocity V. Each particle of fluid 
travels on a field line. In fluid flow those hyperbolas are called streamlines. Drop a 
leaf into a river, and it follows a streamline. Figure 15.4 shows the streamlines for a 
river going around a bend. 

Don’t forget the essential question about each vector field. Is it a gradient field? 
For V = yi+ xj the answer is yes, and the potential is f= xy: 


the gradient of xy is (6f/6x)i + (Of/Gy)j = yi + xj. (7) 


When there is a potential, it has level curves. They connect points of equal potential, 
so the curves f{x, y)=c are called eguipotentials. Here they are the curves xy = c— 
also hyperbolas. Since gradients are perpendicular to level curves, the streamlines are 
perpendicular to the equipotentials. Figure 15.4 is sliced one way by streamlines and 
the other way by equipotentials, 


A gradient field F = of/éx i+ df/éy j is tangent to the field lines (stream- 
lines) and perpendicular to the equipotentials (level curves of f). 


In the gradient direction { changes fastest. In the level direction f doesn’t change at 
all. The chain rule along f(x, y) =c proves these directions to be perpendicular: 


Sf dx dy _ = 
ae GS 0 or (grad f)-{tangent to level curve) = 0. 
EXAMPLE 9 The streamlines of S/r? are circles around (0, 0). The equipotentials are 
rays @=c. Add rays to Figure 15.2 for the gradient field S/r?. 


For the gravity field those are reversed. A body is pulled in along the field lines (rays). 
The equipotentials are the circles where f= l/r is constant. The plane is crisscrossed 
by “orthogonal trajectories”—curves that meet everywhere at right angles. 

If you bring a magnet near a pile of iron filings, a little shake will display the field 
lines. In a force field, they are “lines of force.” Here are the other new words. 


Vector field F(x, y, z) = Mi+ Nj + Pk Piane field F = M(x, y)i + N(x, y)j 
Radial field: multiple of R = xi + yj+zk Spin field: multiple of S= — yi+ xj 
Gradient field = conservative field: M = af/ax, N = offéy, P = affdz 


Potential f(x, y} (not a vector) Equipotential curves f(x, y) = 


Streamline = field line = integral curve: a curve that has F(x, y} as its tangent 
vectors. 
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15,4 
Read-through questions 


A vector field assigns a__ 
In two dimensions F(x, y) 


to each point {x, y) or (x, y, z). 
=_b i+ _¢_j, An example is 
the position field R=__¢_. Its magnitude is [Ri =__e 
and its direction is __¢ _. It is the gradient field for f= 
g__. The level curves are_h , and they are__i to 
the vectors R. 


Reversing this picture, the spin fieldisS=__ J __. Its mag- 
nitude is |S|=__k_ and its direction is__!__. It is not a 
gradient field, because no function has éf/éx=_m_ and 
afféy=__n__. S is the velocity field for flow going 9° 
The streamlines or __P__ lines or integral_q__are__t 
The flow field p¥ gives the rate at which _s__ is moved 
by the flow. 


A gravity field from the origin is proportional to F=__t 
which has |F|=__v__. This is Newton’s  ¥_ square law. 
It is a gradient field, with potential f=__~ __. The equipoten- 
tial curves f(x, y}=c are__x__. They are__y¥___ to the field 
lines which are __z__. This illustrates that the _A_ of a 


function f(x, y)is__8 _ to its level curves. 


The velocity field pi+ xj is the gradient of f=_C¢__. Its 
streamlines are__D__. The slope dy/dx of a streamline equals 
the ratio __E ___ of velocity components. The field is__F to 
the streamlines. Drop a leaf onto the flow, and it goes along 

S 


Find a potential f(x, y) for the gradient fields 1-8. Draw the 
streamlines perpendicular to the equipotentials f(x, y) = c. 


1 F=i+2j (constant field) 2 F=xi+j 

3 F=cos(x+ ylitcoixt+ yj 4 F=(I/yli-(x/y?j 
§ F = (2xi + 2yj)/(x? + y’} 6 F=x7i4 yj 

7 F=xyi+ j 8 F=/yit___j 


9 Draw the shear field F = xj. Check that it is not a gradient 
field: If af/éx =0 then af/dy =x is impossible. What are the 
streamlines (field lines} in the direction of F? 


10 Find all functions that satisfy éf/éx = — y and show that 
none of them satisfy of/@y=x. Then the spin field S= 
— yit xj is not a gradient field. 


Compute d//éx and df/éy in 11-18. Draw the gradient field 
F = grad f and the equipotentials {{x, y)=c: 


11 f=3x+y 12 f=x—3y 

3 f=x+y? 14 fo(x—1)? + y? 
{5 f=x?-y’ 16 f=e* cos y 
WWf=e7" 18 f= yix 


EXERCISES 


Find equations for the streamlines in 19—24 by solving dy/dx = 
N/M (inclading a constant C). Draw the streamlines. 

19 F=i-j 20 F =i+ xj 

21 F =S (spin field) 22 F =Sj/r (spin field) 

23 F = grad (x/y)} 24 F = grad (2x + y). 


25 The Earth's gravity field is radial, but in a room the field 
lines seem to go straight down into the floor. This is because 
nearby field lines always look 


26 A line of charges produces the electrostatic force field F = 


R/r? = (xi + yp)/(x? + y’). Find the potential f(x, y). (F is also 
the gravity field for a line of masses.) 


In 27-32 write down the yector fields Mi+ Nj. 
27 F points radially away from the origin with magnitude 5. 


28 The velocity is perpendicular to the curves x* + p> = c and 
the speed is 1. 


29 The gravitational force F comes from two unit masses at 
(0, 0) and (1, 0). 


30 The streamlines are in the 45° direction and the speed is 4. 


31 The streamlines are circles clockwise around the origin 
and the speed is 1. 


32 The equipotentials are the parabolas y = x7 + ¢ and F is 
a gradient field. 


33 Show directly that the hyperbolas xy = 2 and x? — y? =3 
are perpendicular at the point (2, 1), by computing both slopes 
dy/dx and multiplying to get —1. 


34 The derivative of f(x, y) =c is f, +f{dy/dx) = 0. Show that 


the slope of this level curve is dy/dx = — M/N. It is perpendic- 
ular to streamlines because (— M@/N)(N/M) = 


35 The x and y derivatives of f(r) are f/ex = and 
offay = by the chain rule. (Test f=r?.) The equi- 
potentials are 
36 F =(ax + byjit+ (bx +cy)j is a gradient field. Find the 
potential f and describe the equipotentials. 
37 True or folse: 
1. The constant field i+ 2k is a gradient field. 
2. For non-gradient fields, equipotentials meet stream- 
lines at non-right angles. 
3. In three dimensions the equipotentials are surfaces 
instead of curves. 
4. F=x*i+y*j+27k points outward from (0,0,0)— 
a radial field. 


38 Create and draw f and F and your own equipotentials 
and streamlines. 


45.2 Line Integrals 559 


39 How can different vector fields have the same streamlines? 40 Draw arrows at six or eight points to show the direction 
Can they have the same equipotentials? Can they have the and magnitude of each field: 
same f? (a)R+S (b}R/r—S/r (C) x7i+x7j (d) yi 


ee = 15.2 Line Integrals 


A line integral is an integral along a curve. It can equal an area, but that is a special 
case and not typical. Instead of area, here are two important line integrals in physics 
and engineering: 


Work along a curve = | F-T ds Flow across @ curve = | F-nds. 
c c 


In the first integral, F is a force field. In the second integral, F is a flow field. Work 
is done in the direction of movement, so we integrate F « T. Flow is measured through 
the curve C, so we integrate F-n. Here T is the unit tangent vector, and F + T is the 
force component along the curve, Similarly nis the unit normal vector, at right angles 
with T. Then F +n ts the component of flow perpendicular to the curve. 

We will write those integrals in several forms. They may never be as comfortable 
as J y(x) dx, but eventually we get them under control. I mention these applications 
early, so you can see where we are going. This section concentrates on work, and 
flow comes later. (It is also called fux—the Latin word for flow.) You recognize ds 
as the step along the curve, corresponding to dx on the x axis. Where j dx gives the 
length of an interval (it equals b — a), J ds is the length of the curve. 


EXAMPLE 1 Flight from Atlanta to Los Angeles on a straight line and a semicircle. 


According to Delta Airlines, the distance straight west is 2000 miles. Atianta is at 
(1000, 0) and Los Angeles is at (—1000,0), with the origin halfway between. The 
semicircle route C has radius 1000, This is not a great circle route. It is more of a 
“flat circle,” which goes north past Chicago. No plane could fly it (it probably goes 
into space). 

The equation for the semicircle is x7 + y* = 1000*. Parametrically this path is x = 
1000 cos t, y= 1000 sin ¢. For a line integral the parameter is better. The plane leaves 
Atlanta at = 0 and reaches L.A. at t = x, more than three hours later. On the straight 
2000-mile path, Delta could almost do it. Around the semicircle C, the distance is 
1000x miles and the speed has to be 1000 miles per hour. Remember that speed is 
distance ds divided by time df: 


ds/dt = ./{dx/dt)* + (dy/dt? = 1000./(—sin t)* + (cos 1)? = 1000. (I) 


The tangent vector to C is proportional to (dx/dt, dy/dt) =(—1000 sin t, 1000 cos 2). 
But T is a unit vector, so we divide by 1000-—which is the speed. 

Suppose the wind blows due east with force F = Mi. The components are M and 
zero. For Mf =constant, compute the dot product F-T and the work —2000 M: 


F-T = Mi-(—sinti+cos rj) = M(~sin ¢) + 0(cos t) = — M sint 


% ; d. x ; 
{ F-Tds=|] (MM sin a(F i) =| —1000M sin ¢ dt = — 2000M. 
c 1=0 dt 0 
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Work is force times distance moved. It is negative, because the wind acts against the 
movement. You may point out that the work could have been found more simply — 
go 2000 miles and muitiply by —M. 1 would object that rhis straight route is a 
different path, But you claim that the path doesn’t matter—the work of the wind is 
—2000M on every path. I concede that this time you are right (but not always). 


Most line integrals depend on the path. Those that don’t are crucially important. 
For a gradient field, we only need to know the starting point P and the finish Q, 


45B When F is the gradient of a potential function f(x, y), the work [¢ F+T ds 
depends only on the endpoints P and Q. The work is the change in f: 


if F=éf/éxi+éf/éyj then { F°T ds=f(Q)—f(P). (2) 
c 


When F = Mi, its components M and zero are the partial derivatives of f= Mx. To 
compute the line integral, just evaluate f at the endpoints. Atlanta has x = 1000, Los 
Angeles has x = — 1000, and the potential function f= Mx ts like an antiderivative: 


work = f(Q) —f(P) = M(—1000) — M(1000) = — 2000M. (3) 


Tds Teds 


LAX 
— 1000 


ATL LAX ATL 
1000 — 1000 1000 


J F - Tes =- 2000M depends on path 


Fig. 45.5 Force Mi, work —2000M on all paths. Force Myi, no work on straight path, 


May I give a rough explanation of the work integral [ F- Tds? It becomes clearer 
when the small movement T ds is written as dx i+ dy j. The work is the dot product 
with F: 

ef af Cf 


Ptas=( Fie 49) -lari- arp f dx +“! dy=df (4) 
cy Cx ra 


y 
The infinitesimal work is df: The total work is f df=f(Q) —f(P). This is the Fuadamen- 
tal Theorem for a line integral. Only one warning: When F is not the gradient of any 
J (Example 2), the Theorem does not apply. 


EXAMPLE 2 Fty these paths against the non-constant force field F = Myi. Compute 
the work. 


There is no force on the straight path where y =0. Along the x axis the wind does 
no work. But the semicircle gocs up where y = 1000 sin # and the wind is strong: 


F-T =(Myi)-(—sin ti + cos tj) = — My sin t= — 1000M sin?: 
. " 2, ds ‘ bag ein 2 T ne 
F°-Tds= | (—1000M sin*s)— dt = —10°M sin*t dt = — = 10°M. 
¢ 0 dt 0 2 


This work is enormous (and unrealistic). But the calculations make an important 
point—everything is converted to the parameter t. The second point is that F = Myi 
is not a gradient field. First reason: The work was zero on the straight path and 
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nonzero on the semicircle. Second reason: No function has df/dx = My and af/éy = 
0. (The first makes f depend on y and the second forbids it. This F is called a shear 
force.) Without a potential we cannot substitute P and Q—and the work depends 
on the path, 


THE DEFINITION OF LINE INTEGRALS 


We go back to the start, to define { F ~T ds, We can think of F -T as a function g(x, y) 
along the path, and define its integral as a limit of sums: 


{ e{x, y) ds = limit of ¥ e{x;, yJAs; as (As)... 7 0. (5) 
c i=1 


The points (x;, y;) lie on the curve C. The last point @ is (xy, yy); the first point P is 
(Xo, Yo). The step As; is the distance to (x;, y,) from the previous point. As the steps 
get smali {As — 0) the straight pieces follow the curve. Exactly as in Section 8.2, the 
special case g=1 gives the arc length. As long as g(x, y) is piecewise continuous 
(jumps allowed) and the path is piecewise smooth (corners allowed), the limit exists 
and defines the line integral. 

When g is the density of a wire, the line integral is the total mass. When g is F “T, 
the integral is the work. But nobody does the calculation hy formula (5). We now 
introduce a parameter t—which could be the time, or the arc length s, or the distance 
x along the base. 

The differential ds becomes (ds/dt}dt, Everything changes over to t: 


t= 


b 
ext), y(t) ./(dx/dt)? + (dy/dt)? de. (6) 


[acs y)ds = 


The curve starts when t = a, runs through the points (x(t), y(t), and ends when t =b. 
The square root in the integral is the speed ds/dt. In three dimensions the points on 
C are (x(t), y(t), 2(t)) and (dz/dt)? is in the square root. 


EXAMPLE 3 The points on a coil spring are (x, y, z) =({cos ¢, sin ¢, t). Find the mass 
of two complete turns (from t = 0 to t = 4x) if the density is p = 4. 


Solution The key is (dx/dt)* + (dy/ds)? + (dz/dt)? —sin?t+cos¢z+1=2. Thus 
ds/dt = of. To find the mass, integrate the mass per unit length which is g = p = 4: 


ax ds 4x 
mass | pGa~ | 4,/2 dt =16 2n. 
o «at ° 


That is a line integral in three-dimensional space. It shows how to introduce t. But 
it misses the main point of this section, because it contains no vector field F. This 
section is ahout work, not just mass. 


DIFFERENT FORMS OF THE WORK INTEGRAL 


The work integral { F ‘T ds can be written in a better way. The force is F = Mi + Nj. 
A small step along the curve is dx i + dy j. Work is force times distance, but it is only 
the force component along the path that counts. The dot product F - T ds finds that 
component automatically. 
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45C The vector to a point on C is R = xi+ yj. Then dR =Tds =dxitdy;j 


work=| FedR=| Mdr+N dy (7) 
c c 


Along a space curve the work is [F-T ds={F+dR=JM dx +N dy + P dz. 


The product M dx is (force in x direction}(movement in x direction), This is zero if 
cither factor is zero, When the only force is gravity, pushing a piano takes no work. 
It is friction that hurts. Carrying the piano up the stairs brings in Pdz, and the total 
work is the piano weight P times the change in z, 

To connect the new j F-dR with the old [F-T ds, remember the tangent vector 
T. It is dRjds. Therefore Tds is dR. The best for computations is dR, because the 
unit vector T has a division by ds/dt = ./(dx/dt)* + (dy/dz)*. Later we multiply by this 
square root, in converting ds to (ds/dt)dt. It makes no sense to compute the square 
root, divide by it, and then multiply by it. That is avoided in the improved form 
[Mdx+N dy. 


EXAMPLE 4 Vector field F = — yi + xj, path from (1, 0) to (0, 1): Find the work. 


Note] This F is the spin field S. It goes around the origin, while R = xi + yj gocs 
outward. Their dot product is F-R=—yx+xy=0. This does not mean that 
F-dR =0. The force is perpendicular to R, but not to the change in R. The work to 
move from (1,0) to (0, 1), x axis to y axis, is not zero. 

Note2 We have not described the path C. That must be done. The spin field is not 
a gradient field, and the work along a straight line does not equal the work on a 
quarter-circle: 


straight line x=!-—t, y=t quarter-circle x = cost, y =sinf. 
Cateulation of work Change F-dR = M dx + N dy to the parameter ¢: 


1 
Straight line: {- ydx +xdy= —t(—dt)+{1—fdt=1 
Jo 


m2 


Quarter-circle: {- ydx+x av=[ —sin —sin t dt)+cos dcos ¢ dt) = 
0 


hola 


General method The path is given by x(t) and p(s). Substitute those into M(x, y) 
and N(x,y)—then F is a function of 1. Also find dx/dt and dyjdt. Integrate 
M dxidt + N dyide from the starting time r to the finish. 


4 [are 
work f F-dR =] work R/2 no work 


Fig. 15.6 Three paths for /F-dR=Jj —ydx+xdy=1, 22,0. 


15.2 Line integrals 


For practice, take the path down the x axis to the origin (x = 1 —t, y=0), Then 
go up the y axis (x = 0, y =1 — 1). The starting time at (1, 0) is t = 0. The turning time 
at the origin is t = 1. The finishing time at (0, 1) is t= 2. The integral has two parts 
because this new path has two parts: 


Bent path: [—ydx+xdy=0+0 (y=0 on one part, then x =0). 


Note 3 The answer depended on the path, for this spin field F = S. The answer did 
not depend on the choice of parameter. If we follow the same path at a different 
speed, the work is the same. We can choose another parameter t, since (ds/di)dt and 
(ds/dt)dt both equal ds. Traveling twice as fast on the straight path (x =1-—2r, 
y = 2z) we finish at t =4 instead of t= 1. The work is still 1: 


1/2 


{- ydx+xdy= (—2t)(— 2dz) + (1 — 22)(2d7) = ie 2 dt=1. 
0 9 


CONSERVATION OF TOTAL ENERGY (KINETIC + POTENTIAL) 


When a force field does work on a mass m, it normally gives that mass a new velocity. 
Newton’s Law is F = ma = m dv/dt. (It is a vector law. Why write out three compo- 
nents?) The work [F-dR is 


[im dv/di)+(v dt) = dmv-y Jf = 4miv(Q)[? — 4m HPP. (8) 


The work equals the change in the kinetic energy jm|v|’. But for a gradient field the 
work is also the change in potential—with a minus sign from physics: 


work = [F -dR=— [u- s(P) —f(Q). (9) 


Comparing (8) with (9), the combination $mly|? +f is the same at P and Q. The total 
energy, kinetic plus potential, is conserved. 


INDEPENDENCE OF PATH: GRADIENT FIELDS 


The work of the spin field S depends on the path. Example 4 took three paths— 
straight line, quarter-circle, bent line. The work was 1, 2/2, and 0, different on each 
path. This happens for more than 99.99% of all vector fields. It does not happen for 
the most important fields. Mathematics and physics concentrate on very special 
fields—for which the work depends only on the endpoints. We now explain what 
happens, when the integral is independent of the path. 

Suppose you integrate from P to @ on one path, and back to P on another path. 
Combined, that is a closed path from P to P (Figure 15.7). But a backward integral 
is the negative of a forward integral, since dR switches sign. If the integrals from P 
to Q are equal, the integral around the closed path is zero: 


P @ P 2 Q 

b F-dR -{ F-dR+ | F:dR -| F-dR - | F-dR =0. (10) 

P P r?] P P 
closed path | back path 2 path 1 path 2 


The circle on the first integral indicates a closed path. Later we will drop the P’s. 
Not ail closed path integrals are zero! For most fields F, different paths yieid 
different work. For “conservative” fields, all paths yield the same work. Then zero 
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work around a closed path conserves energy. The big question is: How to decide 
which fields are conservative, without trying all paths? Here is the crucial information 
about conservative fields, in a plane region R with no holes: 


45D F = M(x, y)i+ N(x, y)j is a conservative field if it has these properties: 
A. The work JF +dR around every closed path is zero. 
B. The work [er *dR depends only on P and Q, not on the path. 


C. F is a gradient field. M = éf/dx and N =¢f/éy for some potential f(x, y). 
D, The components satisfy ¢M/éy = @N/éx. 


A field with one of these properties has them all. D is the quick test. 


These statements A D bring everything together for conservative fields (alias gradient 
fields). A closed path goes one way to Q and back the other way to P. The work 
cancels, and statements A and B are equivalent. We now connect them to C. Note: 
Test D says that the “eurf” of F is zero. That can wait for Green’s Theorem in the 
next section—the full discussion of the curl comes in 15.6. 

First, a gradient field F = grad f is conservative. The work is f(Q)—f(P), by the 
fundamental theorem for line integrals. It depends only on the endpoints and not the 
path. Therefore statement C leads back to B. 

Our job is in the other direction, to show that conservative fields Mi+ Nj are 
gradients. Assume that the work integral depends only on the endpoints. We must 
construct a potential f, so that F is its gradient. In other words, ¢fiéx must be M and 
cficy must be N, 


Fix the point P. Define {(Q) as the work to reach Q. Then F equals grad f- 


Check the reasoning. At the starting point P, fis zero. At every other point Q, f is 
the work | Mdx + N dy to reach that point. All paths from P to Q give the same {{Q), 
because the field is assumed conservative. After two examples we prove that grad f 
agrees with F—the construction succeeds. 


back path 2 


path 2 ; g? 
iQi=|-F-dR 

fr-a-0 So Ag ip ‘ 

PS a g7? 


: dR 
f F 


path | 
Fig. 45.7 Conservative fields: {F-dR =0 and [$F -dR=/(Q) f(P). Here f(P)=0. 


EXAMPLE5 Find f(x,y) when F= Mi+ Nj=2xyit+x7j. We want éf:ex =2xy 
and éfféy =x?. 


Solution 1 Choose P = {0, 0). Integrate M dx + N dy along to (x, 0) and up to (x. +): 
ix.) dey} 
| 2xv dx =0 {since » =0) | x?dy = x*y (which is f). 
i0.Q) (x0) 


Certainly f= x*y meets the requirements: f, = 2x) and f, = x7. Thus F = grad f. Note 
that dy =0 in the first integral (on the x axis). Then dx =0 in the second integral 
(x is fixed). The integrals add to f= x’y. 


15.2 Line Integrals 


Solution 2 Integrate 2xy dx + x?dy on the straight line (xt, yt) from r=0 to t=1: 


[, A(xt)(ye)(x at) + (xtP(y de) = {, 3x? yt7dt = x? yt? ]4 = x7y, 
+7] 0 


Most authors use Solution 1, 1 use Solution 2, Most students use Solution 3: 
Solution 3 Directly solve af/@x = M = 2xy and then fix up df/dy = N = x": 
afjix=2xy gives f=x"y (plus any function of y). 


In this example xy already has the correct derivative 4f/ay =x?. No additional 
function of y is necessary. When we integrate with respect to x, the constant of 
integration (usually C} becomes a function C(y). 

You will get practice in finding f, This is only possible for conservative fields! J 
tested M = 2xy and N = x? in advance (using D) to be sure that €M/dy = @N/Ox. 


EXAMPLE 6 Look for f(x, y) when Mi+ Nj is the spin field — yi + xj. 
Attempted solution 1 Integrate —ydx + x dy from (0, 0) to (x, 0) to (x, y): 


(x, 0) {x. 9) 
| —ydx=0 and | xdy=xy (which seems like f). 
t t 


0,0) x, 0) 


Attempted solution 2 Integrate — dx + x dy on the line (xt, yt) from t=O to I: 
1 
| —(yt)(x dt) + (xt){ydt)=0 (a different f, also wrong). 
0 


Attempted solution 3 Directly solve df/éx = — y and try to fix up af/éy =x: 
ofjax=—y gives f= —xy (plus any function C(y)). 


The y derivative of this f is —x+dC/dy. That does not agree with the required 
af/dy =x. Conclusion: The spin field — yi+ xj is mot conservative. There is no f- 
Test D gives GM/éy= —1 and @N/éx = +1. 


To finish this section, we move from examples to a proof. The potential {(Q) is 
defined as the work to reach Q. We must show that its partial derivatives are M and 
N. This seems reasonable from the formula f(Q)= | Mdx+Ndy, but we have to 
think it through. 

Remember statement A, that all paths give the same f(Q). Take a path that goes 
from P to the left of @. It comes in to Q on a line y=constant (so dy = 0). As the 
path reaches Q, we are only integrating M dx. The derivative of this integral, at Q, is 
ofjéx = M. That is the Fundamental Theorem of Calculus. 

To show that df/éy = N, take a different path. Go from P to a point below Q. The 
path comes up to @ on a vertical line (so dx = 0). Near Q we are only integrating 
N dy, so Gf/dy = N. ; 

The requirement that the region must have no holes will be critical for test D. 


EXAMPLE 7 Find f(x, y) = fig, xdx + ydy. Test D is passed: @N/Ax = 0 = 6M/dy. 
Solution 1 f(o,,xdx = 3x? is added to Ji°9, ydy = hy”. 

Solution 2 [4 (xt)(x dt) + (yt)(y dt) = [5 (x? + pede = (x? + y). 

Solution 3 af/éx =x gives f= 4x? + C(y). Then df/éy = y needs C(y) = 4y?. 
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415.2 EXERCISES 


Read-through questions 


Work is the 9 of F-dR. Here Fis the _> and R is 


the. ¢ .The_d __ product finds the component of _ ¢ 


in the direction of movement dR = dxi+dyj. The straight 
path {x, y)=(t,2¢) goes from __f att=Oto_@ att= 


1 with dR =dti+__h_. The work of F =3i+jis[F-dR= 
fit dt=_i 


Another form of dR is Tds, where Tis the _k vector to 
the path and ds = (__. For the path (¢, 22), the unit vector 
Tis_m_andds=__n__ dt. For F =3i+4, F’Tds is still 


o dt. This F is the gradient of f=__p__. The change in 


f=3xty from (0,0) to (l,2)is a 


When F=gradf, the dot product F«dR is (éf/dx)dx + 
1__=df. The work integral from P to Q is fdf=_s_. 
In this case the work depends on the _ t but not on the 
__4__. Around a closed path the work is__¥__. The field is 
called _w_. F=(1+ y}i+ xj is the gradient of f=__x_. 
The work from (0, 0}to (1, 2)is__y _, the change in potential. 


For the spin field S=__2z__, the work (does)(does not) 
depend on the path. The path (x, y)=(3 cost, 3sin 2) is a 
circle with S-dR=_A_. The work is _2 around the 
compiete circle. Formally f g(x, y)ds is the limit of the sum 

c.. 


The four equivalent properties of a conservative field F = 
Mi+NjareA:_0 BE ,C:_ FF  ,andD:_¢© 


Test D is (passed}{not passed) by F =(y + 1)i+ xj. The work 
j F-dR around the circle (cos ¢, sin t)is__. H_. The work on 
the upper semicircle equals the work on __! __. This field is 


the gradient of f=__/__, so the work to (—I, O} is __K 


Compute the line integrals in 1-6. 
1 feds and fody: x=t, y= 2t,0<¢<1. 
2 fexds and foxyds: x =cost, y=sint,0<t<x/2, 
3 {xy ds: bent line from (0, 0} to (1, 1) to {I, 0}. 
4 {-ydx —xdy: any square path, sides of length 3. 
5 §¢dx and [- ydx: any closed circte of radius 3. 
6 jc(ds/dt) de: any path of length 5. 
7 Does [@ xydy equal }xy?]2? 
8 Does [2x dx equal $x7)¢? 
9 Does (feds)? =(fcdxy’ + fedy)?? 
10 Does {- (ds)? make sense? 
In 11-16 find the work in moving from {1, 0) to (0, 1}. When F 


is conservative, construct { Choose your own path when F is 
not conservative. 


11 F=i+yj 12 F=yitj 


13 F=xy'it+ yxj 
16 F=(x/r)it+ (y/nj 
17 For which powers n is S/r" a gradient by test D? 


14 F=ei4 xe'j 
16 F= —yi+ x7j 


18 For which powers n is R/r" a gradient by test D? 


19 A wire hoop around a vertical circle x?+2?=a" has 
density p =a +z. Find its mass M = J pds. 


20 A wire of constant density p lies on the semicircle 
x? + y? =a", y2O0. Find its mass M and also its moment 
M, =f pyds. Where is its center of mass t= M,/M, ¥=M,/ 
M? 


21 Ifthe density around the circle x* + y? = a? is p = x”, what 
is the mass and where is the center of mass? 
22 Find [F-dR along the space curve x=1, y=0t?, z=2°, 
O<t<l. 
(a) F=grad (xy+xz}  (b} F=yi-—xj+zk 
23 (a) Find the unit tangent vector T and the speed ds/dt 
along the path R = 2ti + #?j. 
(b) For F=3xi+4j, find F-Tds using (a) and F-dR 
directly. 
({c) What is the work from (2, 1} to (4, 4)? 
24 If M(x, y, zyit+ N(x, y, 2)j is the gradient of f(x, y, z}, show 
that none of these functions can depend on z. 


25 Find all gradient fields of the form M(yji + NQx)j. 


26 Compute the work W(x,y)=[Mdx+Ndy on the 
straight line path (xt, yt} from t=O tot = 1. Test to see if dW 
éx = M and dW/dy=N., 

{a) M=y?, N =3xy? (b} M =x°, N = 3px’ 

(c}) M=x/y, N=y/x (d)M=e*),N=e* 
27 Find a field F whose work around the unit square (y=0 
then x = 1 then y=1 then x = 9) equals 4. 


28 Find a nonconservative F whose work around the unit 
circle x? + y? =1 is zero. 


in 29-34 compute {F-dR along the straight line R = 4i + ¢j 
and the parabola R = ti + 27}, from (0, 0} to (1, 1). When F is a 
gradient field, use its potential f(x, y). 


29 F=i-2j 30 F=x3j 
31 F = 2xy7i + 2yx7j 32 F=x*yit xy?j 
33 F=yi- xj 34 F=(xi+ yp? +y? +1) 


35 For which numbers a and b is F=axyi+(x*+5y)j a 
gradient field? 


36 Compute f— ydx+xdy from (1,0) to (0, 1} on the line 
x=1-—2, y=¢? and the quarter-circle x = cos 2t, y = sin 2t. 
Example 4 found | and x/2 with different parameters. 
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Apply the test N. = M, to 37-42. Find f when test D is passed. 43 Around the unit circle find §ds and $dx and § xds. 


44 True or false, with reason: 


3 = 2o-*] 2 -3 <2 a) ¢ 
lai hs eo he aod (a) When F = yi the line integral (F-dR along a curve 
xi+yj grad xy from P to Q equals the usual area under the curve. 
9F= |xi+ yl 0 F= [grad xy| {b) That line integral depends only on P and Q, not on the 
curve. 
41 F=R+S 42 F=(ax + by)ji+ (cx + dy)j (c) That line integral around the unit circle equals z. 


Ree §686©§15.3 Green's Theorem EE! 


This section contains the Fundamental Theorem of Calculus, extended to two dimen- 
sions. That sounds important and it is. The formula was discovered 150 years after 
Newton and Leibniz, by an ordinary mortal named George Green. His theorem 
connects a double integral over a region R to a line integral along its boundary C. 

The integral of df/dx equals f(b) — f(a). This connects a one-dimensional integral 
to a zero-dimensional integral. The boundary only contains two points a and b! The 
answer f(b) — f(a) is some kind of a “point integral.” It is this absolutely crucial idea— 
to integrate a derivative from information at the boundary—that Green’s Theorem 
extends into two dimensions. 


There are two important integrals around C. The work is [F» Tds =| M dx + N dy. 
The flux is [F+nds =| Mdy—N dx (notice the switch). The first is for a force field, 
the second is for a flow field. The tangent vector T turns 90° clockwise to become 
the normal vector n, Green’s Theorem handles both, in two dimensions. In three 
dimensions they split into the Divergence Theorem (15.5) and Stokes’ Theorem (15.6). 

Green’s Theorem applies to “smooth” functions M(x, y) and N(x, y), with con- 
tinuous first derivatives in a region slightly bigger than R. Then all integrals are well 
defined. M and N will have a definite and specific meaning in each application—to 
electricity or magnetism or fluid flow or mechanics. The purpose of a theorem is to 
capture the central ideas once and for all. We do that now, and the applications 
follow. 


45E Green’s Theorem Suppose the region -R is bounded by the simple 
closed piecewise smooth curve C. Then an integra] over R equals a line integral 


around C: 
ON OM 
pmax way= [| (B- Sana (1) 


A curve is “simple” if it doesn’t cross itself (figure 8°s are excluded). It is “closed” if 
its endpoint @ is the same as its starting point P. This is indicated by the closed circle 
on the integral sign. The curve is “smooth” if its tangent T changes continuously — 
the word “piecewise” allows a finite number of corners. Fractals are not allowed, but 
all reasonable curves are acceptable (later we discuss figure 8’s and rings), First comes 
an understanding of the formula, by testing it on special cases. 
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. oe, = 
YARM 23k 


strip area (x, —.x,)dy 


x4dy 


y,dx 
Fig. 15.8 Area of R adds up strips: $x dy = {{ dx dy and $ y dx = — ff dy dx. 


Special case 1: M=0 and N =x. Green’s Theorem with 0N/0x = 1 becomes 


} xdy= {| 1 dxdy (which is the area of R). (2) 
c R 


The integrals look equal, because the inner integral of dx is x. Then both integrals 
have xdy—but we need to go carefully. The area of a layer of R is dy times the 
difference in x (the length of the strip). The line integral in Figure 15.8 agrees. It has 
an upward dy times x (at the right) plus a downward —dy times x (at the left). The 
integrals add up the strips, to give the total area. 


Special case 2: M=y and N =0 and $-ydx = {{(—1) dx dy = — (area of R). 


Now Green’s formula has a minus sign, because the line integral is counterclockwise. 
The top of each slice has dx < 0 (going left) and the bottom has dx > 0 (going right). 
Then ydx at the top and bottom combine to give minus the area of the slice in 
Figure 15.8b. 


Special case 3: $1 dx =0. The dx’s to the right cancel the dx’s to the left (the curve 
is closed). With M =1 and N =0, Green’s Theorem is 0 = 0. 


Most important case: Mi + Nj is a gradient field. It has a potential function f(x, y). 
Green’s Theorem is 0 = 0, because 0M/0y = 0N/Ox. This is test D: 


4 4 fA AN a far 

M ee () is the same as — = £(2) (3) 

ey iy \ ex Ox 6x\éy 

The cross derivatives always satisfy /,, =/,,. That is why gradient fields pass test D. 
When the double integral is zero, the line integral is also zero: $-¢ M dx + N dy =0. 

The work is zero. The field is conservative! This last step in A> B>C>D=A will 


be complete when Green’s Theorem is proved. 
Conservative examples are $x dx =0 and $ ydy =0. Area is not involved. 


QS 


Remark The special cases $xdy and —4ydx led to the area of R. As long as 
= 0N/0x — 6M/éy, the double integral becomes {{ 1 dx dy. This gives a way to com- 
pute area by a line integral. 


The area of R isd xdy= -> pipes (x dy — ydx). (4) 
c Cc 2 c 


EXAMPLE 4 The area of the triangle in Figure 15.9 is 2. Check Green’s Theorem. 


The last area formula in (4) uses $8, half the spin field. N =43x and M = —4y yield 
N,—M,=4+4=1. On one side of Green’s Theorem is {{1 dx dy =area of triangle. 
On the other side, the line integral has three pieces. 


15.3 Green’s Theorem 


(0, b) 


x=acost 
y=hsint 
(a, 0) 


Fig. 15.9 Green’s Theorem: Line integral around triangle, area integral for ellipse. 


(2, 0) 


y=0 


Two pieces are zero: x dy — ydx = 0 on the sides where x = 0 and y = 0. The sloping 
side x = 2—y has dx = —dy. The line integral agrees with the area, confirming 
Green’s Theorem: 


l 1 {* 1 [# 
=® xdy—ydx== (2—y)dy+ydy==| 2dy=2. 
2 Cc 2 y=0 2 


0 


EXAMPLE 2 The area of an ellipse is nab when the semiaxes have lengths a and b. 


This is a classical example, which all authors like. The points on the ellipse are 
x=acost, y=bsint, as t goes from 0 to 2z. (The ellipse has (x/a)? + (y/b)? = 1.) 
By computing the boundary integral, we discover the area inside. Note that the 
differential x dy — ydx is just ab dt: 


(a cos t)(b cos t dt)—(b sin t)(—a sin t dt) = ab(cos*t + sint)dt = ab dt. 
The line integral is ${5" ab dt = nab. This area nab is mr?, for a circle with a=b=r. 


Proof of Green’s Theorem: In our special cases, the two sides of the formula were 
equal. We now show that they are always equal. The proof uses the Fundamental 
Theorem to integrate (@N/@x)dx and (6M/éy)dy. Frankly speaking, this one-dimen- 
sional theorem is all we have to work with—since we don’t know M and N. 

The proof is a step up in mathematics, to work with symbols M and N instead of 
specific functions. The integral in (6) below has no numbers. The idea is to deal with 
M and N in two separate parts, which added together give Green’s Theorem: 


gpa | 


Start with a “very simple” region (Figure 15.10a). Its top is given by y=/f(x) and 
its bottom by y = g(x). In the double integral, integrate —0M/0y first with respect to 
y. The inner integral is 


A 
si dxdy and separately } Ndy= {| = dx dy. (5) 
oy c R OX 


J) aM rx 

\.. 7 op see. ye = — M(x, f(x) + M(x, g(x). (6) 
The Fundamental Theorem (in the y variable) gives this answer that depends on x. 
If we knew M and f and g, we could do the outer integral—from x = a to x = b. But 
we have to leave it and go to the other side of Green’s Theorem—the line integral: 


b 


pm dx = | M(x, y)dx + | M(x, y)dx = iL M(x, f(x))dx + | M(x, g(x))dx. (7) 
top bottom b 
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y=g(x) J Madx 


Fig. 15.40 Very simple region (a—b). Simple region (c) is a union of very simple regions. 


Compare (7) with (6). The integral of M(x, g(x)) is the same for both. The integral of 
M(x, f(x)) has a minus sign from (6). In (7) it has a plus sign but the integral is from 
b to a. So life is good. 

The part for N uses the same idea. Now the x integral comes first, because 
(GN/éx)dx is practically asking to be integrated—from x = G(y) at the left to x = F(y) 
at the right. We reach N(F(y), y) — N(G(y), y). Then the y integral matches $ Ndy and 
completes (5). Adding the two parts of (5) proves Green’s Theorem. 


Finally we discuss the shape of R. The broken ring in Figure 15.10 is not “very 
simple,” because horizontal lines go in and out and in and out. Vertical lines do the 
same. The x and y strips break into pieces. Our reasoning assumed no break between 
y=f(x) at the top and y = g(x) at the bottom. 

There is a nice idea that saves Green’s Theorem. Separate the broken ring into 
three very simple regions R,, R,, R;. The three double integrals equal the three line 
integrals around the R’s. Now add these separate results, to produce the double 
integral over all of R. When we add the line integrals, the crosscuts CC are covered 
twice and they cancel. The cut between R, and R, is covered upward (around R,) 
and downward (around R,). That leaves the integral around the boundary equal to 
the double integral inside—which is Green’s Theorem. 

When R is a complete ring, including the piece R4, the theorem is still true. The 
integral around the outside is still counterclockwise. But the integral is clockwise 
around the inner circle. Keep the region R to your left as you go around C. The 
complete ring is “doubly” connected, not “simply” connected. Green’s Theorem 
allows any finite number of regions R; and crosscuts CC and holes. 


EXAMPLE 3 The area under a curve is i, y dx, as we always believed. 


In computing area we never noticed the whole boundary. The true area is a line 
integral — 4 ydx around the closed curve in Figure 15.1la. But y=0 on the x axis. 
Also dx =0 on the vertical lines (up and down at b and a). Those parts contribute 
zero to the integral of ydx. The only nonzero part is back along the curve—which 
is the area — [ff ydx or |? ydx that we know well. 

What about signs, when the curve dips below the x axis? That area has been 
counted as negative since Chapter |. I saved the proof for Chapter 15. The reason 
lies in the arrows on C. 

The line integral around that part goes the other way. The arrows are clockwise, 
the region is on the right, and the area counts as negative. With the correct rules, a 
figure 8 is allowed after all. 


15.3 Green’s Theorem 


b 
$- ydx =| ydx ‘ 
dx=0 a" a F=S/r- 
( =0=0 
f=0=2n 
dx =0 infinite spin 
at the center $F -dR=2n 


Fig. 15.44 Closed path gives the sign of the area. Nonconservative field because of hole. 


CONSERVATIVE FIELDS 


We never leave gradients alone! They give conservative fields—the work around a 
closed path is f(P) —f(P) =0. But a potential function f(x, y) is only available when 
test D is passed: If 0f/0x = M and 0f/0y = N then 0M/éy = ON/Ox. The reason is that 
Sag = Sy 

Some applications prefer the language of “differentials.” Instead of looking for 
J (x, y), we look for df: 


DEFINITION The expression M(x, y)dx + N(x, y)dy is a differential form. When it 
agrees with the differential df= (éf/0x)dx + (df/éy)dy of some function, the form 
is called exact. The test for an exact differential is D: 6N/éx = 6M/éy. 


Nothing is new but the language. Is ydx an exact differential? No, because M, = 1 
and N,=0. Is ydx + xdy an exact differential? Yes, it is the differential of f= xy. 
That is the product rule! Now comes an important example, to show why R should 
be simply connected (a region with no holes). 


EXAMPLE 4 The spin field S/r? = (— yi + xj)/(x? + y) almost passes test D. 


@( x \_x?+y?—x(2x) _O/_-y \_—(%*+y’)+y(2y) 
( J (x? +y"P =M=s( )- . 


Both numerators are y* — x”. Test D looks good. To find f, integrate M = 6f/ éx: 


eT” Bx x+y = e+y (x? + y?)? 


f(x, y)= [-» dx/(x* + y*) =tan~ '(y/x) + C(y). 


The extra part C(y) can be zero—the y derivative of tan~ '(y/x) gives N with no help 
from C(y). The potential f is the angle 0 in the usual x, y, r right triangle. 


Test D is passed and F is grad 6. What am I worried about? It is only this, 
that Green’s Theorem on a circle seems to give 2x=0. The double integral is 
Jf (N. — M,)dx dy. According to (8) this is the integral of zero. But the line integral is 
2n: 


br -dR= hr y dx + x dy)/(x? + y?) = 2(area of circle)/a? =2na?/a*=2n. (9) 


With x =acost and y=asint we would find the same answer. The work is 2x (not 
zero!) when the path goes around the origin. 

We have a paradox. If Green’s Theorem is wrong, calculus is in deep trouble. Some 
requirement must be violated to reach 2x = 0. Looking at S/r?, the problem is at the 
origin. The field is not defined when r = 0 (it blows up). The derivatives in (8) are not 
continuous. Test D does not apply at the origin, and was not passed. We could remove 
(0, 0), but then the region where test D is passed would have a hole. 
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It is amazing how one point can change everything. When the path circles the 
ofigin, the line integral is not zero. The potential function f= 0 increases by 2x. That 
agrees with {F -dR = 2x from (9). It disagrees with {{ 0 dx dy. The 2z is right, the zero 
is wrong. N, — M, must be a “delta function of strength 2x.” 

The double integral is 2z from an infinite spike over the origin—even though N, = 
M, everywhere else. In fluid flow the delta function is a “vortex.” 


FLOW ACROSS A CURVE: GREEN’S THEOREM TURNED BY 90° 


A flow field is easier to visualize than a force field, because something is really there 
and it moves. Instead of gravity in empty space, water has velocity M(x, y)it+ N(x, y)j. 
At the boundary C it can flow in or out. The new form of Green’s Theorem is a 
fundamental “balance equation” of applied mathematics: 


Flow through C (out minus in) = replacement in R (source minus sink). 


The flow is steady. Whatever goes out through C must be replaced in R, When there 
are no sources or sinks (negative sources}, the total flow through C must be zero. 
This balance law is Green’s Theorem in its “normal form” (for n) instead of its 
“tangential form” {for T): 


45F For a steady flow field F= M(x, y)it N(x, yj, the flux { F-nds through 
the boundary C balances the replacement of fluid inside R: 


aM ON 
$ M dy — Ndx= {I (= + *) dx dy. (10) 


Figure 15.12 shows the 90° turn. T becomes n and “circulation” along C becomes 
flux through C. In the original form of Green’s Theorem, change N and M to M and 
—N to obtain the Aux form: 


fm as +Ndy> bax +M dy [lo — M,}dxdy > te +Nyjdxdy. (II) 


Playing with letters has proved a new theorem! The two left sides in {11) are equal, 
so the right sides are equal— which is Green’s Theorem (10) for the flux. The compo- 
nents M and N can be chosen freely and named freely. 

The change takes Mi + Nj into its perpendicular field - Ni+ Mj. The field is turned 
at every point (we are not just turning the plane by 90°). The spin field S = — yi + xj 
changes to the position field R = xi + yj. The position field R changes to —S. Stream- 
lines of one field are equipotentials of the other field. The new form (10) of Green's 


circulation 
FsTds = Mdx + Ndy g> . 


Tds Cc 
a7 | ay idy 
idy 


dx idx 


-jdx nds Fends = Mdy -Ndx ™ flux 


Aig. 15.12 The perpendicular component F -n flows through C. Note ads = dyi—dxj. 


15.3 Green’s Theorem 


Theorem is just as important as the old one—in fact I like it better. It is easier to 
visualize flow across a curve than circulation along it. 
The change of letters was just for the proof. From now on F = Mi+ Nj. 


EXAMPLE 5 Compute both sides of the new form (10) for F = 2xi + 3yj. The region 
R is a rectangle with sides a and b. 


Solution This field has 0M/0x + 6N/dy = 2 + 3. The integral over R is {{p 5dxdy = 
Sab. The line integral has four parts, because R has four sides. Between the left and 
right sides, M =2x increases by 2a. Down the left and up the right, { M dy = 2ab 
(those sides have length b). Similarly N = 3y changes by 3b between the bottom and 
top. Those sides have length a, so they contribute 3ab to a total line integral of Sab. 


Important: The “divergence” of a flow field is OM/0x + ON/éy. The example has 
divergence =5. To maintain this flow we must replace 5 units continually—not just 
at the origin but everywhere. (A one-point source is in example 7.) The divergence is 
the source strength, because it equals the outflow. To understand Green’s Theorem 
for any vector field Mi + Nj, look at a tiny rectangle (sides dx and dy): 


Flow out the right side minus flow in the left side = (change in M) times dy 
Flow out the top minus flow in the bottom = (change in N) times dx 
Total flow out of rectangle: dM dy + dN dx =(@M/0x + ON/éy)dx dy. 


The divergence times the area dx dy equals the total flow out. Section 15.5 gives more 
detail with more care in three dimensions. The divergence is M, + N, + P-. 


flux 3ab F = 2xi 
+ 3yj 
b N 
flux 
M_ 2ab 
0 a 


Fig. 15.43 M,+N,=2+3=5 yields flux = S(area)=Sab. The flux is dMdy+dN dx= 
(M, + N,) dx dy. The spin field has no flux. 


EXAMPLE 6 Find the flux through a closed curve C of the spin field S = — yi + xj. 


Solution The field has M = — y and N=x and M, + N, =0. The double integral is 
zero. Therefore the total flow (out minus in) is also zero—through any closed curve. 
Figure 15.13 shows flow entering and leaving a square. No fluid is added or removed. 
There is no rain and no evaporation. When the divergence M,. + N, is zero, there is 
no source or sink. 


FLOW FIELDS WITHOUT SOURCES 


This is really quite important. Remember that conservative fields do no work around 
C, they have a potential f, and they have “zero curl.” Now turn those statements 
through 90°, to find their twins. Source-free fields have no flux through C, they have 
stream functions g, and they have “zero divergence.” The new statements E-F—G—H 
describe fields without sources. 
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15G The field F = M(x, y)it+ N(x, yj is source-free if it has these properties: 
E The total flux $F +n ds through every closed curve is zero. 
F Across all curves from P to Q, the flux {2 F+nds is the same. 


G_ There is a stream function g(x, y), for which M = dg/éy and N = — dg/éx. 
H_ The components satisfy 0M/dx + €N/dy = 0 (the divergence is zero). 
A field with one of these properties has them all. H is the quick test. 


The spin field —yi+xj passed this test (Example 6 was source-free). The field 
2xi + 3yj does not pass (Example 5 had M, + N, =5). Example 7 almost passes. 


EXAMPLE 7 The radial field R/r? = (xi + yj)/{x? + y*) has a point source at (0, 0). 
The new test H is divergence = 0M/dx + GN/dy = 0. Those two derivatives are 


éf x _ x+y? —x(2x) a of_y _ x? +y*? ~ y(2y) (12) 
dx\x?+y/ (x? +y’y? dy\x*+y27f (x? +y")?—_ ~ 

They add to zero. There seems to be no source (if the calculation is correct). The flow 
through a circle x? + y? =a? should be zero. But it’s not: 


pe dy—Ndx= fi dy — ydx){(x? + y”} = 2Aarea of circle)/a? = 22. (13) 


A source is hidden somewhere. Looking at R/r?, the problem is at (0, 0). The field is 
not defined when r = 0 (it blows up). The derivatives in (12) are not continuous. Test 
H does not apply, and was not passed. The divergence M, +N, must be a “delta 
function” of strength 2x. There is a point source sending flow out through all circles. 

I hope you see the analogy with Example 4. The field S/r? is curl-free except at r = 0. 
The field R/r? is divergence-free except at r = 0. The mathematics is parallel and the 
fields are perpendicular. A potential fand a stream function g require a region without 
holes. 


THE BEST FIELDS: CONSERVATIVE AND SOURCE-FREE 


What if F is conservative and also source-free? Those are outstandingly important 
fields. The curl is zero and the divergence is zero. Because the field is conservative, it 
comes from a potential. Because it is source-free, there is a stream function: 


M= oe og and of o & (14) 
éx éy éy Gx 
Those are the Cauchy-Riemann equations, named after a great mathematician of his 
time and one of the greatest of all time. I can’t end without an example. 


EXAMPLE & Show that yi+ xj is both conservative and source-free. Find f and g. 


Solution With M=y and N =x, check first that GM/¢y = 1=¢CN/éx. There must 
be a potential function. It is f= xy, which achieves éf/éx = y and éf/¢y =x. Note 
that fr. +hhy = 0. 

Check next that éM/éx + éN/¢y =0+4+0. There must be a stream function, It is 
g=(y? — x”), which achieves dg/dy = y and dg/éx ~ — x. Note that g,, +8,,=0. 


Read-through questions equals dy i — dxj. The divergence of Mi+ Nijis 
xithe double integralis 2 _. There (is)(is not) a source. For 


The work integral § Mdx + N dy equals the double integral F = yi the divergence is A _. The divergence of R/r? is zero 


and __¢@___, the work equals the area of R. When 
M = éf/éx and N = df/dy, the double integral is zero because 


Test D for gradient feldsis_' _. A field that passes this 
testhas$F-dR=_m__.Thereisasolutiontof,= nn and 
. Then df= Mdx+Ndyis an _p__ differential, 


The flow form of Green's Theorem is_ u = _ v__. The 
normal vectorinF -mdspoints  w and|n[=__* _andnds 1 $xdy along the circle 2 $x*ydy along the circle 


15.3 Green's Theorem 


The curves f= constant are the equipotentials. The curves g = constant are the 
streamlines (Figure 15.4). These are the twin properties—-a conservative field with 
a potential and a source-free field with a stream function. They come together into 
the fundamental partial differential equation of equilibrium—tLaplace’s equation 
Sex thy = 9. 


45H There is a potential and stream function when M, = N, and M, = — Ny. 
They satisfy Laplace’s equation: 


far thy=M,+N,=0 and 8xx + Syy = —N,+ M,=0. (15) 


If we have f without g, as in f=x?+y? and M=2x and N=2y, we don’t have 
Laplace’s equation: f,,+f,,=4. This is a gradient field that needs a source. 
If we have g without f, as in g=x? + y* and M = 2y and N = — 2x, we don’t have 
Laplace’s equation. The field is source-free hut it has spin. The first field is 2R and 
the second field is 2S. 

With no source and no spin, we are with Laplace at the center of mathematics and 
science. 


Green’s Theorem. Tangential form $F ~T ds and normal form $F -nds 


p Maéx+ndy~ {f (N,~ M,)dx dy pM dy—wax~[f (M,,+ Ny)dx dy 
¢c R ce R 


work curl flux divergence 


Conservative: work = zero, N, = M,, gradient of a potential: M =f, and N =f, 


Source-free: flux = zero, M, = ~ N,,hasa stream function: M = g,andN = ~ g, 
Conservative + source-free: Cauchy-Riemann + Laplace equations for fand g. 


45.3 EXERCISES 


$ Theorem. For F = 31+ 4j the work is_¢_. exceptat_B_. This fieldhasa__¢ __ source. 


equations of/éx = dg/dy and _ P 


The spin field S/r? passes test Dexcept at__a_. Its potential 
increases by __s _ going around the origin. The Compute the line integrals 1-6 and (separately) the double integ- 

integral [{(N.—M,)dx dy isnotzerobut__t _. rals in Green’s Theorem (1). The circle has x =acos 1, 

y=asin ¢. The triangle has sides x = 0, y= 0, x + y= 1. 


A field with no source has propertisE= D0 ,F= 
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E 


+ 


{__. The line integral is zero because__g __. An example is G=_f __, H=zero divergence. The stream function ¢g 
.ThedirectiononCis__| around the outside and satisfies the equations G . Then daM/éx+ 

|__ around the boundary of a hole. If R is broken into very ON/éy = 0 because d7g/@xdy =__H__. The example F = yihas 
simple pieces with crosscuts between them, the integrals of g=_1_. There (is)(is not) a potential function. The example 
cancel along the crosscuts, F=xi-yjhasg=_J and alsof=_K_. This f satisfies 


Laplace’s equation __|___, because the field F is both _M 


and __N__. The functions f and g are connected by the _© 
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3 $x dx along the triangle 4 $y dx along the triangle 


5 $x?ydx along the circle 6 $x*ydx along the triangle 


7 Compute both sides of Green’s Theorem in the form (10): 
(a) F = xi + yj, R = upper half of the disk x7 + y? <1. 
(b) F = x7i+ xyj, C = square with sides y =0,x = 1, y=1, 
x=0. 
8 Show that $¢(x?y + 2xkdy + xy?dx depends only on the 
area of R. Does it equal the area? 


9 Find the area inside the hypocycloid x =cos*t, y= sin*t 
from $4x dy — ydx. 


10 For constants 5 and c, how is $bydx + cxdy related to 
the area inside C? If b=7, which c makes the integra] zero? 


11 For F = grad./x* + y’, show in three ways that §F -dR = 
0 around x =cost, y=sin t. 

(a) F is a gradient field so 4 

(b) Compute F and directly integrate F + dR. 

(c) Compute the double integral in Green’s Theorem. 
12 Devise a way to find the one-dimensional theorem 


f° (dfidx}dx = f(b) — f(a) as a special case of Green’s Theorem 
when R is a square. 


13 (a) Choose x(t) and y(t) so that the path goes from (1, 0) 
to (t, 0) after circling the ongin twice. 


(b) Compute $ ydx and compare with the area inside your 
path. 

(c) Compute $(y dx — x dy)/(x? + y*} and compare with 2x 
tn Example 7. 


14 In Exampie 4 of the previous section, the work [S-dR 
between (1, 0) and (0, 1} was ! for the straight path and 2/2 
for the quarter-circle path. Show that the work is always twice 
the area between the path and the axes. 


Compute both sides of § F +n ds = {[(M, + N,) dx dy in 15-20. 
15 F = yi+ xj in the unit circle 

16 F = xyi in the unit square O< x, y<l 

17 F=R/r in the unit circle 

18 F =S/r in the unit square 

19 F =x7yj in the unit triangle (sides x= 0, y=0, x + y= 1} 
20 F = grad r in the top half of the unit circle. 


21 Suppose div F =0 except at the origin. Then the flux 
$F-nds is the same through any two circles around the 
origin, because . (What is f[(M, + N,)dx dy between 
the circles?) 


22 Example 7 has div F =0 except at the origin. The flux 
through every circle x? + y? =a? is 2m. The flux through a 
square around the origin is also 2m because . (Com- 
pare Problem 21.) 


23 Evaluate $ ax, y}dx + b(x, y\dy by both forms of Green’s 
Theorem. The choice M =a, N =b in the work form gives 
the double integral . The choice M =b, N = —ain 
the flux form gives the double integral . There was 
only one Green. 


24 Evaluate § cos*y dy — sin*x dx by Green’s Theorem. 


25 The field R/r? in Example 7 has zero divergence except at 
r=0. Solve ag/@y = x/(x? + y”) to find an attempted stream 
function g, Does g have trouble at the origin? 


26 Show that S/r? has zero divergence (except at r =0). Find 
a stream function by solving dg/@y = y/{x? + y?). Does g have 
trouble at the origin? 


27 Which differentials are exact: ydx—x dy, x*dx + y*dy, 
yrdx + x7dy? 


28 If M,+N,=0 then the equations dg/dy = and 
Og/dx = yield a stream function. If also N,=M,, 
show that g satisfies Laplace’s equation. 


Compute the divergence of each field in 29-36 and solve g, = 
M and g, = — N for a stream function (if possible). 


29 2xyi— yj 30 3xy*i- yj 
31 x*i+ yj 2 yitxy 
33 e& cos yi—e* sin yj 44 e* i -j} 
35 2yi/x + yi/x?  xyi— xyj 


37 Compute N,—M, for each field in 29-36 and find a 
potential function f when possible. 


38 The potential /(Q) is the work [2 F -Tds to reach Q from 
a fixed point P (Section 15,2). In the same way, the stream 
function g(Q) can be constructed from the integral 

Then g(Q) — g{P} represents the flux across the path from P to 
Q. Why do all paths give the same answer? 


39 The real part of (x + iy)> = x? + 3ix?y — 3xy? —iy? is f= 
x? — 3xy?. Its gradient field is F= grad f= . The 
divergence of F is . Therefore f satisfies Laplace’s 
equation f,, +/,, =9 (check that it does). 


40 Since div F=0 in Problem 39, we can solve dg/éy = 
and dg/éx = . The stream function is g = 
. It is the imaginary part of the same {x + iy)*. Check 
that f and g satisfy the Cauchy-Riemann equations. 


41 The real part fand imaginary part g of (x + iy)" satisfy the 
Laplace and Cauchy-Riemann equations for n=}, 2, sees 
(They give all the polynomial solutions.) Compute f and g for 
n=4. 


42 When is M dy — Ndx an exact differential dg? 


43 The potential f=e*cos y satisfies Laplace’s equation. 
There must be a g. Find the field F = grad f and the stream 
function g{x, y). 


15.4 Surface Integrale 573 


44 Show that the spin field S does work around every simple inside R can be squeezed to a point without leaving R. Test 
closed curve. these regions: 
1, xy plane without (0,0) 2. xyz space without (0, 0, 0) 


45 For F =f(x}j and R=unit square 0<x<1, O<y<l, 3. sphere x? 4+ y?4+z?=1 4.a torus (or doughnut) 


integrate both sides of Green’s Theorem (1). What formula is 


required from one-variable calculus? 3:8 Sweaice 6.3 human body 
7. the region between two spheres 
46 A region R is “simply connected” when every closed curve 8. xyz space with circle removed. 


Re = 15.4 Surface Integrals 


The double integral in Green’s Theorem is over a flat surface R. Now the region 
moves out of the plane. It becomes a curved surface S, part of a sphere or cylinder 
or cone. When the surface has only one z for each (x, y), it is the graph of a function 
2(x, y). In other cases S$ can twist and close up—a sphere has an upper z and a lower 
z. In all cases we want to compute area and flux. This is a necessary step (it is our 
last step) before moving Green’s Theorem to three dimensions. 


First a quick review. The basic integrals are {dx and {f dx dy and {{fdx dydz. The 
one that didn’t fit was {ds—the length of a curve. When we go from curves to 
surfaces, ds becomes dS. Area is [{ dS and flux is [[F -ndS, with double integrals 
because the surfaces are two-dimensional, The main difficulty is in dS. 

All formulas are summarized in a table at the end of the section. 


There are two ways to deal with ds (along curves). The same methods apply to dS 
{on surfaces}, The first is in xyz coordinates; the second uses parameters. Before this 
subject gets complicated, I will explain those two methods. 


Method 1 is for the graph of a function: curve (x) or surface 2(x, y). 
A small piece of the curve is almost straight. It goes across by dx and up by dy: 


length ds = \/(dx)? + (dy)* = \/1 + (dy/dx)? dx. (1) 


A small piece of the surface is practically flat. Think of a tiny sloping rectangle. One 
side goes across hy dx and up by (42/éx)dx. The neighboring side goes along by dy 
and up by (d2/dy)dy. Computing the area is a linear problem (from Chapter 11), 
because the flat piece is in a plane. 

Two vectors A and B form a parallelogram. The length of their cross product is the 
area. In the present case, the vectors are A =i+ (dz/éx)k and B=j + (dz/éy)k. Then 
Adx and Bdy are the sides of the small piece, and we compute A x B: 


ij k 
AxB=|1 0 @é2/8x| = —dz/éxi— éz/dyj+k. (2) 
0 | 6z/éy 


This is exactly the mormaf vector N to the tangent plane and the surface, from 
Chapter 13. Please note: The small flat piece is actually a parallelogram (not always 
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a rectangle). Its area dS is much like ds, but the length of N =A x B involves two 
derivatives: 


area dS =|Adx x Bdy| =|N|dx dy = ./1 + (6z/8x)? + (dz/dy)? dx dy. (3) 


EXAMPLE 4 Find the area on the plane z = x + 2y above a base area A. 


This is the example to visualize. The area down in the xy plane is A. The area up on 
the sloping plane is greater than A. A roof has more area than the room underneath 
it. If the roof goes up at a 45° angle, the ratio is afd Formula (3) yields the correct 
ratio for any surface—including our plane z = x + 2y. 


X =U COS V 
y=usinv 


zZ=U 


Fig. 15.44 Roof area = base area times |N|. Cone and cylinder with parameters u and v. 


The derivatives are 0z/0x = 1 and 0z/éy =2. They are constant (planes are easy). 
The square root in (3) contains | + 1? + 2? = 6. Therefore dS = \/6 dx dy. An area in 
the xy plane is multiplied by \/6 up in the surface (Figure 15.14a). The vectors A and 
B are no longer needed—their work was done when we reached formula (3)—but 
here they are: 


A=i+(dz/oxyk=i+k B=j+(dz/dyk=j+2k N=—i-2j+k. 


The length of N=A x Bis /6. The angle between k and N has cos # = 1/,/6. That 
is the angle between base plane and sloping plane. Therefore the sloping area is /6 
times the base area. For curved surfaces the idea is the same, except that the square 
root in |N| = 1/cos 8 changes as we move around the surface. 


Method 2 is for curves x(t), y(t) and surfaces x(u, v), y(u, v), z(u,v) with parameters. 


A curve has one parameter t. A surface has two parameters u and v (it is two- 
dimensional). One advantage of parameters is that x, y, z get equal treatment, instead 
of picking out z as f(x, y). Here are the first two examples: 


cone X =u COS v, V=uU SIN v, Z=U cylinder x =cos v, y=sinv,z=u. (4) 


Each choice of u and v gives a point on the surface. By making all choices, we get 
the complete surface. Notice that a parameter can equal a coordinate, as in z=u. 
Sometimes both parameters are coordinates, as in x =u and y=v and z=/(u, v). 
That is just z = f(x, y) in disguise—the surface without parameters. In other cases we 
find the xyz equation by eliminating u and v: 


cone (ucos v)?+(usin v?=u? or x?+y?=2? or z=,/x?+4+y? 


cylinder (cos v)? +(sinvl?=1 or x?+y?=1. 


X= COs VD 
y=sinv 
ZU 


15.4 Surface Integrals 


The cone is the graph of f= ./x? + y?. The cylinder is not the graph of any function. 
There is a line of z’s through each point on the circle x? + y? = 1. That is what z = 
u tells us: Give u ail values, and you get the whole line. Give u and v all values, and 
you get the whole cylinder. Parameters allow a surface to close up and even go 
through itself—which the graph of f(x, y) can never do, 

Actually z=./x*+ y? gives only the top half of the cone. (A function produces 
only one z.) The parametric form gives the bottom half also. Similarly y = ./1 — x* 
gives only the top of a circle, while x =cos t, y=sint goes all the way around. 

Now we find dS, using parameters. Small movements give a piece of the surface, 
practically flat. One side comes from the change du, the neighboring side comes from 
dy, The two sides are given by small vectors Adu and Bav: 

éx, ey, 62 éx, Gp, Gz 


A=aitaits, and B=_ i+ Jjta& (5) 


To find the area dS of the parallelogram, start with the cross product N =A x B: 


* 


jk 
é2 8x 6xé é a 
a a eee Ged eee (6) 


cucv éu éy Gude = éu ev 
Xe Ve Zp 


Admittedly this looks complicated—actual examples are often fairly simple. The area 
dS of the small piece of surface is |N| du dv. The length |N| is a square root: 


EXAMPLE 2. Find A and Band N=A x Band dS for the cone and cylindcr. 


The cone has x =ucosv, p=usine, z=u. The u derivatives produce A = GR/du= 
cosvi+sinvj+k. The v derivatives produce the other tangent vector B= @R/év = 
—usinvi+ucosvj. The normal vector is Ax B= —ucosvi—usinejt+uk. Its 
length gives dS: 


dS =|A x BI dude = \/(u cos v)? +(u sin v)? +4? dudv=\/2udude. 


The cylinder is even simpler: dS = du dv. In these and many other examples, A is 
perpendicular to B. The smail piece is a rectangle. Its sides have length |A|du and 
|Bidv. (The cone has |A] =u and |B| = /2, the cylinder has |A| =|B{ = 1). The cross 
product is hardiy needed for area, when we can just multiply |A| du times [Bldv. 


Remark on the two methods Method 1 also used parameters, but a very special 
choice—-u is x and v ts y. The parametric equations are x =x, p=y, 2=f(x, y). If 
you go through the long square root in (7), changing u to x and r to ¥, it simplifies 
to the square root in (3). (The terms ¢y/e¢x and x/¢y are zero; éx/éx and Cy/éy are 
1.) Still it pays to remember the shorter formula from Method 1. 

Don't forget that after computing dS, you have to integrate it. Many times the 
good way is with polar coordinates. Surfaces are often symmetric around an axis or 
a point. Those are the surfaces of revolution—which we saw in Chapter 8 and will 
come back to. 

Strictly speaking, the integral starts with AS (not dS}. A flat piece has area 
|A x BJAxAy or {A x BlJAuAv. The area of a curved surfaee is properly defined as a 
limit. The key step of calculus, from sums of AS to the integral of dS, is safe for 
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smooth surfaces. In examples, the hard part is computing the double integral and 
substituting the limits on x, y or y, v. 


EXAMPLE 3 Find the surface area of the cone z=. /x* + y? up to the height z =a. 


We use Method | (no parameters). The derivatives of z are computed, squared, and 
added: 
2 2 
a a a a | 
dx /xt+y? by fy? +9? x?+y? " x?+y? 
Conclusion: |N| = af2 and dS = aft dx dy. The cone is on a 45° slope, so the area 
dx dy in the base is multiplied by «f2 in the surface above it (Figure 15.15). The 
square root in dS accounts for the extra area due to slope. A horizontal surface has 
dS = am dx dy, as we have known all year. 
Now for a key point. The integration is down in the base plane. The limits on x and 
y are given by the “shadow” of the cone. To locate that shadow set z = ./x? + y? 
equal to z=a. The plane cuts the cone at the circle x + y? =a”. We integrate over 
the inside of that circle (where the shadow is): 


surface area ofcone= ff \/2dxdy=,/2 na’. 


shadow 


EXAMPLE 4 Find the same area using dS = /2 u du dv from Example 2. 


With parameters, dS looks different and the shadow in the base looks different. The 
circle x? + y? =a? becomes u? cos”v + u? sin*v = a”. In other words u = a. (The cone 
has z=u, the plane has z=a, they meet when u=a.) The angle parameter v goes 
from 0 to 2x. The effect of these parameters is to switch us “automatically” to polar 
coordinates, where area is r dr dé: 


2n fa 
surface area of cone = [fas = | | \/2.ududv =,/2na?. 
0 0 


Fig. 15.45 Cone cut by plane leaves shadow in the base. Integrate over the shadow. 


EXAMPLE 5 Find the area of the same cone up to the sloping plane z = 1 —} x. 


Solution The cone still has dS = /2 dx dy, but the limits of integration are changed. 
The plane cuts the cone in an ellipse. Its shadow down in the xy plane is another 
ellipse (Figure 15.15c). To find the edge of the shadow, set z = ./x* + y* equal to z= 
1 —4x. We square both sides: 


x?+y2=1—x+4x? or 3x49)? 4+y? =F. 


15.4 Surtace Integrals 


This is the ellipse in the base—where height makes no difference and z is gone. The 
area of an ellipse is xab, when the equation is in the form (x/a)? + (p/b)? = 1. After 
multiplying by 3/4 we find a = 4/3 and b= |/4/3. Then [f./2 dx dy = /2 nab is the 
surface area of the cone. 

The hard part was finding the shadow ellipse (I went quickly). Its area zab came 
from Example 15.3.2. The new part is 2 from the slope. 


EXAMPLE 6 Find the surface area of a sphere of radius a (known to be 42a’). 


This is a good example, because both methods almost work. The equation of the 
sphere is x? + y? + z* =a’, Method | writes z = \/a? — x? ~ y?. The x and y deriva- 
tives are —x/z and ~ y/z: 


ey Ga Ao a coe ae “a a’ 
* Vays f ay Piet gh Ugh gh phd cage 
The square root gives dS = a dx dy/,'a? — x? — y?. Noticc that z is gone (as it should 


be). Now integrate dS over the shadow of the sphere, which is a circle. Instead of 
dx dy, switch to polar coordinates and r dr dé: 


ar dr d@ 
ae is I eae = — 2nar/a?—¥? J, = 2na’ (8) 


This calculation is successful but wrong. 2a? is the area of the half-sphere above the 
xy plane. The lower half takes the negative square root of z? = a* — x? — y*, This 
shows the danger of Method |, when the surface is not the graph of a function. 


EXAMPLE 7 (same sphere by Method 2: use parameters) The natural choice is spheri- 
cal coordinates. Every point has an angle u = ¢ down from the North Pole and an 
angle e=@ around the equator. The xyz coordinates from Section 14.4 are x= 
asin @ cos @ y= asin ¢ sin 8, z =a cos ¢. The radius p = a is fixed (not a parameter). 
Compute the first term in equation (6), noting é2/é0 = 0: 


(CyfEb)(E2/C0) — (€z/e@\Ey/20) = —(—a sin $)(a sin d cos 6}= a? sin*¢ cos 4, 


The other terms in (6) are a* sin’ sin @ and a’ sin # cos d. Then dS in equation (7) 
squares these three components and adds. We factor out a* and simplify: 


a*(sin*d cos? + sin*@ sin?8 + sin?@ cos?) = at{sin*d + sin? cos?) = a* sind. 


Conclusion: dS =a’ sin 6 d@ d@. A spherical person will recognize this immediately. 
It is the volume clement dV = p? sin ¢ dp d@ dé, except dg is missing. The small box 
has area dS and thickness dp and volume dV. Here we only want dS: 


area of sphere = [fas 7 i | a? sin @ d@ d? = 4na?’. (9) 
18] 0 


Figure 15.16a shows a small surface with sides a d@ and a sin ¢ d@. Their product is 
dS. Figure 15,166 gocs back to Method |, where equation (8) pave dS =(a/z) dx dy. 

I doubt that you will like Figure 15.16c—and you don’t need it. With parameters 
@ and 0, the shadow of the sphere is a rectangle. The equator is the linc down the 
middie, where ¢ = 7/2, The height is z =acos @. The area d¢ dé in the base is the 
shadow of dS =a? sin ¢ d@ d@ up in the sphere. Maybe this figure shows what we 
don't have to know about parameters. 
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y 


Fig. 45.46 Surface area on a sphere: (a) spherical coordinates (b) xyz coordinates (c) 60 space. 


EXAMPLE 8 Rotate y = x? around the x axis. Find the surface area using parameters. 


The first parameter is x (from a to b). The second parameter is the rotation angle 0 
hom 0 to 2x). The points on the surface in Figure 15.17 are ar x, y= x? cos 0, 
z= x? sin 0. Equation (7) leads after much calculation to dS = x? /1 + 4x? dx dé. 

Main point: dS agrees with Section 8.3, where the area was f 2ny, /\ +(dy/dx)? dx. 
The 2x comes from the @ integral and y is x”. Parameters give this formula auto- 
matically. 


VECTOR FIELDS AND THE INTEGRAL OF F-n 


Formulas for surface area are dominated by square roots. There is a square root in 
dS, as there was in ds. Areas are like arc lengths, one dimension up. The good point 
about line integrals | F - nds is that the square root disappears. It is in the denominator 
of n, where ds cancels it: F-nds = M dy—N dx. The same good thing will now 
happen for surface integrals {{ F - ndS. 


451 Through the surface z= f(x, y), the vector field F(x, y, z)= Mi+ Nj+ Pk 
has 


Ox 


5 nS = _ywt_nG 
flux = ij F ndS nial M no +P) andy. (10) 


This formula tells what to integrate, given the surface and the vector field (f and F). 
The xy limits come from the shadow. Formula (10) takes the normal vector from 
Method 1: 


N = — éf/dxi— of/dyj+k and |N| =./1 +(dfjex)? + (Oflay). 


For the unit normal vector n, divide N by its length: n= N/|N|. The square root is in 
the denominator, and the same square root is in dS. See equation (3): 


F-ndS = 


a dxdy= (- MwA +P) aay (11) 


That is formula (10), with cancellation of square roots. The expression F « ndS is often 
written as F «dS, again relying on boldface to make dS a vector. Then dS equals ndS, 
with direction n and magnitude dS. 


45.4 Surface Integrafs 5/79 


F 7 


ee : 
Fen = 1/N6 
dS = V6 dxdy 


‘Sf xs 
yextcos &2= 4° sin @ + 


Fig. 45.47 Surface of revolution: parameters x, (?. Fig. 45.48 EF-ndS gives flow through dS. 


EXAMPLE? Find ndS for the plane z = x + 2y. Then find F-ndS for F =k. 
This plane produced /6 in Example | {for area), For flux the ./6 disappears: 


N —i-2j+k 
iN] d$§ 4 
For the flow field F =k, the dot product k + ndS$ reduces to ldx dy. The slope of the 
plane makes no difference! The flow through the base aiso flows through the plane. The 
areas are different, but flux is like rain. Whether it hits a tent or the ground below, 
it is the same rain (Figure 15.18). In this case ({ F -ndS = {{ dx dy = shadow area in 
the base. 


ndS = ./6 dx dy =(—i—2j +k) dx dy. 


EXAMPLE 10 Find the flux of F = xi+ yj + zk through the cone z = ./x? + y?. 


; x ¥ 2 2 
Solution FendS =| —x[ --p oe | - tif/xe + dx dy =0. 
Ge + =) (3 ~ =) yee | a 


The zero comes as a surprise, but it shouldn’t. The cone goes straight out from the 
origin, and so docs F. The vector n that is perpendicular to the cone is also perpendic- 
ular to F. There is no flow through the cone, because F+n=0, The flow travels out 
along rays. 


JJ F-ndS FOR A SURFACE WITH PARAMETERS 


In Example 10 the cone was z =f(x, y)=./x? + y?. We found dS by Method 1. 
Parameters were not needed (more exactly, they were x and y). For surfaces that fold 
and twist, the formulas with u and v look complicated but the actual calculations can 
be simpler. This was certainly the case for dS = du dv on the cylinder. 

A small piece of surface has area dS = |A x B| du dv. The vectors along the sides are 
A=x,i+ yj +z.k and B=x,i+ y,j+z,k. They are tangent to the surface. Now we 
put their cross product N = A x B to another use, because F -ndS involves not only 
area but direction. We need the unit vector n to see how much flow goes through. 

The direction vector is n= N/|N|. Equation (7) is dS =|N|du dz, so the square root 
JN| cancels in ndS. This leaves a nice formula for the “normal component” of flow: 


45J Through a surface with parameters y and », the fteld F = Mi+ Nj+ Pk 
has 


flux = {fr «ndS = {lr °N dudv= {lr *(A x B) du dv. (12) 
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EXAMPLE 44 Find the flux of F = xi+}j+zk through the cylinder x7 + )?=1 
O<z<b. 


. 


Solution The surface of the cylinder is x = cos u, y = sin u, z = ¢. The tangent vectors 
from (5) are A =(—sin u)i+ (cos u)j and B=k. The normal vector in Figure 15.19 
goes straight out through the cylinder: 


N=Ax B=cosuit+sinuj (check A-N=0 and B’>N =0), 
To find F-N, switch F = xi + yj + zk to the parameters u and r. Then F-N = I: 
F-N=(cos uit sinuj+ek)+(cos vi+sin uj) =cos?u 4 sin?y. 
For the flux, integrate F~N = 1 and apply the limits on u=@ and »=z: 
h fin 
flux -| | ] duds = 2x6 = surface area of the cylinder. 
o Jo 


Note that the top and bottom were not included! We can find those fluxes too, The 
outward direction is n=k at the top and n= —k down through the bottom. Then 
F-nis +z=b6 at the top and —z=0 at the bottom. The bottom flux is zero, the top 
flux is b times the area (or 2b). The total flux is 225 + x) = 3x6. Hold that answer 
for the next section. 

Apology: I made u the angle and rv the height. Then N goes outward not inward. 


EXAMPLE 12 Find the flux of F =k out the top half of the sphere x* + y? + 2? =a’. 
Solution Use spherical coordinates. Example 7 had u= ¢ and v= 8. We found 
N=Ax B=a’ sin’ cos Gita’ sin’¢ sin @j +a’ sin d cos Pk. 


The dot product with F =k is F-N =a? sin & cos @. The integral goes from the pole 
to the equator, dé =0 to @ = 2/2, and around from ? =0 to 4 = 2n: 


2 m2 oon m2 
flux -| | a’ sin @ cos ¢ dd d@ = 2na? = ‘ = 7a’. 


Qo Q 0 


The next section will show that the flux remains at a* through any surface {!) that 
is bounded by the equator. A special case is a flat surface—the disk of radius a at 
the equator. Figure 15.18 shows n=k pointing directly up, so F-n=k~k = 1. The 
flux is sj | dS = area of disk = na?. All fluid goes past the equator and out through the 
sphere. 


Fig. 15.49 Flow through cylinder. Fig. 15.20 Mdbius strip (no way to choose n). 


FORMULAS 
FOR 
SURFACE 
INTEGRALS 


15.4 Surface Integrals 


I have to mention one more problem. It might not occur to a reasonable person, but 
sometimes a surface has only one side. The famous example is the Mobius strip, for 
which you take a strip of paper, twist it once, and tape the ends together. Its special 
property appears when you run a pen along the “inside.” The pen in Figure 15.20 
suddenly goes “outside.” After another round trip it goes back “inside.” Those words 
are in quotation marks, because on a Mobius strip they have no meaning. 

Suppose the pen represents the normal vector. On a sphere n points outward. 
Alternatively n could point inward; we are free to choose. But the MGbius strip makes 
the choice impossible. After moving the pen continuously, it comes back in the 
opposite direction. This surface is not orientable. We cannot integrate F +n to compute 
the flux, because we cannot decide the direction of n. 

A surface is oriented when we can and do choose n. This uses the final property of 
cross products, that they have length and direction and also a right-hand rule. We 
can tell A x B from B x A. Those give the two orientations of n. For an open surface 
(like a wastebasket) you can select either one. For a closed surface (like a sphere) it 
is conventional for n to be outward. By making that decision once and for all, the 
sign of the flux is established: outward flux is positive. 


45.4 EXERCISES 
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Read-through questions 


A small piece of the surface z = f(x, y) is nearly __a@__. When 
we go across by dx, we go up by _6 _. That movement is 
Adx, where the vector A is i+__¢ _. The other side of the 
piece is Bdy, where B=j+__¢ __. The cross product A x B 
is N=__e __. The area of the piece is dS = |N|dx dy. For the 
surface z= xy, the vectorsareA=__' andB=_g and 


N=_h_. The area integral is [/dS=__' _dxdy. 


With parameters u and v, a typical point on a 45° cone is 
x=ucosv, y=__|__, z=__k__. A change in u moves that 
point by Adu =(cos vi+__!__)du. A change in v moves the 
point by Bdv=__m_. The normal vector is N=A x B= 

n__. The areaisdS=__o _ dudv. In this example A- B= 


pP__ so the small piece isa __q__ and dS =|A||B|dudbv. 


For flux we need ndS. The _+' _ vectornis N=AxB 
divided by __s__. For a surface z =/(x, y), the product ndS 


is the vector __! (to memorize from table). The particular 


surface z=xyhasndS=_u___dxdy. For F= xi+ yj + zk the 


flux through z=xy is F-ndS=_v___dxdy. 


On a 30° cone the points are x = 2u cos v, y = 2u sin v, z = 
u. The tangent vectors are A=_wW__ and B=__-x_.. This 
cone has ndS=A x Bdudv=_y__. For F=xi+ yj + zk, 
the flux element through the cone is F-ndS=_ 2 __. The 


reason for this answer is__A__. The reason we don’t compute 
flux through a Mobius strip is__8 _. 


In 1-14 find N and dS =|N| dx dy and the surface area {{ dS. 
Integrate over the xy shadow which ends where the z’s are equal 
(x? + y? =4 in Problem 1). 


1 Paraboloid z = x? + y? below the plane z = 4. 
2 Paraboloid z = x? + y? between z =4 and z=8. 
3 Plane z = x —y inside the cylinder x? + y? =1. 


4 Plane z= 3x + 4y above the square 0<x<1,0<y<l. 
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5 Spherical cap x? + y? + 2? = 1 above z= 1/,/2. 
6 Spherical band x? + y? +z? =1 between z =O and 1/,/2. 
7 Plane z=7y above a triangle of area A. 
8 Cone z7 =x? + y* between planes z=a and z=. 
9 The monkey saddle z=4x° — xy? inside x? + y? =1. 
10 z=x + y above triangle with vertices (0, 0), (2, 2), (0, 2). 
11 Plane z= 1— 2x —2y inside x >0, y20,z20. 


12 Cylinder x*+2z?= <a? inside x74 y?=a?. Only set up 
§f as. 


{3 Right circular cone of radius a and height h. Choose 
z= f(x, y) or parameters u and v. 


14 Gutter z= x? below z =9 and between y= +2. 


In 15-18 compute the surface integrals {f g(x, y, z)dS. 

18 g=xy over the triangle x + y+z=1, x,y,z 20. 

16 g=x? + y? over the top half of x? + y? + 2? = 1 (use ¢, 6). 
17 g=xyz on x? +y?+4z7=1 above z*? =x? + y? (use ¢, 9), 
18 g=xon the cylinder x? + y? = 4 between z = 0 and z =3. 


In 19-22 calculate A, B, N, and dS. 

19 x=u,y=vtuz=v+2ud+l. 

20 x=auv, youtu,z=u—v. 

21 x=(3+ cos u) cos v, y=(3+cos u) sin v, z =sinu. 
22 x=ucoso, y=usiny, z= (not z=#). 


23-26 In Problems 1-4 respectively find the flux {fF -adS 
for F = xi+ yj+ zk. 


27-28 In Problems 19~20 respectively compute {fF - adS for 
F = yi — xj through the region u? +p? < 1. 


29 A unit circle is rotated around the z axis to give a torus 
{see figure). The center of the circle stays a distanee 3 from 
the z axis. Show that Problem 21 gives a typical point (x, y, z) 
on the torus and find the surface area jj dS = ff |N| dude. 


30 The surface x =r cos 6, y=r sin 6, z =a? —r? is bounded 
by the equator (r =a). Find N and the flux {{k-+ndS, and 
compare with Example 12. 


31 Make a “double Mobius strip” from a strip of paper by 
twisting it twice and taping the ends. Does a normal vector 
{use a pen) have the same direction after a round trip? 


32 Make a “triple MGbius stnp” with three twists. Is it 
orientable—does the normal vector come back in the same 
or opposite direction? 


33 Ifa very wavy surface stays close to a smooth surface, are 
their areas close? 


34 Give the equation of a plane with roof area dS = 3 times 
base area dx dy. 


38 The points (x, f(x) cos @, f(x) sin 6) are on the surface of 
revolution: y=f(x) revolved around the x axis, parameters 
u=x and v=@. Find N and compare dS =|N|dxd@ with 
Example 8 and Section 8.3. 


45.5 The Divergence Theorem [x 


This section returns to the fundamental law (flow out) — (flow in) = (source). In two 
dimensions, the flow was in and out through a closed curve C. The plane region 
inside was R. In three dimensions, the flow enters and leaves through a closed surface 
5. The solid region inside is V. Green’s Theorem in its normal form (for the flux of a 
smooth vector field) now becomes the great three-dimensional balance equation— 


the Divergence Theorem: 


15.5 The Divergence Theorem 


45K The flux of F = Mit+ Nj + Pk through the boundary surface S equals the 
integral of the divergence of F inside V. The Divergence Theorem is 


F OM ON. OP 
[frrmis [[fawean= [{f(Zi+ + 2) accra (t) 


In Green’s Theorem the divergence was ¢M/dx +dN/éy. The new term éP/éz 
accounts for upward flow. Notice that a constant upward component P adds nothing 
to the divergence (its derivative is zero). It also adds nothing to the flux (flow up 
through the top equals flow up through the bottom). When the whole field F is 
constant, the theorem becomes 0 = 0. 

There are other vector fields with diy F = 0. They are of the greatest importance. 
The Divergence Theorem for tbose fields is again 0 = 0, and there is conservation of 
fluid. When div FF =0, flow in equals flow out. We begin with examples of these 
“divergence-free” fields. 


EXAMPLE 4 The spin fields — yi+ xj + Ok and 0i— zj + yk have zero divergence. 


The first is an old friend, spinning around the z axis. The second is new, spinning 
around the x axis. Three-dimensional flow has a great variety of spin fields. The 
separate terms €M/dx, ON/éy, 6P/éz are all zero, so div F = 0. The flow goes around 
in circles, and whatever goes out through § comes back in. (We might have put a 
circle on {f, as we did on $¢, to emphasize that S is closed.) 


EXAMPLE 2 The position field R = xi+ yj+ zk has divR=1+1+1=3. 


This is radia! flow, straight out from the origin. Mass has to be added at every point 
to keep the flow going. On the right side of the divergence theorem is {ff3dV. 
Therefore the flux is three times the volume. 

Example 11] in Section 15.4 found the flux of R through a cylinder. The answer 
was 3nb. Now we also get 3nb from the Divergence Theorem, since the volume is b. 
This is one of many cases in which the triple integral is easier than the double integral. 


EXAMPLE 3 An electrostatic field R/p° or gravity field —R/p> almost has div F = 0. 


The vector R = xi+ yj+ zk has length ./x? + y?+2* =p. Then F has length p/p? 
(inverse square law). Gravity from a point mass pulls inward (minus sign). The electric 
field from a point charge repels outward. The three steps almost show that div F = 0: 


Step 1. dp/dx = x/p, dp/éy = y/o, ép/éz = z/p—but do not add those three. F is not 
p or |/p? (these are scalars). The vector field is R/p*. We need M/éx, @N/éy, 8P/dz. 


Step 2. @M/éx = d/éx{x/p*) is equal to 1/p°> — (3x ép/éx)/p* = 1/p> — 3x7/p*. For 
aN/éy and @P/éz, replace 3x? by 3y* and 3z?. Now add those three. 


Step 3. div F = 3/p? — 3(x? + y? + 27)/p* = 3/p3 — 3/p? =0. 


The calculation div F = 0 leaves a puzzle. One side of the Divergence Theorem seems 
to give [{fOdV=0. Then the other side should be {fF + ndS =0. But the flux is not 
zero when ail flow is outward: 


The unit normal vector to the sphere p = constant is n= R/p. 
The outward flow F-n=(R/o*):(R/p) = p?/p* is always positive. 
Then fj F +ndS = |{ dS/p? = 42p7/p? =4n. We have reached 4n = 0. 
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This paradox in three dimensions is the same as for R/r? in two dimensions. 
Section 15.3 reached 2x =0, and the explanation was a point source at the origin. 
Same explanation here: M, N, P are infinite when p =0. The divergence is a “delta 
function” times 4z, from the point source. The Divergence Theorem does not apply 
(unless we allow delta functions). That single point makes all the difference. 

Every surface enclosing the origin has flux = 4x. Our calculation was for a sphere. 
The surface integral is much harder when S is twisted (Figure 15.21a). But the Diver- 
gence Theorem takes care of everything, because div F = 0 in the volume V between 
these surfaces. Therefore {{F-ndS=0 for the two surfaces together. The flux 
Jj F -ndS = — 4r into the sphere must be balanced by {{ F - ndS = 4z out of the twisted 
surface. 


(P,- P,) dS ~ (dPidz) dV, 
(P,- Po) dS ~ (dPidz) dV, 
(P,— Py) dS ~ SUM > INTEGRAL 


Fig. 15.24 Point source: flux 42 through all enclosing surfaces. Net flux upward 
mt S{J(@P/dz)d V. 


Instead of a paradox 4x = 0, this example leads to Gauss’s Law. A mass M at the 
origin produces a gravity field F = —GMR/p*. A charge q at the origin produces an 
electric field E = (q/479)R/p*. The physical constants are G and é9, the mathematical 
constant is the relation between divergence and flux. Equation (1) yields equation (2), 
in which the mass densities M(x, y,z) and charge densities g(x, y,z) need not be 
concentrated at the origin: 


45L Gauss’s law in differential form: div F = —4nxGM and div E=q/é. 
Gauss’s law in integral form: Flux is proportional to total mass or charge: 


[Jr-mas——[[faxcmay ana |e-nas= {[{aaviey Q) 


THE REASONING BEHIND THE DIVERGENCE THEOREM 


The general principle is clear: Flow out minus flow in equals source. Our goal is to 
see why the divergence of F measures the source. In a small box around each point, 
we show that div F dV balances F-ndS through the six sides. 

So consider a small box. Its center is at (x, y, z). Its edges have length Ax, Ay, Az. 
Out of the top and bottom, the normal vectors are k and —k. The dot product with 
F = Mi+ Nj+ Pk is +P or —P. The area AS is AxAy. So the two fluxes are close 
to P(x, y,z+4Az)AxAy and — P(x, y,z—4Az)AxAy. When the top is combined with 
the bottom, the difference of those P’s is AP: 


net flux upward =~ APAxAy =(AP/Az)AxAyAz = (€P/éz)AV. (3) 


45.5 The Divergence Theorem 


Similarly, the combined flux on two side faces is approximately (@N/dy)AV. On the 
front and back it is (@M/éx)AV. Adding the six faces, we reach the key point: 


flux out of the box ~(GM/dx + ON/dy + @P/dz)AV. (4 


This is (div FJAV. For a constant field both sides are zero—the flow goes straight 
through. For F = xi + yj + 2k, a little more goes out than comes in. The divergence 
is 3, so 3AV is created inside the box. By the balance equation the flux is also 3AV. 

The approximation symbol ~ means that the leading term is correct (probably 
not the next term). The ratio AP/Az is not exactly ¢P/éz. The difference is of order 
Az, so the error in (3) is of higher order AVAz. Added over many boxes (about 1/AV 
boxes), this error disappears as Az > 0. 

The sum of (div F)AV over all the boxes approaches f{{(div F)¢V. On the other 
side of the equation is a sum of fluxes. There is F-nAS out of the top of one box, 
plus F -nAS out of the bottom of the box above. The first has n =k and the second 
has n= —k. They cancel each other—the flow goes from box to box. This happens 
every time two boxes meet. The only fluxes that survive (because nothing cancels 
them) are at the outer surface S. The final step, as Ax, Ay, Az — 0, is that those outside 
terms approach [jf F -ndS. Then the local divergence theorem (4) becomes the global 
Divergence Theorem (1). 


Remark on the proof That “final step” is not easy, because the box surfaces don't 
line up with the outer surface S. A formal proof of the Divergence Theorem would 
imitate the proof of Green’s Theorem. On a very simple region [ff (@P/dz)dx dy dz 
equals {f Pdxdy over the top minus | Pdx dy over the bottom. After checking the 
orientation this is {f Pk-ndS. Similarly the volume integrals of M/dx and dN/@y are 
the surface integrals [{ Mi-mdS and {{ Nj-mdS. Adding the three integrals gives the 
Divergence Theorem. Since Green’s Theorem was already proved in this way, the 
reasoning behind (4) is more helpful than repeating a detailed proof. 

The discoverer of the Divergence Theorem was probably Gauss. His notebooks 
only contain the outline of a proof—but after all, this is Gauss. Green and Ostrograd- 
sky both published proofs in 1828, one in England and the other in St. Petersburg 
(now Leningrad). As the theorem was studied, the requirements came to light (smooth- 
ness of F and S, avoidance of one-sided Mébius strips). 

New applications are discovered all the time—when a scientist writes down a bal- 
ance equation in a small box. The source is known. The equation is div F = source. 
After Example 5 we explain F. 


EXAMPLE 4 If the temperature inside the sun is T =1n 1|/p, find the heat flow F = 

— grad T and the source div F and the flux [[ F+ndS. The sun is a ball of radius pg. 

Solution F is —grad In 1/p = +grad In p. Derivatives of In p bring division by p: 
F = (dp/éx i+ dp/dy j + dp/éz k)/p = (xi+ yj + zk)/p?. 


This flow is radially outward, of magnitude !/p. The normal vector n is also radially 
outward, of magnitude 1. The dot product on the sun’s surface is 1/po: 


[Jr «nd§ = | fesie = (surface area)/py =4p3/py = 4xp. (5) 


Check that answer by the Divergence Theorem. Example 5 will find div F = 1/p7. 
Integrate over the sun. In spherical coordinates we integrate dp, sin ddd, and dé: 


i div F dV= I, a ie po? sin @ do dd d6/p? = (p5)(2)(2n) as in (5). 
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This example illustrates the basic frarnework of equilibrium. The pattern appears 
everywhere in applied mathematics—electromagnetism, heat flow, elasticity, even 
relativity. There is usually a constant c that depends on the material (the example 
has c= 1). The names change, but we always take the divergence of the gradient: 


potential f > force field —c grad f. Then div(—c grad f)=electric charge 
temperature T+ flowfteld —c grad T. Then div(—c grad T)=heat source 


displacement u — stress field +c gradu. Then div(—c grad u)= outside force. 


You are studying calculus, not physics or thermodynamics or elasticity. But please 
notice the main point. The equation to solve is div(—c grad f) = known source. The 
divergence and gradient are exactly what the applications need. Calculus teaches the 
right things. 

This framework is developed in many books, including my own text Introduction 
to Applied Mathematics (Wellesley-Cambridge Press). It governs equilibrium, in mat- 
rix equations and differential equations. 


PRODUCT RULE FOR VECTORS: INTEGRATION BY PARTS 


May I go back to basic facts about the divergence? First the definition: 
F(x, y, 2) = Mi+ Nj + Pk has div F=V+F =dM/dx + 6N/éy + OP/éz. 


The divergence is a scalar (not a vector). At each point div F is a number. In fluid 
flow, it is the rate at which mass leaves—the “flux per unit volume” or “flux density.” 
The symbol V stands for a vector whose components arte operations not numbers: 


V =“del” =i d/éx +j d/dy + k 0/dz. (6) 

This vector is illegal but very useful. First, apply it to an ordinary function f(x, y, z): 
Vf =“del f" =i offox + j of fey + k Of /éz = gradient of f. (7) 

Second, take the dot product V-F with a vector function F(x, y, z)= Mi+ Nj+ Pk: 
V-F = “del dot F” = 0M/éx + @N/éy + dP/éz = divergence of F. (8) 

Third, take the cross product V x F. This produces the vector curl F (next section): 
V x F = “del cross F” =... (to be defined) ... = curi of F. (9) 


The gradient and divergence and curl are V and V- and V x. The three great opera- 
tions of vector calculus use a single notation! You are free to write V or not—to 
make equations shorter or to help the memory. Notice that Laplace’s equation 


shrinks to 
epee yor é fof d fof\ _ 
= a(S) + a (2) +a(Z)-0 va 


Equation (10) gives the potential when the source is zero (very common). F = grad f 
combines with div F = 0 into Laplace’s equation div grad f=0. This equation is so 
important that it shrinks further to V?f=0 and even to Af=0. Of course Af= 
Fux + Sy +f, has nothing to do with Af=f(x + Ax) — f(x). Above all, remember that 
fis a scalar and F is a vector: gradient of scalar is vector and divergence of vector is 
scalar. 


15.5 The Divergence Theorem 


Underlying this chapter is the idea of extending calculus to vectors. So far we have 
emphasized the Fundamental Theorem. The integral of df/dx is now the integral of 
div F. Instead of endpoints a and b, we have a curve C or surface S. But it is the rules 
for derivatives and integrals that make calculus work, and we need them now for 
vectors. Remember the derivative of w times v and the integral (by parts) of u dv/dx: 


45M Scalar functions u(x, y, z) and vector fields V(x, y, z) obey the product rule: 
div(u¥) = u div V + V «(grad u). (11) 


The reverse of the product rule is integration by parts (Gauss’s Formuia): 


{ff div V dxdydz= — {{[v (grad u) dx dy dz+ {| VindS. (12) 


For a plane field tbis is Green’s Formula (and u=1 gives Green’s Theorem): 


8M aN Gu Bi ae 
— + —_— — + ° i 
{| Ax ) a =) dxdy fica +Nj)-nds. (13) 


Those look like heavy formulas. They are too much to memorize, unless you use 
them often. The important point is to connect vector calculus with “scalar calculus,” 
which is not heavy. Every product rule yields two terms: 


(uM), =udM/éx+M Gujéx (uN),=udN/dy+N éu/éy (uP), =u dP/dz + P 6u/dz. 


Add those ordinary rules and you have the vector rule (11) for the divergence of uV. 

Integrating the two parts of div(u¥) gives {fuV-ndS by the Divergence Theorem. 
Then one part moves to the other side, producing the minus signs in (12) and (13). 
Integration by parts leaves a boundary term, in three and two dimensions as it did in 
one dimension: fuv'dx = — [u'vdx + [uv]?. 


EXAMPLE 5 Find the divergence of F = R/p”, starting from grad p = R/p. 


Solution Take V=R and u=1/p? in the product rule (11). Then div F = (div R)/ 
p* +R°(grad 1/p?). The divergence of R= xi+ yj + zk is 3. For grad 1/p? apply the 
chain rule: 


R-(grad t/p?) = —2R-(grad p)/p? = — 2R+ R/p* = — 2/p’. 


The two parts of div F combine into 3/p? — 2/p? = 1/p?—as claimed in Example 4. 


EXAMPLE 6 Find the balance equation for flow with velocity V and fluid density p. 


¥ is the rate of movement of fluid, while p¥ is the rate of movement of mass. 
Comparing the ocean to the atmosphere shows the difference. Air has a greater 
velocity than water, but a much lower density. So normally F = p¥ is larger for the 
ocean. (Don’t confuse the density 9 with the radial distance p. The Greeks only used 
24 letters.) 

There is another difference between water and air. Water is virtually incompressible 
(meaning p =constant). Air ts certainly compressible (its density varies). The balance 
equation is a fundamental law—the conservation of mass or the “continuity equation” 
for fluids. This is a mathematical statement about a physical flow without sources or 
sinks: 

Continuity Equation. div{p¥) + Cp/ét = 9. (14) 
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Explanation: The mass in a region is [ff dV. Its rate of decrease is — fff ép/ée dV. 
The decrease comes from flow out through the surface (normal vector n). The dot 
product F-n=pV¥°-n is the rate of mass flow through the surface. So the integral 
{JF -ndS is the total rate that mass goes out. By the Divergence Theorem this is 
\[[ div F av. 

To balance — [{[ Gp/ét dV in every region, div F must equal — @p/¢t at every point. 
The figure shows this continuity cquation (14) for Row in the x direction. 


mass in extra mass out mass loss 
pv dS at UpV\ds ar ~ —dpdSax 


Fig. 45.22 Conservation of mass during time dt: d(p¥)idx + dpjdt = 0. 


45.5 EXERCISES 


Read-through questions 


. The flux of F 
through a closed surface S is the double integral _ 6 _. The 
divergence of Mi+ Nj+Pkis__¢ ,and it measures __d 
The total source is the triple integral __e _. That cquals the 
flux by the = ¢ = Theorem. 


In words, the basic balance law is a 


For F = 5zk the divergence is_ 9 __. If V is a cube of side 


a then the triple integral equals __h__. The top surface where 
z=ahasn=__! and F-n=__! _. The bottom and sides 


have F-n=__k__. The integral [[F+ndS equals __| 


The field F=R;p* has div F=0 except _om_. [J F+ndS 
equals _ 9 ~~ over any surface around the orgin. This 
illustrates Gauss’s Law © __. The field F = xi ~ yj — 2zk has 
divF=_ pe and j{F-ndS=_a_. For this F, the flux out 
through a pyramid and in through its base are ___ 


The symbol V stands for ___s__. In this notation div F is 
1. The gradient of fis _u __. The divergence of grad f 


is __¥__. The equation div grad f=0 is __w__’s equation. 


The divergence of a product is diviu¥} =__x__. Integration 
by parts is |[fudivVdxdyd>=_y + 2. In two 
.dimensions this becomes __A__. In one dimension it becomes 
B_. For steady fluid flow the continuity cquation ts 
div p¥=_C¢ 
In 1-10 compute the flux {{ F - ndS by the Divergence Theorem. 
1 F=xi+xjixk, S: unit sphere x? +5? +22 = 1. 
2 F=—-vi+xp Vi unit cube O<x<lOgy<!O<c<l. 
3 F=x7i+y?j4+ 27k. S: unit sphere 
4 F=x744+8y?j 427k, V: unit cube. 
§ F=xi-2yj. S:sidesv¥ =O, y=0,5=0,x+ytr=1. 
6 F=u,={xi+ sj +zk}ip. S$: sphere p=a 
7 F =p(xi~ yjt+ck), S: sphere p—a 


8 F=xci+y°j4+2°k, S: sphere p=a. 
9 F=27k, V: upper half of ball p <a. 
10 F =grad (xe sin z),  S: sphere p = a. 


1H Find {{] div(x?i+ yj + 2k)dV in the cube O< x, yz <a. 
Also compute n and Jf -nd§ for all six faces and add. 


12 When «@ is small in problem 11, the answer is close to cx*. 
Find the number ¢. At what point does div F =c? 
13 (a) Integrate the divergence of F = pi in the ball p <a. 
(b) Compute |[F-ndS over the spherical surface p = a. 
14 Integrate Jf R-ndS over the faces of the box O<x< 1, 
0<1<2,0<2 <3 and check by the Divergence Theorem. 
18 Evaluate |{F+ndS when F = xi+ 27j + y*k and: 
{a} $ is the cone z? = x? + y? bounded above by the plane 
el) 
{b) S ts the pyramid with corners (0, 0, 0), (1, 0, 0), (0, 1.0). 
(0, 0, 1). 


16 Compute all integrals in the Divergence Theorem when 
F = x(i+j—k) and V is the unit cube OS x, 2 < I. 


17 Following Example 5. compute the divergence of 
(xi ~ yj —zkyp’. 
18 (grad f)+ mis the derivative of fin the direction 
J Atis also written efecn. If fh. th, +f. =0 in Vo 

derive [Jefe dS — 0 from the Divergence Theorem. 
19 Describe the closed surface § and outward normal n: 

(a) F = hollow ball | <x? 41° 427 <9. 

{b) = solid cylinder x? + 4° <1, |z| <7. 

(c) kK =pyramid x20, 120.220, x+2¢4+3r<1. 

(d)  =solid cone x7 +)7 <2? <1. 
20 Give an example where [{F+ndS is easier than 
[[jdiv F dV. 
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21 Suppose F = M(x, y)it+ N(x, y)j, R is a region in the xy 
plane, and (x, y, z) is in V if (x, y) is in R and |2| <1. 
(a) Describe V and reduce [{jdivFdV to a double 
integral. 
(b} Reduce jj F -ndS to a line integral (check top, bottom, 
side}. 
(c) Whose theorem says that the double integral equals 
the line integral? 


22 Is it possible to have F-n=0 at all points of S and also 
diy F = 0 at all points in V? F=06 is not allowed, 


23 Inside a solid ball (radius a, density 1, mass M = 4na/3) 
the gravity field is F= — GMR/a’, 
{a) Check div F = — 42G in Gauss’s Law. 
(b) The force at the surface is the same as if the whole 
mass M were 


(c} Find a gradient field with div F =6 in the ball p<a 
and div F = 0 outside. 


24 The outward field F=R/p? has magnitude |F{=1/p?. 
Through an area A on a sphere of radius p, the flux is 

. A spherical box has faces at p, and p, with A= 
pi sin dddd@ and A= p72 sin ddbdé. Deduce that the flux 
out of the box is zero, which confirms diy F =0. 


15.6 Stokes’ Theorem and the Curl of F 


25 In Gauss’s Law, what charge distribution q(x, y, z) gives 
the unit field E = u,? What is the flux through the unit sphere? 


26 Ifa fluid with velocity V is incompressible (constant den- 


sity 9), then its continuity equation reduces to . [Fit 
is irrotational then F = grad. If it is both then / satisfies 
equation. 


27 True or false, with a good reason. 
(a) If {f{ F -mdS =0 for every closed surface, F is constant. 
(b) If F = grad f then div F =0. 
(c) If |F/ <1 at ali points then [ff div F dV < area of the 
surface S, 
(d) If |F| < 1 at all points then |div F| <1 at all points, 


28 Write down statements E~F-G-H for source-free fields 
F(x, y, z} in three dimensions. In statement F, paths sharing 
the same endpoint become surfaces sharing the same bound- 
ary curve. In G, the stream function becomes a vector field 
such that F =curl g. 


29 Describe two different surfaces bounded by the circle 
x? + y? = 1, z=0. The field F automatically has the same flux 
through both if ‘ 


WM The boundary of a bounded region R has no boundary. 
Draw a plane region and explain what that means. What does 
it mean for a solid bal]? 


For the Divergence Theorem, the surface was closed. S was the boundary of V. 
Now the surface is not closed and S has its own boundary—a curve called C. We 
are back near the original setting for Green’s Theorem (region bounded by curve, 
double integral equal to work integral), But Stokes’ Theorem, also called Stokes’s 
Theorem, is in three-dimensional space. There is a curved surface S bounded by a 
space curve C. This is our first integral around a space curve. 

The move to three dimensions brings a change in the vector field. The plane field 


F(x, y)= Mi+ Nj becomes a space field Fix, y,z)= 


Mi+Nj+Pk. The work 


Mdx + Ndy now includes Pdz. The critical quantity in the double integral (it was 
6N/dx —6M/dy} must change too. We called this scalar quantity “curl F,” but in 
reality it is only the third component of a vector. Stokes’ Theorem needs ail three 
components of that yector—which is curl F. 


DEFINITION The curl of a vector field F(x, y, z) = Mi+ Nj+ Pk is the vector field 


curl P= (Z-2) (# 
cy = az 


oP \, (= mt) () 


~ éxf \ax ay 
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The symbol V x F stands for a determinant that yields those six derivatives: 


i j k 
cull F=VxF=|é/éx d/dy éféz}. (2) 
M N P 


The three products id/éy P and j d/éz M and k d/dx N have plus signs. The three 
products like k é/@y M, down to the left, have minus signs, There is a cyclic symmetry. 
This determinant helps the memory, even if it looks and is illegal. A determinant is 
not supposed to have a row of vectors, a row of operators, and a row of functions. 


EXAMPLE 4 The plane field M(x, yji+ N(x, yj has P=0 and 8@M/éz=0 and 
8N/dz =0, Only two terms survive: curl F =(@N/éx — 0M/dy)k. Back to Green. 


EXAMPLE 2. The cross product a x R is a spin field §. Its axis is the fixed vector a = 
@,i+a,j+a3k. The flow in Figure 15.23 turns around a, and its components are 
i j k 
S=axR=la; a; 43| =(a,7—asyi+(asx—azjt(ny—axk. 3) 
x y 2 
Our favorite spin field — yi + xj has (a,, a2, a) = (0, 0, 1) and its axis is a=k. 


The divergence of a spin field is M, + N, + P, =0+0 +0. Note how the divergence 
uses M, while the curl uses N, and P,. The curl of S is the vector 2a: 


oP @N\. {GM @P\, (GN OM : 4 
(F- Fh (F- ax + (Fe k= 208+ 205+ Zak = 20, 
This example begins to reveal the meaning of the curl. It measures the spin! The direc- 


tion of curl F is the axis of rotation—in this case along a, The magnitude of curl F is 
twice the speed of rotation, In this case [curl F| = 2|a] and the angular velocity is [al. 


AXIS 2 

S=axR R=.xi+yj+ck 
curl § = 2a curi R=0 
div S=0 ares 


Fig. 15.23 Spin field S =a x R, position field R, velocity field (shear field) V = zi, any field F. 


EXAMPLE 3 (!1) Every gradient field F = Cf/éxi+ éf/éy 4+ éfiézk has curl F =0: 
daf @éf\, @écf @aéfr. cof 4 ef 
iF=;—---- = +4—S - SS +[—— -—= |k=0. 4 
se & dz éz Zhi (Z ex 6x ez éx dy ey éx 4 
Always f,. equals f,,. They cancel. Also f,. =/,, and f,, =f,,. So curl grad f= 0. 


15.6 Stokes’ Theorem and the Curl of F 


EXAMPLE 4 (twin of Example 3) The divergence of curl F is also automatically 
zero: 


df@P aN a/0M aP d/eéN 0M 
civ curl B= (FY) 4 oe a (RB a0 (5) 


Again the mixed derivatives give P,, = P,, and N,, =N,, and M,, = M,,. The terms 
cancel in pairs. In “curl grad” and “diy curl”, everything is arranged to give zero. 


The spin field S has no divergence. The position field R has no curl. R is the gradient 
of f=4x? + y? +27). S is the curl of a suitable F. Then div S=divy curl F and 
curl R =curl grad f are automatically zero. 

You correctly believe that curl F measures the “spin” of the field. You may expect 
that curl (F + G) is curl F + curl G. Also correct. Finally you may think that a field 
of parallel vectors has no spin. That is wrong. Example 5 has parallel vectors, but 
their different lengths produce spin. 


EXAMPLES The field V = zi in the x direction has curl ¥ =j in the y direction. 


If you put a wheel in the xz plane, this field will turn it. The velocity zi at the top of 
the wheel is greater than zi at the bottom (Figure | 5.23c). So the top goes faster and 
the wheel rotates. The axis of rotation is curl V =j. The turning speed is 3, because 
this curl has magnitude 1. 

Another velocity field y= — xk produces the same spin: curl ¥ =j. The flow is in 
the z direction, it varies in the x direction, and the spin is in the y direction. Also 
interesting is V+. The two “shear fields” add to a perfect spin field S = 2i — xk, 
whose curl is 2j. 


THE MEANING OF CURL F 


Example S$ put a paddlewheel into the flow. This is possible for any vector field F, 
and it gives insight into curl F. The turning of the wheel (if it turns) depends on its 
location {x, y, z). The turning also depends on the orientation of the wheel. We could 
put it into a spin field, and if the wheel axis n is perpendicular to the spin axis a, the 
whee] won’t turn! The general rule for turning speed is this: the angular velocity of 
the wheel is }(curl F)-n. This is the “directional spin,” just as (grad f):u was the 
“directional derivative’—and n is a unit vector like uw. 

There is no spin anywhere in a gradient field. It is irrotational: cur] grad f = 0. 

The pure spin field a x R has curl F = 2a. The angular velocity is a-n (note that 
4 cancels 2). This turning is everywhere, not just at the origin. If you put a penny on 
a compact disk, it turns once when the disk rotates once. That spin is “around itself,” 
and it is the same whether the penny is at the center or not. 

The turning speed is greatest when the wheel axis n lines up with the spin axis a. 
Then a -n ts the full length |a|. The gradient gives the direction of fastest growth, and 
the curl gives the direction of fastest turning: 


maximum growth rate of fis |grad f| in the direction of grad f 


maximum rotation rate of F is 3|curl F| in the direction of curl F. 
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STOKES’ THEOREM 


Finally we come to the big theorem. It will be like Green’s Theorem—a line integral 
equals a surface integral. The line integral is still the work $F -dR around a curve. 
The surface integral in Green’s Theorem is {{(N, —M,) dx dy. The surface is flat (in 
the xy plane). Its normal direction is k, and we now recognize N,—M, as the k 
component of the curl. Green’s Theorem uses only this component because the nor- 
mal direction is always k. For Stokes’ Theorem on a curved surface, we need all three 
components of curl F. 

Figure 15.24 shows a hat-shaped surface S$ and its boundary C (a closed curve). 
Walking in the positive direction around C, with your head pointing in the direction 
of n, the surface is on your left. You may be standing straight up (n=k in Green’s 
Theorem). You may even be upside down (n = — k is allowed). In that case you must 
go the other way around C, to keep the two sides of equation (6) equal. The surface is 
still on the left. A MObius strip is not allowed, because its normal direction cannot be 
established. The unit vector n determines the “counterclockwise direction” along C. 


450 (Stokes’ Theorem) p F-dR= | | (curl F)+ndS. 
c s 


The right side adds up small spins in the surface. The left side is the total circulation 
(or work) around C. That is not easy to visualize—this may be the hardest theorem 
in the book—but notice one simple conclusion. /f curl F = 0 then §F «dR =0. This 
applies above all to gradient fields—as we know. 

A gradient field has no curl, by (4). A gradient field does no work, by (6). In three 
dimensions as in two dimensions, gradient fields are conservative fields. They will be 
the focus of this section, after we outline a proof (or two proofs) of Stokes’ Theorem. 

The first proof shows why the theorem is true. The second proof shows that it 
really is true (and how to compute). You may prefer the first. 


First proof Figure 15.24 has a triangle ABC attached to a triangle ACD. Later there 
can be more triangles. § will be piecewise flat, close to a curved surface. Two triangles 
are enough to make the point. In the plane of each triangle (they have different n’s) 


Green’s Theorem is known: 
F-dR= ff curl F-nds F-dR= i} curl F< ndS. 
ABC ACD 


AB+BC+CA Pan ey 


Now add. The right sides give ff curl F-ndS over the two triangles. On the left, the 
integral over CA cancels the integral over AC. The “crosscut” disappears. That leaves 
AB + BC +CD + DA. This line integral goes around the outer boundary C—which 
is the left side of Stokes’ Theorem. 


electric field E(1) 


current 


Fig. 15.24 Surfaces § and boundary curves C. Change in B > curl E > current in C. 


magnetic 
field BCs) 


P wire C 
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Second proof Now the surface can be curved. A new proof may seem excessive, but 
it brings formulas you could compute with, From z = f(x, y) we have 


dz=dffaxdx+eéf/dydy and ndS = (— éf/dx i- ef/dy j +k) dx dy. 
For nad, see equation 15.4.11. With this dz, the line integral in Stokes’ Theorem is 
fF -dR= § Mdx+N dy+P(afféx dx + dffay dy). (7) 


shadow of C 


The dot product of curl F and nd§ gives the surface integral [[ curl F-ndS: 
hy 


wall, gfllPs — Ne Aflx) + (M, — P.(—Affdy) +(N.—M,JIdxdy. (8) 
To prove (7) =(8), change M in Green’s Theorem to M + Péf/dx. Also change N to 
N + Paf/dy. Then (7) =(8) is Green’s Theorem down on the shadow (Problem 47). 
This proves Stokes’ Theorem up on S. Notice how Green’s Theorem (flat surface) 
was the key to bath proofs of Stokes’ Theorem (curved surface). 


EXAMPLE & Stokes’ Theorem in electricity and magnetism yields Faraday’s Law. 


Stokes’ Theorem is not heavily used for calculations—equation (8) shows why. But 
the spin or curl or vorticity of a flow is absolutely basic in fluid mechanics. The other 
important application, coming now, is to electric fields. Faraday’s Law is to Gauss’s 
Law as Stokes’ Theorem is to the Divergence Theorem. 

Suppose the curve C is an actual wire. We can produce current along C by varying 
the magnetic field B(t). The flux g = j{ B+ ndS, passing within C and changing in time, 
creates an electric field E that does work: 


Faraday’s Law (integral form): work => E-dR = —do/dt. 
c 


That is physics. It may be true, it may be an approximation. Now comes mathematics 
(surely true), which turns this integral form into a differential equation. Information 
at points is more convenient than information around curves. Stokes converts the 
line integral of E into the surface integral of curl E: 


fE-dR = ff{curl E-ndS and also —dg/dt ={ — (6B/@t) -ndS. 
€ hy 


These are equal for any curve C, however small. So the right sides are equal for any 
surface §. We squeeze to a point. The right hand sides give one of Maxwell’s equations: 


Faraday’s Law (dilletential form): curl E = — éB/ét. 
CONSERVATIVE FIELDS AND POTENTIAL FUNCTIONS 


The chapter ends with our constant and important question: Which fields do no 
work around closed curves? Remember test D for plane curves and plane vector 
fields: 


if 6M/éy = @N/Ox then F is conservative and F = grad f and $F -dR=0. 


Now allow a three-dimensional field like F = 2xy i+ (x? + 2)j + yk. Does it do work 
around a space curve? Or is it a gradient field? That will require f/dx = 2xy and 
afjay=x*+z and af/éz=y. We have three equations for one function f(x, y, z). 
Normally they can’t be solved. When test D is passed (now it is the three-dimensional 
test: curl F = 0) they can be solved. This example passes test D, and fis x”y + yz. 
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45P F(x, y, z) = Mi+ Nj+ Pk is a conservative field if it has these properties: 
A. The work 4 -dR around every closed path in space is zero. 


B. The work |$F-dR depends only on P and Q, not on the path in space. 
C. F is a gradient field: M = Of/éx and N = Of/éy and P = df/az. 

D. The components satisfy M,=N,, M,=P,, and N, =P, (curl F is zero). 
A field with one of these properties has them all. D is the quick test. 


A detailed proof of A>B=>C=+D=A< is not needed. Only notice how C>D: 
curl grad F is always zero. The newest part is D > A. If curl F =0 thea § F -dR =0. 
But that is not news. It is Stokes’ Theorem. 


The interesting problem is to solve the three equations for f, when test D is passed. 
The example above had 


ffdx = 2xy > f= | 2xydx = x*y plus any function C(y, z) 

affay =x? +2 = x? + AC/dy = C = yz plus any function ¢{z) 

Offdz = y= y + de/dz => c(z) can be zero. 
The first step leaves an arbitrary C(y, z) to fix the second step. The second step leaves 
an arbitrary c(z} to fix the third step (not needed here). Assembling the three steps, 
f=x?y+C=x’y4+ yz+c=x’y+ yz. Please recognize that the “fix-up” is only pos- 
sihle when curl F = Q. Test D must be passed. 
EXAMPLE? Is F =(z — y)i +(x — z)j+(y — xk the gradient of any f? 


Test D says no. This F is a spin field a x R. Its curl is 2a = (2, 2, 2), which is not zero. 

A search for fis bound to fail, but we can try. To match éf/dx = z — y, we must have 

f=zx—yx + C(y, z). The y derivative is —x+ 8C/dy. That never matches N = x — z, 

so f can’t exist. 

EXAMPLE 8 What choice of P makes F = yz?i + xz*j + Pk conservative? Find f 

Solution We need curl F = 0, by test D. First check 8M/éy =z? = ON/éx. Also 
OP/éx = 6M /dz = 2yz and OP/éy = 6N/éz = 2xz. 


P = 2xyz passes ail tests. To find f we can solve the three equations, or notice that 
f=xyz? is successful. Its gradient is F. 

A third method defines f(x, y, z) as the work to reach (x, y, z) from (0, 0, 0). The path 
doesn’t matter. For practice we integrate F-dR= Mdx+Ndy+ Pdz along the 
straight line (xt, yt, zt): 


F(x, ys = {, (yt)(z#)?(x de) + (xt)(zt)*(y de) + 2ct)( ye)(zt)(z dt) = xyz”. 
0 


EXAMPLE Why is div curl grad f automatically zero (in two ways)? 


Solution First, curl grad f is zero (always). Second, div curl F is zero (always). Those 
are the key identities of vector calculus. We end with a review. 


Green’s Theorem: br ‘dR = [fewiex — €M/éy)dx dy 


oF ‘nds = | (@M /ex + 6N/Cy)dx dy 
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Divergenee Theorem: {lr ‘ndS) = \] (EMicx + ON icy + @Piéz)dx dy dz 


Stokes’ Theorent: 


br-aR 


{foun F-nds. 


The first form of Green’s Theorem leads to Stokes’ Theorem, The second form 

becomes the Divergence Theorem. You may ask, why not go to three dimensions in 

the first place? The last two theorems contain the first two (take P=0 and a flat 

surface). We could have reduced this chapter to two theorems, not four. I admit that, 

but a fundamental principle is involved: “It is easier to generalize than to specialize.” 
For the same reason df/dx came before partial derivatives and the gradient. 


15.6 EXERCISES 


Read-through questions 


The curl of Mi+ Nj+ Pk is the vector 9 __. It equals the 
3 by 3 determinant __> . The curl of x*i+z7k is ¢ 


For S = yi —{x + 2)j + yk the curlis__4 


.ThisSisa_ ® 
field a x R=4{curl F} x R, with axis vector a= _ ¢_, For 
any gradient field f,i+f,j+fk the curlis__9 _. That is the 
important tdentity curl gradf=__h_. It is based onf,, =f, 
andi and __J__. The twin identity is 


The curl measures the _! of a vector field. A pad- 

dlewheel in the field with its axis along m has turning speed 
m__. The spin is greatest when n is in the direction of 
n__. Then the angular velocity is__° 


Stokes’ Theorem is _® = 4 _. The curve C is the 
‘ofthe $s 5S. This is + Theorem extended to 
4 dimensions. Both sides are zero when F is a gradient 


field because __¥ 


The four properties of a conservative field are A=__¥__, 
B=_ x ,C=_y ,D=_2_. The field y?z?i + 2xp?zk 
(passes)(fails} test D. This field ts the gradient of f=_ 4. 
The work jF *dR from (0, 0,0) to (1,1, ljis__ 8 (on which 
path?}. For every field F, [{curl F-ndS is the same out 
through a pyramid and up through its base because __ © 


In Problems 1-6 find curl F. 
1 F=zi+xj+yk 2 F = grad(xe’ sin z} 

3Fa(x+y+z)(i+j+k) 4 F=(x+yi-(x4 yk 

§ F=p"(xi+ yj + zk) 6 F=(it+tj)xR 

7 Find a potential f for the field in Problem 3. 

8 Find a potential f for the field in Problem 5. 

9 When do the fields x"i and x"j have zero curl? 


10 When does (a,x + @2}' + a3z)k have zero curl? 


In 11-14, compute curl F and find §-F+dR by Stokes’ 
Theorem. 


11 F=x7i+y’k, C =circle x? + 27 =1, y=0. 

12 F=ixR, C =circle x? 4+ 2?=1, y=0. 

13 F=(i4+j) x R, C=circle y? + 2? =1, x =0. 

14 F =(yi— xj) x (xi + yj), C = circle x? + y? =1,2=0. 


15 (important) Suppose two surfaces § and T have the same 
boundary C, and the direction around C is the same. 
(a) Prove {f; curl F-ndS = ff, curl F-ndS. 
(b) Second proof: The difference between those integrals is 
f{jdivicurl F\@V. By what Theorem? What region is V? 
Why is thts integral zero? 
16 In 15, suppose S$ is the top half of the earth (nm goes out) 
and T is the bottom half (n comes in). What are C and V? 
Show by example that jf; F-ndS = [{-F +ndS is not generally 
true, 


17 Explain why [{ curl F-ndS =0 over the closed boundary 
of any solid V. 


18 Suppose curl F = @ and div F = 0. (a) Why is F the gradi- 
ent of a potential? (b) Why does the potential satisfy Luplace’s 
equation f.. +f, tf. = 0? 


In 19-22, find a potential / if it exists. 
19 F=zi+j+xk. 20 F = 2xyzi + x?2j + x7yk 
21 F=e* *4j-—¢*"*k 
23 Find a field with curl F =(1, 0, 0). 

24 Find all fields with curt F =(J, 0, 0). 


25 S=axR is a spin field. Compute F=b x S (constant 
yector b) and find its curl. 


22 F = pei t+ xzj+ (xy t+ 27k 
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26 Ilow fast is a paddlewheel turned by the field F = yi — xk 
(a) if its axis direction is n=j? (b}if its axis is lincd up with 
curl F? (c) if its axis is perpendicular to curl F? 

27 How is curl F related to the angular velocity « in the spin 
field F = cx(— yi + xj)? How fast does a wheel spin, if it is in 
the plane x + ytz=1? 

28 Find a vector field F whose curl is S = yi — xj. 

29 Find a vector field F whose curl is S=ax R. 


30 True or false: when two vector fields have the same curl 
at all points: (a) their difference is a constant field {b) their 
difference is a gradient field {c) they have the same divergence. 


In 31-34, compute [{ curl F - ndS over the top half of the sphere 
x? +)? +27 = 1 and (separately) 4 F -dR around the equator. 
31 F=yi- xj 32 F=R/p* 

33 F=axR 34 F=(axR)xR 

35 The circle C in the plane x+ y+2=6 has radius r and 
center at (1, 2,3). The held F is 3zj + 2yk. Compute ¢F-dR 
around €. 

36 Sis the top half of the unit sphere and F = 2i + xj + xyzk. 
Find jf curl F + ndS. 

37 Find g(x,y) so that curl gk = yi+ x7j. What is the name 
for g in Section 15.3? It exists because pi+x7j has zero 


38 Construct F so that curl F = 2xi+ 3yj — 5zk (which has 
zero divergence), 

39 Split the field F=x)i into ¥+W with curl ¥=0 and 
div W — 0. 

40 Ampere's law for a steady magnetic field Bis curl B= jJ 
(J = current density, 4 = constant). Find the work done by B 
around a space curve C from the current passing through it. 


Maxwell allows varying currents which brings in the electric 
field. 


4] For F =(x? + y?)i, compute curl (cur! F) and grad (div F) 
and F,,+F,, +F;,,. 


42 For F = cx, ¥, z)i, prove these useful identities: 
{a) curl(curl F}= grad (div F) —(F,, + F,, + F-.). 
(b) curl( fF) =f curl F + (grad f} x F. 


43 If B= a cos ¢ (constant direction a), find curl E from Fara- 
day’s Law. Then find the alternating spin ficld E. 


44 With Gfx, y. 2) = mi+ aj + pk, write out F x G and take 
its divergence. Match the answer with G- curl F— F+curi G. 


45 Write down Green’s Theorem in the xz plane from Stokes’ 
Theorem. 


46 True or false: V x F is perpendicular to F. 


47 (a) The second proof of Stokes’ Theorem took M*= 

Mix, v(x, y)) + Plx, yf Ox, y))effex as the M in Green's 
Theorem. Compute €M*/¢} from the chain rule and pro- 
duct rule (there are five terms). 
(b} Similarly N* = N(x, y+ Pix, yey has the x 
derivative N. +N lf. + Pf, + PLS, + Pf. Check that 
N¥ — M} matches the right side of equation (8), as needed 
in the proof. 


48 “The shadow of the boundary is the boundary of the 
shadow.” This fact was used in the second proof of Stokes’ 
Theorem, going to Green’s Theorem on the shadow. Give 
two examples of S and C and their shadows. 


49 Which integrals are equal when C = boundary of S or S= 
boundary of V? 


§F-dR §(curl F)-dR —$(curl F)-nds sf F- nS 
[[div Fas [f{curlF}-ndS [{ (grad div F)-ndS [{{div FdV 


50 Draw the field V = ~ xk spinning a wheel in the xz plane. 
What wheels would not spin? 


xil 
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CHAPTER 16 


Mathematics after Calculus 


I would like this book to do more than help you pass calculus. (I hope it does that 
too.) After calculus you will have choices— Which mathematics course to take next?) — 
and these pages aim to serve as a guide. Part of the answer depends on where you 
are going—toward engineering or management or teaching or science or another 
career where mathematics plays a part. The rest of the answer depends on where the 
courses are going. This chapter can be a useful reference, to give a clearer idea than 
course titles can do: 


Linear Algebra Differential Equations Discrete Mathematics 


Advanced Calculus (with Fourier Series) Numerical Methods Statistics 


Pure mathematics is often divided into analysis and algebra and geometry. Those 
parts come together in the “mathematical way of thinking’ —a mixture of logic 
and ideas. It is a deep and creative subject—here we make a start. 

Two main courses after calculus are Hnear algebra and differential equations. 
I hope you can take both. To help you iater, Sections 16.1 and 16.2 organize them 
by examples. First a few words to compare and contrast those two subjects. 


Linear algebra is about systems of equations. There are n variables to solve for. A 
change in one affects the others. They can be prices or velocities or currents or 
concentrations—outputs from any model with interconnected parts. 

Linear algebra makes only one assumption—the model must be linear. A change 
in one variable produces proportional changes in all variables. Practically every 
subject begins that way. (When it becomes nonlinear, we solve by a sequence of linear 
equations. Linear programming is nonlinear because we require x > 0.) Elsewhere J 
wrote that “Linear algebra has become as basic and as applicable as calculus, and 
fortunately it is easier.” I recommend taking it. 


A differential equation is continuous (from calculus), where a inatrix equation is 
discrete (from algebra). The rate dy/dt is determined by the present state y—which 
changes by following that rule. Section 16.2 solves y = cy + s(t) for economics and 
life sciences, and y” + by’ + cy=/f{t) for physics and engineering. Please keep it and 
refer to it. 
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A third key direction is discrete mathematics. Matrices are a part, networks and 
algorithms are a bigger part. Derivatives are not a part—this is closer to algebra. It 
is needed in computer science. Some people have a knack for counting the ways a 
computer can send ten messages in parallel—and for finding the fastest way. 

Typical question: Can 25 states be matched with 25 neighbors, so one state in each 
pair has an even number of letters? New York can pair with New Jersey, Texas with 
Oklahoma, California with Arizona. We need rules for Hawaii and Alaska. This 
matching question doesn’t sound mathematical, but it is. 

Section 16.3 selects four topics from discrete mathematics, so you can decide if you 
want more, 


Go back for a moment to calculus and differential equations. A completely realistic 
problem is seldom easy, but we can solve models. (Developing a good model is a skill 
in itself.) One method of solution involves complex numbers: 


any function u(xt+iy) solves u,,+u,,=0 (Laplace equation) 
any function e***) solves u,—c7u,,=0 (wave equation). 


From those building blocks we assemble solutions. For the wave equation, a signal 
starts at t= 0. It is a combination of pure oscillations e**. The coefficients in that 
combination make up the Fourier transform—to tell how much of each frequency is 
in the signal. A lot of engineers and scientists would rather know those Fourier 
coefficients than f(x). 

A Fourier series breaks the signal into La, cos kx or Lb sinkx or Ec,e™. 
These sums can be infinite {like power series). Instead of values of f(x), or derivatives 
at the basepoint, the function is described by a,, b,,c,. Everything is computed by 
the ‘Fast Fourier Transform.” This is the greatest algorithm since Newton’s method. 

A radio signal is near one frequency. A step function has many frequencies. A delta 
function has every frequency in the same amount: 6(x) = 2 cos kx. Channel 4 can’t 
broadcast a perfect step function. You wouldn’t want to hear a delta function. 


We mentioned compating. For nonlinear equations this means Newton’s method. 
For Ax = 6 it means elimination—algorithms take the place of formulas, Exact solu- 
tions are gone—speed and accuracy and stability become essential. It seems right to 
make scientific computing a part of applied mathematics, and teach the algorithms 
with the theory. My text Introduction to Applied Mathematics is one step in this 
direction, trying to present advanced calculus as it is actually used. 


We cannot discuss applications and forget statistics. Our society produces oceans 
of data—somebody has to draw conclusions. To decide if a new drug works, and if 
oil spills are common or rare, and how often to have a checkup, we can’t just guess. 
I am astounded that the connection between smoking and health was hidden for 
centuries. It was in the data! Eventually the statisticians uncovered it. Professionals 
can find patterns, and the rest of us can understand (with a little mathematics) what 
has been found. 


One purpose in studying mathematics is to know more about your own life. 
Calculus lights up a key idea: Functions. Shapes and populations and heart signals 
and profits and growth rates, all are given by functions. They change in time. They 
have integrals and derivatives. To understand and use them is a challenge— 
mathematics takes effort. A lot of people have contributed, in whatever way they 
could—as you and I are doing. We may not be Newton or Leibniz or Gauss or 
Einstein, but we can share some part of what they created. 
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You have met the idea of a matrix. An m by n matrix A has m rows and n columns 
(it is square if m=). It multiplies a vector x that has n components. The result is a 
vector Ax with m components. The central problem of linear algebra is to go back- 
ward: From Ax = b find x. That is possible when A is square and invertible. Otherwise 
there is no solution x—or there are infinitely many. 

The crucial property of matrix multiplication is linearity. If Ax= 6 and AX = B 
then A times x + X is b+ B. Also A times 2x is 2b. In general A times cx is cb. 
In particular A times 0 is 0 (one vector has n zeros, the other vector has m zeros). The 
whole subject develops from linearity. Derivatives and integrals obey linearity too. 


Question! What are the solutions te Ax=0?2 One solution is x =0. There 
may be other solutions and they fill up the “mu#space”’: 


x x= 2 
1 2))x 0 _ *=0 12 0 0 
= requires y]= also allows y= — 1 
0 3]ly 0 y=0 03 1 0 ; 
Zz z= 


When there are more unknowns than equations—when A has more columns than 
rows—the system Ax=0 has many solutions. They are not scattered randomly 
around! Another solution is X =4, Y= —2, Z=6. This lies on the same line as 
(2, —1, 3) and (0, 0, 0). Always the solutions to Ax =0 form a “space” of vectors— 
which brings us to a central idea of linear algebra. 


Note These pages are not concentrating on the mechanics of multiplying or invert- 
ing matrices. Those are explained in all courses. My own teaching emphasizes that 
Ax is a combination of the columns of A. The solution x= A~'b is computed by 
elimination. Here we explain the deeper idea of a vector space—and especially the 
particular spaces that control Ax = b. I cannot go into the same detail as in my book 
on Linear Algebra and Its Applications, where examples and exercises develop the 
new ideas. Still these pages can be a useful support. 


Ail vectors with n components lie in n-dimensional space. You can add them and 
subtract them and multiply them by any c. (Don’t multiply two vectors and never 
write 1/x or 1/A), The results x + X and x— X and cx are still vectors in the space. 
Here is the important point: 


The line of solutions to Ax = 0 is a “subspace”-—a vector space in its own right. 
The sum x + X has components 6, — 3, 9—which is another solution. The difference 
x~ X is a solution, and so is 4x. These operations leave us in the subspace. 


The nulispace consists of all solutions to Ax = 0. It may contain only the zero 
vector (as in the first example). It may contain a line of vectors (as in the second 
example). It may contain a whole plane of vectors (Problem 5). In every case x + X 
and x — X and cx are also in the nullspace. We are assigning a new word to an old 
idea—the equation x — 2y = 0 has always been represented by a line (its nullspace). 
Now we have 6-dimensionail subspaces of an 8-dimensional vector space. 

Notice that x? — y = 0 does not produce a subspace (a parabola instead). Even the 
x and y axes together, from xy = 0, do not form a subspace. We go off the axes when 
we add (1, 0} to (0, 1). You might expect the straight line x — 2y = 1 to be a subspace, 
but again it is not so. When x and y are doubled, we have X¥ — 2¥ = 2. Then (X, Y) 
is on a different line. Only Ax = 0 is guaranteed to produce a subspace. 
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Figure 16.1 shows the nullspace and “row space.” Cheek dot products (both zero). 


plane is row space of A 
= column space of A? 


Fig. 16.4 The nuilspace is perpendicular to the rows of A (the columns of A"). 


Question 2) When A muitiplics a vector x, what subspace does Ax lie in? The 
product Ax is a combination of the columns of A—hence the name “column space’’. 


2 1 8 0 
3 3(column 1) 
0 1 = + =|2 This is in the column space; | 0 | is not. 
2(column 2) 


0 0 0 1 

No choice of x can produce Ax = (0, 0, 1). For this 4, all combinations of the columns 
end ina . The column space is like the xy plane within xyz space. It is a 
subspace of m-dimensional space, containing every vector > that 1s a combination of 
the columns: 


The system Ax =b has a solution exactly when b is in the column space. 


When 4 has an inverse, the column space is the whole n-dimensional space. The 
nullspace contains only x = 0. There is exactly one solution x = A 'b. This is the 
good casc—and we outline four more key topics in Jincar algebra. 


1. Basis and dimension of a subspace. A one-dimensional] subspace is a linc. A plane 
has dimension two. The nullspace above contained ai] multiples of (2, —1, 3)—by 
knowing that “basis vector” we know the whole line. The column space was a plane 
containing column | and column 2. Again those vectors are a “basis” —by knowing 
the columns we know the whole column space. 

Our 2 by 3 matrix has three columns: (1, 0) and (2, 3) and {0, 1). Those are not a 
basis for the column space! This space is only a plane, and three vectors are too 
many. The dimension is two. By combining (1, 0) and (0, 1} we can produce the other 
vector (2, 3). There are only two independent columns, and they form a basis for this 
column space. 

In general: When a subspace contains r independent vectors, and no more, those 
vectors are a basis and the dimension is r. “Independent” means that no vector is a 
combination of the others. In the example, (1, 0) and {2, 3) are also a basis. A subspace 
has many bases, just as a plane has many axes. 


2. Least squares. If Ax = 6 has no solution, we look for the x that comes closest. 
Section 11.4 found the straight line nearest to a set of points. We muke the length 
of Ax — b as smail as possible, when zero length is not possible. No vector solves 


16.1 Vector Spaces and Linear Algebra 


Ax = hb, when b is not in the column space. So b is projected onto that space, This 
leads to the “normal equations” that produce the best x: 


A‘Ax = ATh. (1) 


When a rectangular matrix appears in applications, its tramspese generally comes 
too. The columns of A are the rows of A’. The rows of A are the columns of A’. 
Then A’A is square and symmetric—equal to its transpose and vital for applied 
mathematics. 


3. Eigenvalues (for square matrices only). Normally Ax points in a direction 
different from x. For certain special eigenvectors, Ax is parallel to x. Here is a 2 by 
2 matrix with two eigenvectors—in one case Ax = 5x and in the other Ax = 2x; 


weet dit bla} 


The multipliers 5 and 2 are the eigenvalues of A. An 8 by 8 matnx has eight eigen- 
values, which tell what the matrix is doing (to the eigenvectors). The eigenvectors are 
uncoupled, and they go their own way. A system of equations dy/dt = Ay acts like 
one equation—-when y is an eigenvector: 


dy, /dt =3y, + 2y =e 1 
yh fe ? has the solution ms : which is e™! . 
dyz/dt = y, + 4y2 y. =e 1 


The eigenvector is (1, 1). The eigenvalue A = 5 is in the exponent. When you substitute 
y, and y, the differential equations become Se* = Se*. The fundamental principle 
for dy/dt = cy still works for the system dy/dt = Ay: Look for pure exponential solu- 
tions. The eigenvalue “lambda” is the growth rate in the exponent. 

I have to add: Find the eigenvectors also. The second eigenvector (2, —1) has 
eigenvalue i = 2. A second solution is y, = 2e”, yp = — e*". Substitute those into the 
equation—they are even better at displaying the general rule: 


If Ax=Ax then d/dt(e"'x)= A(e*'x). The pure exponentials are y= e*'x. 


The four entries of A pull together for the eigenvector. So do the 64 entries of an 
8 by 8 matrix—again e”x solves the equation. Growth or decay is decided by A> 0 
or 4<0. When A= & + iw is a complex number, growth and oscillation combine in 
et = etteivot = e(cos wt + i sin we). 


Subspaces govern static problems Ax = b. Eigenvalues and eigenvectors 
govern dynamic problems dy/dt = Ay. Look for exponentials y= ex. 


4, Determinants and inverse matrices. A 2 by 2 matrix has determinant D = ad — be. 
This matrix has no inverse if D =0. Reason: A~' divides by D: 


acts geP times A= a equals [= = 
| nr c d 0 1] 


This pattern extends to n by » matrices, but D and A~‘ become more complicated. 

For 3 by 3 matrices D has six terms. Section 11.5 identified D as a triple product 
a*(b x c) of the columns. Three events come together in the singular case: D is zero 
and A has no inverse and the columas fie in a plane. The opposite events produce the 
“nonsingular” case: D is nonzere and A~' exists. Then Ax = b is solved by x = A~'b. 
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D is also the product of the pivots and the product of the eigenvalues. The pivots 
arise in elimination—the practical way to solve Ax = 6 without A” *. To find eigen- 
values we turn Ax = Ax into (A — AD)x = 0. By a nice twist of fate, this matrix A — AI 


kas D = 0. Go back to the example: 


3 2 1 0] [3-4 2 
-A = has D=(3—A)(4—A)—2= 12 —7A+ 10. 
1 4 01 ear | 


The equation 4? — 72+ 10=0 gives 1= 5 and A= 2. The eigenvalues come first, to 
make D=0. Then (A — S5i)x = 0 and (A — 21)x = 0 yield the eigenvectors. These x’s 
go into y = e**x to solve differential equations—which come next. 


46.4 EXERCISES 


Read-through questions 


If Ax = 6 and AX =B, then A times 2x +3X equals. a. 

If Ax=Q and AX =O then A times 2x + 3X equals _b 

In this case x and XY are in the _e¢__ of A, and so is the 

combination _d__. The nullspace contains all solutions to 
e__.Itisasubspace, which means_t__. If x =(1, 1, Lis 


in the nujispace then the columns add to __g_, so they are 
{independent)(dependent). 


Another subspace is the _h __ space of A, containing all 
combinations of the columns, The system Ax=4 can be 
solved when bis | _. Otherwise the best solution comes 
from A'Ax=__] _. Here A* is the _k _ matrix, whose 
rows are_ 1. The nullspace of A" contains all solutions to 
m_. The __n__ space of A™ {row space of A) is the fourth 
fundamental subspace. Each subspace has a basis containing 
asmany 9 ___ vectors as possible. The number of vectors in 


the basis is the _p __ of the subspace. 


When Ax =<Ax, the number 4 is an _q__ and x is an 
t__. The equation dy/dt = Ay has the exponential solution 
y=_s_.A7 by 7 matrix has _t _ eigenvalues, whose 
product is the _u _D. If D is nonzero the matrix A has an 
v__. Then Ax=b is solved by x=__w_. The formula 
for D contains 7! = 5040 terms, so x is better computed by 
x__. On the other hand Ax =4x means that A —</ has 
determinant __y __. The eigenvalue is computed before the 
z 


Find the nullspace in 1-6. Along with x go all cx. 
1 2 12 -6 
1 A= (solve Ax = 0} 2 B= 
2 4 —6 3 


10 

SC2le 1am wale 
= solve CX = = 
Ce = 012 


21 
5 £=)1 1 1 6 F= 
1 2 
111 
3 ; 
7 Change Problem 1 to Ax = ‘ (a) Find any particular 


solution xp. (b) Add any x» from the nullspace and show that 
Xp+ Xo is aiso a solution. 


I 
8 Change Problem | to Ax || and find all solutions. 


Graph the lines x, +2xz,=1 and 2x, +4x,=0 in a plane. 


9 Suppose Axp=b and Ax y=Q. Then by linearity 
A{Xp + Xo} = . Concluston: The sum of a particular 
solution xp and any nulivector xp is 


10 Suppose Ax=6 and Axp=b. Then by linearity 
A(x ~ Xp) = . The difference between solutions is a 
vector in . Concluston: Every soiution has the form 
X=Xp+Xg, one particular solution plus a@ vector in the 
nullspace. 


1) Find three vectors 5 in the column space of E. Find ai} 
vectors b for which Ex = 6 can be solved. 


12 ff Ax =0 then the rows of A are perpendicular to x. Draw 
the row space and nulispace (lines in a plane) for A above. 


13 Compute CCT and C™C. Why not C*? 


14 Show that Cx =6 has no solution, if b =(—{, I, f). Find 
the best solution from C™Cx = C7. 


15 C™ has three columns. How many are independent? 
Which ones? 


16 Find two independent vectors that are in the column space 
of C but are not columns of C, 


17 For which of the matrices ABCEF are the columns a 
basis for the column space? 


46.2 Differential Equations 603 


18 Explain the reasoning: If the columns of a matrix A are 
independent, the only solution to Ax =0 is x = 0. 


19 Which of the matrices ABCEF have nonzero deter- 
minants? 


20 Find a basis for the full three-dimensional space using 
only vectors with posilive components. 


t oO 
21 Find the matrix F ~' for which FF ~* -1-|) i 


22 Venfy that (determinant of F}’ = (determinant of F?). 


23 (Important) Write down F — AJ and compute its determi- 
nant. Find the two numbers & that make this determinant 
zero. For those two numbers find eigenvectors x such that 
Fx =Ax. 


24 Compute G = F*. Find the determinant of G — AI and the 
two 4's that make it zero. For those 4’s find eigenvectors x 
such that Gx = Ax. Conclusion: if Fx = Ax then F*x = d?x. 


25 From Problem 23 find two exponential solutions to the 
equation dy/dt= Fy. Then find a combination of those 
solutions that starts from yg =(1, 0) at £=0. 


26 From Problem 24 find two solutions to dy/dt = Gy. Then 
find the solution that starts from yo = (2, 1). 


16.2 Differential Equations 


27 Compute the determinant of E — (J. Find all 4’s that make 
this determinant zero. Which eigenvalue is repeated? 


28 Which previous problem found eigenvectors for Ex = 0x? 
Find an eigenvector for Ex = 3x. 


29 Find the eigenvalues and eigenvectors of A. 


30 Explain the reasoning: A matrix has a zero eigenvalue if 
and only if its determinant is zero. 


31 Find the matrix H whose eigenvalues are 0 and 4 with 
eigenvectors (1, 1) and (1, —1). 


32 If Fx = Ax then multiplying both sides by gives 
F-1x=A7 x. If F has eigenvalues 1 and 3 then F~' has 
eigenvalues . The determinants of F and F~! are 


33 True or false, with a reason or an example. 
{a) The solutions to Ax = 5 form a subspace. 


(b} E : has ‘| in its nullspace and column space. 
(c) A‘A has the same entry in its upper right and lower 
left corners. 

(d) If Ax = Ax then y =e” solves dy/dt = Ay. 

{e) If the columns of A are not independent, their combi- 
nations still form a subspace. 


We just solved differential equations by linear algebra. Those were special systems 
dy/dt = Ay, linear with constant coefficients. The solutions were exponentials, 
involving e”. The eigenvalues of A were the “growth factors” A. This section solves 
other equations—by no means all. We concentrate on a few that have important 
applications. 

Return for a moment to the beginning—when direct integration was king: 


1. dy/dt = s(t) 2. dy/dt=cy 3. dy/dt = u(y)c(t). 


In 1, \(t) is the integral of s(t). In 2, y(t} is the integral of cy{r). That sounds circular— 
it only made sense after the discovery of y = e'. This exponential has the correct 
derivative cy. To find it by integration instead of inventing it, separate y from ¢: 


Separation and integration also solve 3: { dy/u(y) = § c(t)}dt. The model logistic equation 
has u = y — y* = quadratic. Equation 2 has u = y = linear. Equation 1 is also a special 
case with u = 1 = constant. But Z and 1 are very different, for the following reason. 

The compound interest equation y =cy is growing from inside. The equation 
y' = s(t) is growing from outside. Where c is a “growth rate,” s is a “source.” They 
don’t have the same meaning, and they don’t have the same units, The combination 
y' =cy +s was solved in Chapter 6, provided c and s are constant—but applications 
force us to go further. 
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In three examples we introduce non-constant source terms, 


EXAMPLE 4 Soive dy/dt = cy +5 with the new source term s= e™. 
Method Substitute y= Be“, with an “undetermined coefficient” B to make it right: 
kBe = cBe" +e" yields B= I/(k—c). 


The source e“ is the driving term. The solution Be” is the response. The exponent is 
the same! The key idea is to expect e* in the response. 


Initial condition To match yp at t=0, the solution needs another exponential. 
It is the free response Ae, which satisfies dy/dt=cy with no source. To make 
y= Ae" + Be" agree with yo, choose A = yy — B: 


Final solution y= (yo — Ble" + Be = poe + (& — &S/(k — c). (1) 
Exceptional case #8 = 1/(k—c) grows larger as k approaches c. When k=c the 


method breaks down—the response Be*' is no longer correct. The solution (1} 
approaches 0/0, and in the limit we get a derivative. It has an extra factor t: 


e'~e _ change in e* 
k-c¢ change inc 


d ety — ggct 
ae) te". (2) 


The correct response is te“ when k = c. This is the form to substitute, when the driving 
rate k equals the natural rate c (called resonance). 
Add the free response ye" to match the initial condition. 


EXAMPLE 2 Solve dy/dt = cy +s with the new source term s = cos kt. 


Substitute y = B sin kt + Dos kt. This has two undetermined coefficients B and D: 


kB cos kt— kD sin kt = c(B sin kAt+ D cos kt)+ cos kt. (3) 

Matching cosines gives kB = cD + 1. The sines give — kD = cB. Algebra gives B, D, y: 
c k csin kt+k cos kt 

5 k? + ¢? k? + ¢# 7 k? + ¢? (4) 


Question Why do we need both Bsin kt and Dcos kt in the response to cos kt? 


First Answer Equation (3) is impossible if we leave out B or D. 
Second Answer cos kt is te + 4e7, So e* and e~ ™ are both in the response. 


EXAMPLE 3 Solve dy/dt =cy +s with the new source term s= te”. 


Method Look for y= Be" + Dte*. Problem 13 determines B and D. Add Ae“ as 
needed, to match the initial value yo. 


SECOND-ORDER EQUATIONS 


The equation dy/dt = cy is first-order. The equation d?y/dt? = — cy is second-order. 
The first is typical of problems in life sciences and economics—the rate dy/dt depends 
on the present situation y. The second is typical of engineering and physical sciences — 
the acceleration d*y/dt? enters the equation. 

If you put money in a bank, it starts growing immediately. If you turn the wheels 
of a car, it changes direction gradually. The path is a curve, not a sharp corner. 
Newton’s law is F = ma, not F = mv. 


16.2 Differential Equations 


A mathematician compares a straight line to a parabola. The straight line crosses 
the x axis no more than once. The parabola can cross twice. The equation 
ax? + bx + c= 0 has two solutions, provided we allow them to be complex or equal. 
These are exactly the possibilities we face below: two real solutions, two complex 
solutions, or one solution that counts twice. The quadratic could be x? — 1 or x* +1 
or x?. The roots are 1 and —1, i and —i, QO and 0. 

In solving differential equations the roots appear in the exponent, and are called A. 


EXAMPLE 4 y’=+y: solutions p=¢ and y=e! A=1,-1 
EXAMPLES y”=Qy: solutions y=1 and y=t A= 0, 0 
EXAMPLES y’=-—y: solutions y=cost and y=sint A=i, —i 
Where are the complex solutions? They are hidden in Example 6, which could be 
written y=e" and y=e ". These satisfy y’ = — y since i? = — 1, The use of sines and 
cosines avoids the imaginary number i, but it breaks the pattern of e*. 
Example 5 also seems to break the pattern—again e*' is hidden. The solution y = 1 


is e*'. The other solution y= t is te". The zero exponent is repeated—another excep- 
tional case that needs an extra factor r. 


Exponentials solve every equation with constant coefficients and zero right hand side: 
To solve ay"+by'+cy=0 substitute y=e" and find i. 
This method has three steps, leading to the nght exponents A= r and A=s: 


1, With y = e* the equation is a27e* + bie* + ce* = 0. Cancel e*. 
2. Solve ad? + b1 + c=0. Factor or use the formula A = (— b + ./b? — 4ac)/2a. 
3. Call those roots A= r and 1=s. The complete solution is y= Ae“ + Be™. 


The pure exponentials are y= e'' and y ~ e". Depending on r and s, they grow or 
decay or oscillate. They are combined with constants A and B to match the two 
conditions at t= 0. The initial state yp equals A+ B. The initial velocity yp equals 
rA + sB (the derivative at t = 0). 


EXAMPLE 7 Solve y" — 3y'+ 2y=0 with yp = 5 and yp = 4. 


Step 1 substitutes y= e*. The equation becomes A7e* — 34e* + 2e* = 0. Cancel e*. 
Step 2 solves 1?— 32+2=0. Factor into (2—1)(A— 2}=0. The exponents r, s are 1, 2. 
Step 3 produces y = Ae’ + Be’, The initial conditions give A + B= Sand 14 + 2B=4. 
The constants are A= 6 and B= — t. The solution is y = 6e' — e”*. 


This solution grows because there is a positive 4. The equation is “unstable.” It 
becomes stable when the middie term —3y' is changed to +3y’. When the damping 
is positive the solution decays. The 2’s are negative: 


EXAMPLE 8 (2? + 34+ 2) factors into {4 + 1)(4 + 2). The exponents are —1 and — 2. 
The solution is y= Ae™'+ Be **. It decays to zero for any initial condition. 
EXAMPLES 9-10 Solve y” +2y'+2p=Oand y”’ + 2y + y=0. How do they differ? 
Key Gifference A?+24+2 has complex roots, 7? + 24+ 1 has a repeated root: 
A74+2A+2=0 gives A=—-1+i (2+1)?=0 _ gives =-—1, -1. 


The —1 in all these 2’s means decay. The i means oscillation. The first exponential 
is e—1* which splits into e~' (decay) times e“ (oscillation). Even better, change e“ 
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and e~* into cosines and sines: 
y= Ae ' t+ Bel! = ea cos t+ b sin 1). (5) 


At t=0 this produces y, = a. Then matching yo leads to b. 

Example 10 has r=s= — 1 (repeated root). One solution is e~' as usual. The 
second solution cannot be another e~‘, Problem 21 shows that it is te~‘—-again the 
exceptional case multiplies by ¢! The genera! solution is y= Ae~'+ Bte~*. 


Without the damping term 2y’, these examples are y” + 2y =0 or y” + y=O—pure 
oscillation. A smail amount of damping mixes oscillation and decay. Large damping 
gives pure decay. The borderline is when A is repeated (r=). That occurs when 
b* — 4ac in the square root is zero. The borderline between two real roots and two 
complex roots is two repeated roots. 

The method of solution comes down to one idea: Substitute y = e*. The equations 
apply to mechanical vibrations and electrical circuits (also other things, but those 
two are of prime importance). While describing these applications 1 will collect the 
information that comes from A. 


SPRINGS AND CIRCUITS: MECHANICAL AND ELECTRICAL ENGINEERING 


A mass is hanging from a spring. We pull it down an extra distance yg and give it a 
starting velocity yg. The mass moves up or down, obeying Newton’s law: mass times 
acceleration equals spring force plus damping force: 


my” = —ky—dy’ or my” + dy + ky=0. (6) 


This is free oscillation. The spring force —ky is proportional to the stretching y 
(Hooke’s law). The damping acts like a shock absorber or air resistance—it takes 
out energy. Whether the system goes directly toward zero or swings back and forth 
is decided by the three numbers m, d, k. They were previously called a, b, c. 


46A The solutions e~ to my’ + dy +ky=0 are controlled by the roots of 
ma? + di+k=0. With d>O there is damping and decay. From ./d? — 4mk 
there may be oscillation: 


overdamping: d? > 4mk gives real roots and pure decay (Example 8) 


underdamping: d? < 4mk gives complex roots and oscillation (Example 9) 


critical damping: d?* = 4mk gives a real repeated root -- d/2m (Example 10) 


We are using letters when the examples had numbers, but the results are the same: 


| 

2m t 2m 
Overdamping has no imaginary parts or oscillations: y= Ae‘ + Be". Critical damping 
has r=s and an exceptional solution with an extra t: y = Ae’ + Bte™. (This is only a 
solution when r = s.) Underdamping has decay from —d/2m and oscillation from the 
imaginary part. An undamped spring (d = 0) has pure oscillation at the natural fre- 
QUeNcy Wy = ./k/m. 

All these possibilities are in Figure 16.2, created by Alar Toomre. At the top is pure 
oscillation (d= 0 and y= cos 2z). The equation is y” + dy + 4y=0 and d starts to 
grow. When d reaches 4, the quadratic is A? + 44 + 4 or (A + 2)’. The repeated root 


ma?+ddi+k=0 hasroots r,s=— d? — 4mk. 


16.2 Differential Equations 


no damping d=0 ., y"+dy'+4y=0 


critical d=4 F 


overdamped 


repeated 


real roots 


Fig. 16.2 


yields e~* and te~*‘, After that the oscillation is gone. There is a smooth transition 
from one case to the next—as complex roots join in the repeated root and split into 
real roots. 

At the bottom right, the final value y(2z) increases with large damping. This was 
a surprise. At d= 5 the roots are —1 and —4. At d= 8.5 the roots are —4tand —8. 
The small root gives slow decay (like molasses). As d— 00 the solution approaches 
y=1. 

If we are serious about using mathematics, we should take advantage of anything 
that helps. For second-order equations, the formulas look clumsy but the examples 
are quite neat. The idea of e“ is absolutely basic. The good thing is that electrical 
circuits satisfy the same eqution. There is a beautiful analogy between springs and 
circuits: 

mass m «> inductance L 
damping constant d «+ resistance R 
elastic constant k «+ 1/(capacitance C) 
The resistor takes out energy as the shock absorber did—converting into heat by 
friction. Without resistance we have pure oscillation. Electric charge is stored in 
the capacitor (like potential energy). It is released as current (like kinetic energy). 
It is stored up again (like a stretched spring). This_continues at a frequency 
Wp = 1 |,/LC (like the spring’s natural frequency ,/k/m). These analogies turn 
mechanical engineers into electrical engineers and vice versa. 
The equation for the current y(t) now includes a driving term on the right: 
dy 


) 1 
L a +Ryt+ C [ dt = applied voltage = V sin wt. (7) 


To match networks with springs, differentiate both sides of (7): 
Ly" + Ry’ + y/C = Vo cos wt. (8) 


The oscillations are free when V=0 and forced when V #0. The free oscillations 
e* are controlled by Li? + RA +1/C=0. Notice the undamped case R= 0 when 
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A=+i/,/LC. This shows the natural frequency w,. = 1/./LC. Damped free oscilla- 
tions are in the exercises—what is new and important is the forcing from the right 
hand side. Our last step is to solve equation (8). 


PARTICULAR SOLUTIONS—THE METHOD OF UNDETERMINED COEFACIENTS 


The forcing term is a multiple of cos wt. The ‘particular solution’ is a multiple 
of cos wt plus a multiple of sin wt. To discover the undetermined coefficients in 
y=acos wt + b sin wt, substitute into the differential equation (8): 


— Lo*(a cos wt + b sin wt)+ Ro(—a sin wt + b cos wt) 
+ (a cos wt +b sin wt)//C = Vea cos wt. 
The terms in cos wt and the terms in sin wt give two equations for a and b: 
—alw’ + bRa+a/C=Ve and — bLw? — aRa + b/C = 0. (9} 
EXAMPLE 44 Solve y’ + y= cos wt. The oscillations are forced at frequency w, The 
oscillations are free (y” + y = 0) at frequency 1. The solution contains both. 


Particular solution Set y=acoswt+ bsin wt at the driving frequency w, and (9) 
becomes 
-—aw?+0+a=1 and —bw? —0+b=0. 


The second equation gives b=0. No sines are needed beeause the problem has no 
dy/dt. The first equation gives a = 1/(1 — w), which multiplies the cosine: 
y=(cos wt)/(1—*) solves y’ + y=cos wt. (10) 
General solution Add to this particular solution any solution to y” + y = 0: 
Y = Pparticutar + Vromogeneous = (COS wt)/(i — w?)+ C cos t+ D sin t. (41) 


Problem of resonance When the driving frequency is w=1, the solution (11) 
becomes meaningless—its denominator is zero. Reason: The natural frequency in 
cos t and sin ¢ is also {. A new particular solution comes from tf cos ¢ and f sin ¢. 

The key to success is to know the form for y. The table displays four right hand 
sides and the correct y’s for any constant-coefficient equation: 


Right hand side Particular solution 

: y = Be" (same exponent) 
cos wt or sin wt y=acos wt + b sin wt {include both) 
polynomial in ¢ y = polynomial of the same degree 
ecoswt or 2 sin wt y = ae™ cos wt + be sin wt 


Exception If one of the roots A for free oscillation equals k or iw or 0 or k + iw, the 
corresponding y in the table is wrong. The proposed solution would give zero on the 
right hand side. The correct form for y includes an extra rt. All particular solutions 
are computed by substituting into the differential equation. 


Apology Constant-coefficient equations hardly use calculus (only e“). They reduce 
directly to algebra (substitute y, solve for 4 and a and 5). I find the S-curve from the 
logistic equation much more remarkable. The nonlinearity of epidemics or heartbeats 
or earthquakes demands all the calculus we know. The solution is not so predictable. 
The extreme of unpredictability came when Lorenz studied weather prediction and 
discovered chaos. 


46.2 Differential Equations 
NUMERICAL METHODS 


Those four pages explained how to solve linear equations with constant coefficients: 
Substitute y=e". The list of special solutions becomes longer in a course on 
differential equations. But for most nonlinear problems we enter another world— 
where solutions are numerical and approximate, not exact. 

In actual practice, numerical methods for dy/dt = f(t, y) divide in two groups: 


1. Single-step methods like Euler and Runge-Kutta 
2. Multistep methods like Adams-Bashforth 


The unknown y and the right side f can be vectors with n components. The notation 
stays the same: the step is At= A, the time t, is nh, and y, is the approximation to 
the true y at that time. We test the first step, to find y, from yo = 1. The equation is 
dy/dt = y, so the right side is f= y and the true solution is y= eé. 

Notice how the first value of f (in this case 1) is used inside the second f: 


Improved Euler yy+1= Yn t+ SAUL (tn Yn) +S (En 1 In t Af (ns tnd) 
TEST y= Lt+sh[l+(+A]attatdh 


At time A the true solution equals e*. Its infinite series is correct through h? 
for Improved Euler (a second-order method). The ordinary Euler method 
Ya+1—= Yat Af(t,, Ye} IS first-order. TEST: y,;=1+4. Now try Runge-Kutta 
(a fourth-order method): 


Runge-Kutta yn41 = Yq_+Glk, + 2ky + 2kg+k,] with k, = Af(t,, y,) 
ky = hf (t, +h, vat oki) ky = Af lt, +h, yn tthe) ka = hf (ty + hy yn + ke) 


Now the first value of f is used in the second (for k,), the second is used in the third, 
and then &; is used in k,. The programming is easy. Check the accuracy with another 
test on dy/dt = y: 


rns fea) oa (8) e(ed( 8) 


=l+h+—+—+ —. This answer agrees with e* through A*. 


These formulas are included in the book so that you can apply them directly — 
for example to see the S-shape from the logistic equation with f=cy— by?. 

Mulltistep formulas are simpler and quicker, but they need a single-step method to 
get started. Here is y, in a fourth-order formula that needs yo, y;, ¥2, y3- Just shift 
all indices for ys, yg, and y,41: 


h 
Maftistep y4= y4+ 4 (55y3 — 59y5 + 37), — 9yo J. 


The advantage is that each step needs only one new evaluation of y,=f(t,, ¥,)- 
Runge-Kutta needs four evaluations for the same accuracy. 


Stability is the key requirement for any method. Now the good test is y’ = — y. The 
solution should decay and not blow up. Section 6.6 showed how a large time step 
makes Euler’s method unstable—the same will happen for more accurate formulas. 
The price of total stability is an “implicit method” like y,; = yo +4+h(y9 + yj), 
where the unknown y, appears also in y,. There is an equation to be solved at every 
step. Calculus is ending as it started—with the methods of Isaac Newton. 
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16.2 EXERCISES 


Read-through questions 


The solution to y'—Sy=10 is y= Ae*'+4+ B. The homo- 
geneous part Ae satisfies y’—Sy=_o . The particu- 
lar solution B equals _b _. The initial condition yo is 
matched by A=_c_. For y'—5y=e" the right form 
is y=Ae"'+_d . For y'—Sy=cos: the form is 
y=Ae"+_e + 1 


The equation y’+4y' +5y=0 is second-order because 

g__. The pure exponential solutions come from the roots 
of __h __, which arer=_i and s= L . The general 
solution is y=__& _. Changing 4y’ to _|___ yields pure 
oscillation, Changing to 2y' yields 2 = — =f+3 = when the 
solutions become y=_m_. This oscillation is 
(over){under)(critically} damped, A spring with m= 1, d=2, 
k=5 goes (back and forth)(directly to zero). An electrical 
network with L=1, R=2, C=4 also _a 


One particular solution of y’+4y=e' is e' times 

o _. If the right side is cos ¢, the form of y,is__p _. If the 
right side is 1 then y,=_q__. If the right side is__+1_ we 
have resonance and y, contains an extra factor __s 


Problems 1-14 are about first-order linear equations. 


1 Substitute y= Be" into y=7= 8e* to find a particular 
sofution. 


2 Substitute y=acos 2¢+ sin 2¢ into y +y=4sin 2¢ to 
find a particular solution. 


3 Substitute y=a+ bt+cr? into y+y=1+07 to find a 
particular solution. 


4 Substitute y = ae'cos t+ be'sint into y’ = 2e’cos ¢ to find 


a particular solution. 


5 In Problem | we can add Ae’ because this solves the equa- 
tion . Choose A so that (0) = 7. 


6 In Problem 2 we can add Ae™'‘, which solves 
Choose A to match (0) = 


7 In Problem 3 we add to match (0) = 2. 
8 In Problem 4 we can add y= A. Why? 


9 Starting from yo =0 solve y' =e and also solve y' = I. 
Show that the first solution approaches the second as k > 0. 


10 Solve y' — y=e" starting from y)=0. What happens to 
your formula as k +1? By l’Hépital’s rule show that y 
approaches te' as k > 1, 


11 Solye y'~— y=e'+ cost. What form do you assume for y 
with two terms on the right side? 


12 Solve y+ y=e'+¢. What form to assume for y? 
13 Solve y' =cy + te’ following Example 3 (c # 1). 


14 Solve y =y+1t following Example 3 (c=1 and k = 0). 


Problems 15-28 are about second-order linear equations. 

1§ Substitute y= e* into y’ + 6y’ + Sy =0. (a) Find all 2’s. 
(b} The solution decays because . (c} The general 
solution with constants A and B is : 

16 Substitute y = e* into y’ + 9y =O. {a) Find all 2’s. (b} The 
solution oscillates because . (c) The general solution 
with constants a and 6 is 


17 Substitute y=e* into y’+2y'+3y=0. Find both 2’s. 
The solution oscillates as it decays because . The 
general solution with A and B and e* is . The 
general solution with e~‘ times sine and cosine is 


18 Substitute y =e" into y” + 6y’ + Oy = 0. (a) Find all 2’s. 
(b) The general solution with e* and te* is 


19 For y’+dy'+y=0 find the type of damping at 
d=0, 1, 2, 3. 


20 For y’+2y'+ky=0 find the type of damping at 
k=, 1,2. 


21 If 42+54+c=0 has a repeated root prove it is A= 
— 6/2. In this case compute y” + by’ + cy when y= 


22 A* + 3242=Ohas roots —[ and —2 (not repeated), Show 
that te~‘ does not solve y” + 3y' + 2y = 0. 


23 Find y=acost+bsine to solve y’+ y+ y=cos t. 
24 Find y=acos wt + bsin wt to solve y” + y' + y=sin wt, 


25 Solve y’ + 9y = cos 5t with yg =Oand yp = 0. The solution 
contains cos 3t and cos 5t. 


26 The difference cos 5t—cos 3t equals 2 sin 4t sin t. Graph 
it to see fast oscillations inside slow oscillations (beats). 


27 The solution to y’+@gy=coswt with yy=O0 and 
Yo=0 is what multiple of cos wt —cos wot? The formula 
breaks down when @ = 


28 Substitute y=Ae™ into the circuit equation 
Ly + Ry+fy dt/C = Ve. Cancel e™ to find A. Its denomi- 
nator is the impedance. 


Problems 29-32 have the four right sides in the table (end of 
section). Find y,snicuiar by using the correct form. 


30 y’ +3y=sint 
WM y"+2ystse 32 y" + 2y=e' Cost. 


33 Find the coefficients of y in Problems 29-31 for which the 
forms in the table are wrong. Why are they wrong? What new 
forms are correct? 


29 y"+3y=e* 
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34 The magic factor t entered equation (2). The series for 
e — & starts with 1 + kt +472? minus 1 + ct + 4¢7t*. Divide 
by k—c and set k=c to start the series for te“. 

35 Find four exponentials py = e* for d*y/dt* — py =0. 

3% Find a particular solution to d*y/dt* + y= et. 

37 The solution is y=Ae~*+Bte-** when d=4 in 


Figure 16.2. Choose A and B to match yo=1 and y=0. 
How large is (2x)? 


38 When d reaches 5 the quadratic for Figure 16.2 is 
4745444=(24+10044. Match p=Ae'+ Be to 
Yo= i and yp =O. How large is (22)? 


39 When the quadratic for Figure 16.2 has roots —r and 
—4jr, the solution is y= Ae~™ + Be +", 

(a} Match the initial conditions yp = t and yj =0. 

{b} Show that y approaches f{ as r +0, 


16.3 Discrete Mathematics: Algorithms 


40 In one sentence tell why y’ = 6y has exponential solutions 
but yp’ =6y? does not. What power y=x" solves this 
equation? 


41 The solution to dy/dt =f(t), with no y on the right side, 
is y={ f(t} dt. Show that the Runge-Kutta method becomes 
Simpson’s Rule. 


42 Test ail methods on the jogistic equation y’ = y— y” to 
see which gives y., = 1 most accurately. Start at the inflection 
point yo =4 with A=yy. Begin the multistep method with 
exact values of y=(I+e7')7 1. 


43 Extend the tests of Improved Euler and Runge-Kutta to 
y =—y with yg=1. They are stable if |y,| <1. How large 
can h be? 


44 Apply Runge-Kutta to y=—100y+100sint with 
yo =O and h=.02. Increase h to .03 to see that instability 
is no joke. 


Discrete mathematics is not like calculus. Everything is finite. I can start with the 50 
states of the U.S. I ask if Maine is connected to California, by a path through 
neighboring states. You say yes. I ask for the shortest path (fewest states on the way). 
You get a map and try all possibilities (not really all—but your answer is right). Then 
I close all boundaries between states like Illinois and Indiana, because one has an 
even number of letters and the other has an odd number. Is New York still connected 
to Washington? You ask what kind of game this is—but I hope you will read on. 
Far from being dumb, or easy, or useless, discrete mathematics asks good questions. 
It is important to know the fastest way across the country. It is more important to 
know the fastest way through a phone network. When you call long distance, a quick 
connection has to be found, Some lines are tied up, like Illinois to Indiana, and there 


is no way to try every route. 


The example connects New York to New Jersey (7 letters and 9). Washington is 

‘ connected to Oregon (10 letters and 6). As you read those words, your mind jumps 

to this fact—there is no path from New York with 7 letters to Washington with 10, 

Somewhere you must get stuck. There might be a path between all states with an 
odd number of letters—I doubt it. Graph theory gives a way to find out. 


GRAPHS 


A model for a large part of finite mathematics is a graph. It is not the graph of 
y=ftx). The word “graph” is used in a totally different way, for a collection of nodes 
and edges. The nodes are like the 50 states. The edges go between two nodes—the 
neighboring states. A network of computers fits this model. So do the airline connec- 
tions between cities. A pair of cities may or may not have an edge between them— 
depending on flight schedules. The model is determined by V and E. 
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DEFINITION A graph is a set V of nodes (or vertices) and a set E of edges. 


EXAMPLE 1 How many edges are possible with n nodes, in a complete graph? 


The first node has edges to the »—1 other nodes. (An edge to itself is not allowed.) 
The second node has n~— 2 new edges. The third node has another n — 3. The total 
count of edges, when none are missing, is the sum from Section 5.3: 


1+2+--+(n—1)= 3n(n— 1) edges in a complete graph. 


Fifty states have 25+ 49 = 1225 possible edges, The ‘‘neighboring states graph’’ has 
less than 200. A line of 6 nodes has 5 edges, out of §:6+5= 15 possible. 


EXAMPLE 2. Which states with an odd number of letters are reachable from New 
York? Boundaries to states like Pennsylvania (12 letters) are closed. 


Method of solution Start from New York (7). There is an edge to Connecticut (11). 
That touches Massachusetts (13), which is a neighbor of Vermont (7). But we missed 
Rhode Island, and how do we get back? The order depends on our search method— 
and two methods are specially important. 


Depth first search (DFS) “‘From the current state, go to one new state if possible.” 
But what do we do from Vermont, when New Hampshire (12) is not allowed? The 
answer is: backtrack to Massachusetts. That becomes the next current state, 

We label every state as we reach it, to show which state we came from. Then VT 
has the label MA, and we easily cross back. From MA we go to RI. Then backtrack 
to MA and CT and NY. At every step I searched for a new state with no success. 
From NY we see NJ (9). Finally we are in a corner. 

The depth first search is ended, by a barrier of even states. Uniess we allow Ontario 
and keep going to Minnesota. 


Breadth first search (BFS) “From the current state, add al! possible new states to the 
bottom of the list. But take the next current state from the top of the hist.” There is 
no need to backtrack. 

From NY we reach VT and MA and CT and NJ. What comes next? 

Where DFS moves from the Jast possible state, breadth first search moves from 
the first possible state. No move from VT is possible—so we “scan” from Massachu- 
setts. We see Rhode Island (barely). That ends BFS. 

The same six states are reached both ways. Only the order is different. DFS is fast 
in-first out. BFS is first in-first out. You have the same choice in drawing a family 
tree—follow a path as far as it goes and backtrack, or list all brothers and sisters 
before their children. The BFS graph in Figure 16.3 is a tree. So is the DFS graph, 
using forward edges only. 


VT VT 
tt 
MA MA 
~~ 
ti SR ta RI 
DFS from NY CT BFS from NY~—» CT 
| ' 
NI NJ 


Ag. 16.3 Search trees from New York. The minimum spanning tree. 
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DEFINITION =A free is a connected graph with ae loops. Its N nodes are connected 
by N — 1 edges. If N =n, so every node is im the tree, it is a spanning tree. 


The path from VA to KY to TN to NC to VA is a loop (or cycle). If one of those four 
edges is removed, we have a tree. If two edges are removed, we have two trees (a 
small forest). 


EXAMPLES Allow an edge between neighboring states only when one state is even 
and the other is odd. Ate the lower 48 states connected? 


Start anywhere—say California, Apply either type of search—maybe DFS. Go to 
Arizona (7) tben Utah (4) then WY (9) then CO (8) then NM then OK then TX. 
(I am writing this on an airplane, looking at the map.} We will never get to Florida! 
It is blocked by Alabama and Georgia. 

The search creates a tree, but not a spanning tree. This graph is not connected. 

An odd-to-even graph is special and important. It is called ‘“‘dipartite,”” meaning 
two parts. The odd states are in one part, the even states are in the other. All edges 
go between parts. No edges are within a part.t 


EXAMPLE 4 Is there a ‘“‘complete matching” between 25 even and 25 odd states? 
This requires neighboring states to be paired off (with no repetition). 


Method 1 Start pairing them off: CA-AZ, UT-WY, NV-ID, NE-SD, WA-MT. 
What about Oregon? Maybe it should have been paired with Idaho. Then Nevada 
could pair with Arizona. Trial and error goes nowhere fast. 


Method 2 Think first. The four states CA-OR-WA-NV are even. This whole group 
is only connected to three odd states (AZ, ID, MT). The matching is impossible. 


468 A complete matching is impossible if a group of nodes in one part is 


connected to fewer nodes in the other part. If every group has enough connec- 
tions, the matching can be achicved. 


This is Hall’s Theorem. In a course on graphs, it would be proved. Our purpose here 
is to see the ideas and questions in discrete mathematics, more than the proofs. 


THE GREEDY ALGORITHM 


Put back ail edges between neighboring states. The nodes could be provinces of 
Canada or states of Australia. If they are countnes of Europe—Asia—Africa (or the 
Americas}, we need a new map. The essential thing is the new problem. 

In a network each edge has a “length.” A positive number c,; is assigned to the edge 
from node i to node j. In an economics problem, c;; is the cost. In a flow problem it 
is the capacity, in an electrical circuit it is the conductance. We look for paths that 
minimize these “lengths.” 


PROBLEM Find the minitnum spanning tree. Connect all nodes by a tree with the 
smallest possible total] length. 


The six cheapest highways connecting seven cities form a minimum spanning tree. It 
is cheapest to build, not cheapest to drive—you have to follow the tree. Where there 


tExactly half the states have an even number of letters (a real trivia question), This is the little- 
known reason for admitting Alaska and Hawait. 
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Read-through questions 


A graph is a set Vof_a__ andaset Eof_p_. With 6 are __h__, As nodes are reached, DFS looks out from the 
nodes, a complete graph has __c__ edges. A spanning tree i___ node for a new one. BFS looks out from _ j _. DFS 
has only__¢__. Atreeis defined as__e _, and it is spanning must be prepared to ___k__ to earlier nodes. In case of fire, 
if_¢ .Jthas_g path between each pair of nodes. BFS locates all doors from the room you are in before __1 
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is no edge we set c,;;= 00 (or an extremely large value, in an actual code). Then the 
algonthm works with a complete network—all n(n — 1)/2 edges are allowed. How 
does it find the minimum spanning tree in Figure 16,3c? 


Method 1 Always add the shortest edge that goes out from the current tree. 


Starting from node s, this rule chooses edges of length 1, 2, 7,4, 3. Now it skips 5, 
which would close a loop. It chooses 6, for total length 23. 


Method 2 Add edges in order, from shortest to longest. Reject an edge that closes a 
loop. Several trees grow together (a forest). At the end we have a minimum spanning 
tree. 


This yanation chooses edge lengths in the order 1, 2, 3, 4, 6 (rejecting 5), 7. In our 
network both methods produce the same tree. When many edges have equal length, 
there can be many shortest trees. 

These methods are examples of the Greedy Algorithm: Do the best thing at every 
step. Don’t look ahead. Stick to a decision once it is made. In most network problems 
the Greedy Algorithm is not optimal-——in this spanning tree problem it is. 

Method 2 looks faster than Method 1. Sort the edges by length, and go down the 
list. Just avoid loops. But sorting takes time! It is a fascinating problem in itself— 
bubble sort or insertion sort or heapsort. We go on to a final example of discrete 
mathematics and its algorithms. 


PROBLEM Find the shortest path from the source node s to each other node. 


The shortest path may not go along the minimum spanning tree. In the figure, the 
best path going east has length 1 + 8. There is a new shortest path tree, in which the 
source plays a special role as the “root.” 

How do we find shortest paths? Listing all possibilities is more or less insane. A 
good algorithm builds ont from the source, selecting one new edge at every step. 
After k steps we know the distances d,, ..., d, to the & nearest nodes. 


Algorithm. Minimize d; + c;; over all settled nodes i and ali remaining nodes j. 


The best new node j is a distance c,; from a settled node, which is a distance d, from 
the source. In the example network, the first edges are 1, 2, 7. Next is 8. The northeast 
node is closest to the source at this step. The final tree does not use edges 3, 5, 6-— 
even though they are short. 


These pages were written to show you the algorithmic part of discrete mathematics. 

The other part is algebra—permutations, partitions, groups, counting problems, 

generating functions. There is no calculus, but that’s fair. The rest of the book was 

written to show what calculus can do—Z Aope very much that you enjoyed it. 
Thank you for reading, and thinking, and working. 


To find a path from node i to node j, two search methods 
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In a bipartite graph, all edges go from one part to __m 
A matching is impossible if k nodes in one part are connected 
to__n__ nodes in the other part. The edges in a network have 
oc; Aminimum spanning treeis__P__. It can be found 
by the__a_ algorithm, which accepts the shortest edge to a 


new node without worrying about _ ' 


1 Start from one node of a hexagon (six nodes, six edges). 
Number the other nodes by (a} breadth first search (b) depth 
first search. 


2 Draw two squares with one node in common (7-node 
graph). From that node number all others by DFS and BFS. 
Indicate backtracks. 


3 How many spanning trees in the hexagon graph? 


4 Draw a spanning tree in the two-square graph. How many 
spanning trees does it have? 


§ Define a connected graph. If a graph has 7 edges and 9 
nodes, prove that it is not connected. 


6 Define a loop, If a connected graph has 8 eages and 9 
nodes, prove that it has no loops. 


7 Find the shortest path (minimum number of edges) from 
Maine to California. 


8 Which state is farthest (how many edges are needed) from 
the state you are in? Why would it come last in BFS? 


9 List the steps of BFS from your state to Georgia or 
Colorado or New Jersey. (There are edges Hawaii—California 
and Alaska—Washington.) 


10 With edges between odd neighboring states and between 
even neighbors, what is the largest connected set of states? 
Map required. 


11 With edges only from odd io even neighbors, how many 
states can be matched? (Answer unknown to author—please 
adyise.} 


12 A matching is a forest of two-node trees. Giye another 
description. 


13 Find the minimum spanning tree for network A. 
14 Find the shortest path tree from the center of network A. 


15 Is there a complete matching betwecn left and right nodes 
in graph B? If not, which group of nodes has too few 
connections? 


A B 


16 Find the loop in network B. Then find a minimum span- 
ning tree by Method 1 and Method 2. 


17 How many spanning trees in graph B? It has one loop. 


18 Show that a graph cannot have 0,1, 2,3, and 4 edges 
going into its five nodes. 


19 If the only edges into a node have lengths 6 and 8, can 
they both be in a minimum spanning tree? 


20 In Problem 19, prove that a minimum spanning tree con- 
tains edge (6) if it contains edge (8). 
21 True or false, with reason or example. 


(a} In a complete network, the minimum spanning tree 
contains the n—1 shortest edges. 


(b) If a graph has 9 nodes and 9 edges, it has a loop. 
(c) A graph with a complete matching must be connected. 


22 Draw a tree that is perfect for (a} DFS; (b) BFS. 


23 The adjacency matrix has @,;;=1 if there is an edge from 
node i to node j. Write down this matrix for graphs A and B. 


24 In a complete network start with d,;=c,,. Show that the 
d,; at the end of this program are shortest distances: 
for i= 1 to ndo 
for j= 1 tom do 
fork=1tondo 
d;,=max(d;;, di, + dy)) 
25 How many spanning trees in graph A? 


26 A maximum spanning tree has greatest possible length. 
Give an algonthm to find it. 


27 Write a code that will find a spanning tree (or stop), given 
a list of edges Sike (1, 2), (1, 3), (4, 7), .--- 


A-0 Answers to Odd-Numbered Problems 


CHAPTER 1 INTRODUCTION TO CALCULUS 
Section 1.1 Velocity and Distance (page 6) 


2for 0 <t< 10 0 for O<t< T 
1 v = 30,0, —30; v = —10, 20 8 v(t) = lfor 10 <t<20 v(t) = x for T <t< 27 

—3 for 20 <t < 30 0 for 2T < t< 37 
20for t<.2 f(t) = 20t for 
Ofor t>.2 ~ | 4 ~~ for 
13 f(t) = 0,30(t — 1), 30; f(t) = —30¢,-60,30(t-6) 15 Average 8,20 17 40t~80for1<t<2.5 
210<t<3,-40< f <20;0<t<37,0< f < 60T 23 3 — Tt 25 6t — 2 27 3¢+7 

8 for O0<t<T 8 for0<t<T 
—2Z < < = = ae eo 

a0 slope tte aes Shay) { -2 for T<e<srt fl) { 10T—2¢ for T<t<5T 
33 2C + 32; slope 2 85 f(w) = 73593 slope = conversion factor 871<t<5,0<f <2 
s90<t<5,0<f<4 410<t<5,1<t<32 43 4¢+4;4¢+ 2; 2¢+ 12; 2¢+3 
45 Domains -1<t<1: rangesO<2t+2<4, -3<t-2<-1, -2<-f(t)<0, O< f(-t)<2 


5 25;22;+10 76;-30 Q9v{t)= { . 11 10%; 122% 


47 Vv; sv 49 input * input — A input input -A B#B-+C input +1— A 
input +A — output input +A + B B+C-— output AtA—~B 


A+ B— output 
51 3t+ 5,3t4+ 1,6¢ — 2, 6t — 1, —3t — 1, 9t — 4; slopes 3,3,6,6,-3,9 


: 2(2t O<#<1.5 12-—2(12-—2t) 3<t< 4.5 
53 The graph goes up and down twice. f(f(¢)) ={ ee 15<t<3 212 a ) 45<t<6 


Section 1.2 Calculus Without Limits (page 14) 


12+5+43= 10; f = 1,3,8,11;10 3 f = 3,4,6,7,7,6; max f at v= 0 or at break from v = 1 to —-1 

5 1.1,—2,5; f(6) = 6.6,—11, 4; f(7) =7.7,-13,9 7 f(t) = 2 for t < 5,10 + 3(t — 5) for t > 5; f(10) = 25 

9 7, 28, 8t+ 4; multiply slopes 11 f (8) = 8.8, —15, 14; af = 1.1,-2,5 

18 f(z) = 3052.50 + .28(z — 20,350); then 11,158.50 is f(49,300) 15 191% 

17 Credit subtracts 1,000, deduction only subtracts 15% of 1000 19 All vu; = 230; = (—1)?~1;v, = (4)? 
21 L’s have area 1,3,5,7 28 fj = j; sum j? +3; sum 4 +2 25 (101? — 99%)/2 = 42 27 v, = 27 29 fg, =5 
81 a; =—f; 33 0;1;.1 35 v= 2,6, 18,54;2-39-! 87 AL = 1,.7177, 6956, 6934 — In2 = 6931 in Chapter 6 
39 v, = —(4) 41 v,; = 2(-1)’, sum is f; —1 45 v = 1000,t = 10/V 
47M,N  51V9<2-9<9? < 2% (4)? < 2(2) < 1/9 < 21/9 


Section 1.3 The Velocity at an Instant (page 21) 
16,6, 2a,-12,0,13 $4,3.1,3+h,2.9 5 Velocity att=1is3 7 Area f =t+t?, slope of f is 1+ 2¢ 


9 F; F; F; T 11 2; 2¢ 18 12 + 10¢?; 2 + 10¢? 15 Time 2, height 1, stays above 3 from ¢t = 3 to 3 
17 f(6)=18 21 v(t)=—2¢ then 2¢ 28 Average tot =5 is 2;v(5)=7 25 4v(4t) 27 vaye = t, v(t) = 2t 


Section 1.4 Circular Motion (page 28) 
1 10x, (0, —1), (—1,0) 3 (4cost,4sint);4 and 4t;4cost and —4sint 


5 3t; (cos 3t, sin 3t); —3 sin 3t and 3 cos 3t 7 x = cost;/2/2; -V2/2 9 27/3; 1; 20 
11 Clockwise starting at (1,0) 13 Speed 2 15 Area 2 17 Area 0 


Answers to Odd-Numbered Problems A-1 


19 4 from speed, 4 from angle 21 i from radius times 4 from angle gives 1 in velocity 

23 Slope 3; average (1 — 8 )/ (7/6) = a(2-v8) =.256 25 Clockwise with radius 1 from (1,0), speed 3 

27 Clockwise with radius 5 from (0,5), speed 10 29 Counterclockwise with radius 1 from (cos 1, sin 1), speed 1 
$1 Left and right from (1,0) to (-1,0), v= —sint 83 Up and down between 2 and —2; start 2sin 0, v = 2.cos(t+0) 


35 Up and down from (0, —2) to (0,2); v = sin 3¢ 87 «= cos 2%, y = sin 254, speed 2%, vup = cos 2zt 


Section 1.5 A Review of Trigonometry (page 33) 


1 Connect corner to midpoint of opposite side, producing 30° angle Sx 7 £ — area r79 
9 d= 1, distance around hexagon < distance around circle 11 T; T; F; F 

13 cos(2t + ¢) = cos 2tcost — sin 2¢ sint = 4cos® t — 3 cost 

15 3 cos(s — t) + 4.cos(s + t); $ cos(s — t) — 3 cos(s + t) 17 cos@ = sec# = +1 at O= ne 

19 Use cos(% — s — t) = cos($ — s) cost + sin(% — s) sint 23 6 = 3+ multiple of 2x 

25 6 = $+ multiple of x 27 No 6 29¢=4 $1 |OP| = a, |OQ|=6 


CHAPTER 2 DERIVATIVES 
Section 2.1 The Derivative of a Function (page 49) 


1 (b) and (c) 8 12+ 3h; 13 + 3h; 3; 3 5 f(z) +1 7-6 922+ Azc+1;22+1 
ia t= a) a 18 7; 9; corner 15A=1, B=-1 17 F; F; T; F 

19 b= B;m and M;m or undefined 21 Average 22+ 2, — 2214 

25 4; no limit (one-sided limits 1,—-1);1;1ift#0,-1ift=0 27 f'(3); f(4) — f(3) 

2922*(42°)= 82" 81 = = Zi; = 88 GL = — 3; f'(2) doesn’t exist 85 2f L = 4u° Ht 


Section 2.2 Powers and Polynomials (page 56) 


1 62°; 3024; f’""’ = 720 = 6! 8 22+7 5 1+ 22 + 32? + 42° Tn”! —~nz-"-1} 
91+2+ 427+ 12° 11 —4,(—4)+5 18 2~7/3; 4/5; -1,-4/8 
15 3827 -1=Oatz= Js and 3 17 8 ft/sec; — 8 ft/sec; 0 19 Decreases for —1 < 2 < 3 
+h)- 1 5(n — hh = 
21 Onl rye «28.1 5 10 105 1 adds to (1+1)°(2 == 1) 
25 327; 2h is difference of 2’s 27 44 = 22+ Az+327+3zAz+(Az)? — 22+32? = sum of separate derivatives 
29 72°; 7(x + 1)® $1 32‘ plus any cubic S$ 2+ha?+ isi +ist+C 35 txt, p25 


87 F; F; F; T; T $9 & = 12 so 4H = 3(.12); six cents 41 4¥ =k saz - £), #=-S 
43 E= 522, 45 t to </2t 47 3,219; 12"; divide byn+1=0 


49 .7913, —3.7913, 1.618, —.618; 0, 1.266, —2.766 


Section 2.3 The Slope and the Tangent Line (page 63) 


1 =¥3y—6 = —3(2 — 2);y-6 = $(z-2);y-6=—3(2-2) Syt+1=3(2-1);y= 32-4 
5 y= 2; (3,3) Ty-—a? =(c+a)(z—a);y—a? =2a(z—a) y= 227? +2;y-7=-2(2- 5) 
lly=1c2=f 18 y—1=-4(z-—a);y=2,2=20;2 15¢=4, tangent atz=2 


A-2 Answers to Odd-Numbered Problems 


17 (—3, 19) and (3, 33) 19c=4,y=3- <2 tangent atzx=1 

21 (1+A)?; 3h + 3h? + h®; 3+ 3h + h?;3 23 Tangents parallel, same normal 

25 y = 2az — a”, Q = (0, —a”); distance a? + 4; angle of incidence = angle of reflection 

27 z= 2p; focus has y = 5 = p 2 y- saat it=-Z= -V2 

81 y—a?=—2(c-a);y=a?+2;a=%8 33 (24)(1000) =10atz=10hours 385a=2 

37 1.01004512;1+ 10(.001) = 1.01 39 (2+ Az)? — (8+ 6Az) = 6(Az)? + (Az)® 412, =8;2,= 4% 
43 T = 8sec; f(T) = 96 meters 45 a> 4 meters/sec? 


Section 2.4 The Derivative of the Sine and Cosine (page 70) 

1 (a) and (b) 3 0;1;5;2 5 sin(z + 2m); (sinh)/h — 1; 20 7 cos? 6 ws 1— 6? + 164; £64 is small 
9 sin 46 =] 36 11 354 13 PS = sinh; area OPR = sinh < curved area th 

15 cosa = 1-— an + ot sede 17 3_(cos(z + h) — cos(z — h)) = 2(—sinzsinh) > —sinz 

19 ¥ =cosz—sinz=Oatz=F+n0 21 (tanh)/h =sinh/hcosh < 4; > 1 

23 Slope Scos$a= $,0,—3, 3; no 25 y=2coszt+sinz;y” =—-y 27 y = —5 cos3a;y = 4 sin 3z 


29 In degrees (sin h)/h —+ 27/360 = .01745 81 2sinzcosz + 2cosz(—sinz) = 0 


Section 2.5 The Product and Quotient and Power Rules (page 77) 


1 2 S aaa = Gee Be ee) ee ee) ee ee) 
7-2*sinz+4zcosz+2sinz 9 2c-1- inte uy 2./zsinzcosz + 4271/2 sin? 2+ 3(sin2)~1/? cosz 

18 42° cos z—z‘ sin z+cos* z—4zcos*z sinz 15 42? cosz+2zsinz 17 019 —8(z—5)—5/3 + 8(5—2)-5/3(= 0) 
21 3(sin zcos z)?(cos? z — sin? x) + 2cos 2z 23 wowet+vuwz+wluezt+z'uvw = -. 25 — csc” 2 — sec? x 


27V= ce VS cost— a ee t? sint A= 2(aa +tcost + gost) A= = 2(cost — tsint + taeeet sint 


t+1 
2t°+3t? , 2t°+6t?+6t 
29 10% for #< 10, Blox for > 10 31 antl aa 
88 uly + 2ulu! + uo"; uly + 8uly! + 3ulu” + ul” 35 4 sin’? t; 3 tan? ¢; 2i(1 + t)3/? — 1] 
89 T; F; F; T; F 41 ee 2n — 1/ degree 2n “a v(t) = Gi tsint(t < 2) v(t) = -E£(t> >2 
45 a: ae + tae has 42 = 0 at z= 0 (no crash) and at z = —L (no dive). Then # - = S¥h( a, + £) and 
a = “Lp? rls a 1). 


Section 2.6 Limits (page 84) 


14,L=0, after N = 10; 28,00, no N; 4,0, after 5; 1.1111, 22, all n; V2, 1, after 38; /20 — 4, 3, all n; 
$25 ¢ = 2.718---, after N = 12. 3 (c) and (d) 
5 Outside any interval around zero there are only a finite number of a’s_ = 7 8 9 La) £(0) 111 
131 15 sin 1 17 No limit 19 21 Zero if f(z) is continuous at a 23 2 
25 .001, .0001, .005, .1 27 |f(z) — Ll; 7 29 0; X = 100 88 4; 00;7;7 85 3; no limit; 0; 1 
39 .0001; after N = 7 (or 8?) 413 


43 rere ee pat) 60 
45a,—-L<b,-—L<c,—Ls0 |b, — L| <« if |a, — L| < € and |c, —L| <e 


Answers to Odd-Numbered Problems A-3 


Section 2.7 Continuous Functions (page 89) 
lc=sin1l; noc 3 Any c;c =0 5c=Oorl;noc Tc=1;noc 9 no c; noc 
Wester sy 18 ¢=-1;¢=—1 1c=1;¢=1 Wec=-1ljc=-1 


19 c= 2,1,0,—1,---; same c 21 f(z) =0 except at z=1 23 /z—1 25 -fi 27 35 

29 One; two; two 31 No; yes; no 88 xf(zx),(f(z))?, 2, f(z), 2(f(z) — 2), f(x) + 22 85 F, F; F; T 
37 Step; f(z) = sin + with f(0) =0 89 Yes; no; no; yes (f4(0) = 1) 

41 o(3) = f(1) — £(3) = £(0) — £(3) = —9(0); zero is an intermediate value between 9(0) and g(3) 


43 f(z) —zis >Oat s=Oand <Oatz=1 


CHAPTER 3 APPLICATIONS OF THE DERIVATIVE 
Section 3.1 Linear Approximation (page 95) 


1Y=z 3 Y =1+2(z- %) 5 Y = 2n(a — 2m) 7 2°+6-25..001 91 
11 1—1(—.02) = 1.02 18 Error .000301 vs. 4 (.0001)6 15 .0001 — $10-® vs. 3(.0001)(2) 
d = — 
17 Error .59 vs. 3(.01)(90) 19 2V1-az= Wes —} atz=0 
21 EV tu pee = atu=Qct ects, 28 dV = 3(10)?(.1) 
25 A= 4ar?,dA = 8ar dr 27 V = ar*h, dV = 2arh dr (plus ar? dh) 291+ dg 31 32nd root 


Section 3.2 Maximum and Minimum Problems _ (page 103) 
1 «= -—2: abs min 8 2=-—1: rel max, z = 0: abs min, z = 4: abs max 
5 g = —1: abs max, z = 0,1: abs min, z = } : rel max 72 =-3: abs min, z = 0: rel max, z = 1: rel min 


92=1,9:absmin,2=5:absmax I12= i : rel max, z = 1: rel min, = 0: stationary (not min or max) 
13 c= 0,1,2,---: abs min, z = 4, 3, 8,---: abs max 15|z| < 1: all min, c = —3 abs max, z = 2 rel max 
17 + =0: rel min, t= 2: abs max, z= 4: abs min 
19 z =0: abs min, z = 7: stationary (not min or max), z = 2”: abs max 
21 6 =0: rel min, tan? = —4 (sin@ = 4 and cos@ = —2 abs max, sin 6 = —# and cos@ = 2 abs min), 
6 = 2x : rel max 
23 h = 3(62” or 158 cm); cube 25 aotaei 2Vab gallons/mile, VE miles/gallon at v = v2 
27 (b) 6 = 8 = 67.5° 29 x = 45; compare Example 7; ¢ = V3 


31 R(x) — C(z); Ria) ay ak — ag. profit 882= afi zero 852=2 


=z ) dz 
37 V = 2(6 — 32)(12 — 2z);2 1.6 39 A= mr? + 27,0 = 3(4—- 29); rin = Fy 
41 max area 2500 vs 49900 = 3185 438 2=2,y=3 45 P(x) = 12 — a; thin rectangle up y axis 
47h= Hrs aR Vv = Anko = 4 of cone volume 
49r= HHoRy; best cylinder has no height, area 27 R? from top and bottom (?) 


SBlr=2,h=4 58 25 and 0 55 8 and —co 


o. 2 2 —z)2- & — 2 — _ ae ah aaa 
57 Vr? +22 + J/¢q? + (s— 2)?; F = ae ac = 0 when sin a = sinc 
59 y = 2? = 3 61 (1,-1),(#2,-2 63 m= 1 gives nearest line 65m=i 67 equal; z = 3 


69 1,7 71 True (use sign change of f”) 
78 Radius R, swim 2Rcos@, run 2R6, time 222088 + 28; maz when sind = >, min all run 


A-4 Answers to Odd-Numbered Problems 


Section 3.3 Second Derivatives: Bending and Acceleration (page 110) 
8 y=-1-27;no--- 5 False 7 True 9 True (f' has 8 zeros, f” has 7) 
11 c=3 is min: f"(3)=2 18 2=0 not max or min; z= $ is min: f"($) = 81 


15 2 = 3 is max: f"(3#) = -V2;2 = © is min: f"(%) = V2 

17 Concave down for z > } (inflection point) 

19 «= 3 is max: f"(3) = —4;2 = 2,4 are min but f”=0 21 f(Az)=f(-Az) 2814+2-% 
25 1- = 27 1-$2-—i27 29 Error if"(z)Az 31 Error OAz+ 3 f'"(x)(Az)? 


37 - 35 = 1.010101; + oe ie = .90909 89 Inflection 41 18 vs. 17 43 Concave up; below 


Section 3.4 Graphs (page 119) 


1 120; 150; $2 3 Odd; z=O0,y=<2 5 Even; z= 1,2=-l1,y=0 7 Even; y= 1 9 Even 
11 Even; z= 1,2=-—-l1,y=0 138 2=0,2=—-l1,y=0 15 z=1,y=1 17 Odd 19 22 
21 2+ + 28 V2274+1 25 Of the same degree 27 Have degree P < degree Q; none 
29 =1and y= 32+C if f is a polynomial; but f(x) = (x — 1)!/9 + 3z has no asymptote z = 1 
$1 (x —3)? 89 2=V2,2=-V2,y=2 41 Y=100sn22% 45 ¢=3,d=10;c=4,d=20 
47 z* = V/5 = 2.236 49 y=2-—2;Y = X;y = 2x 51 amax = -281, 2min = 6.3393 ting = 4.724 
58 amin = -393, Zmax = 1.53, 2min = 3.33; Zina = .896, 2.604 
55 min = —.7398, 2max = -8135; Zing = .04738; chiowup = £2.38 57 8 digits 


Section 3.5 Parabolas, Ellipses, and Hyperbolas (page 128) 


1 dy/dz =0 at $2 3 V = (1,—4), F = (1, -3.75) 5 V = (0,0), F = (0,-1) 7 F =(1,1) 
= (0, +3); F = (0, +8) 11 V = (0,41); F = (0, +,/3) 13 Two lines, a= b =c =0;V = F = (0,0) 


15 y=52?-—42 17 yt p= V2? + (y—p)? — dpy = 275 F = (0, 75), 9 = — -3;(44,4 
19 t= ay? with a > O;y= 242" y= —az? + az with a> 0 


21 ay? mw 2 EM 4 fy 1 wet 2g 24 Hy, (ena)? vt _ 1592 4 2 = 95 
25 Circle, hyperbola, ellipse, parabola a7 du ae aa y= ot +5 2983 = wr - 4) 
$1 Circle; (3, 1);2;X = #33, Y = 4 83 32/2 + y!2 =2 35 y? ecaie 1; — 4 =1;y? — 2? = 


37 = = ve =1 39 ee Ay + 4,22? + 122 + 18; —14, (—3, 2), right-left 
41 F=(4¥%8,0);y=4+2 48 (c+y+1)? =0 
45 (a? — 1)x? + 2abzy + (6? — 1)y? + 2aca+ 2bcy + c? = 0;4(a? + b? — 1); if a? +b? <1 then B? -4AC <0 


Section 3.6 Iterations z,,,; = F(zn) (page 136) 


1 —.366; 00 3 1;1 5 2;+00 7 —2;-2 

9 1s attracts, its repels; ‘ attracts, 0 repels; 1 attracts, 0 repels; 1 attracts; z attracts, 0 repels; 
+/2 repel 

11 Negative 13 .900 15 .679 17 |a| <1 19 Unstable |F’| > 1 21 2* = 74 ;\a|<1 


Answers to Odd-Numbered Problems A-5 


23 $2000; $2000 25 z0,6/20,20,6/20,--- 27 F’ = —2,-3/2 = —} at x” 
29 FY =1-2cz=1- 4c at 2* = 2;0<¢ < i succeeds 
$1 F’ = 1-9c(z — 2)® = 1- 9c at 2* = 3;0<c < 2 succeeds 

1 


3 i - ei ° 
83 on41 = In — 25" jont1 = tn — ee 8B a* = 4 if zo > 2.5; 2° = Lif mo < 2.5 
n 


87 m=1+¢ at s* =0,m=1-—c at z* = 1 (converges if 0 < c < 2) 390 43 F'=1 at 2* =0 


Section 3.7 Newton’s Method and Chaos (page 145) 


1 ting1 = tn — Bae = Bea + gy 5 2 = 29; 2 is not defined (co) 72z* =1 or 5 from zp < 3, 2% > 3 


ke 
11 2 < } to z* =0;29 >} to z* =1 21 2n41 = 2p — Goer 23 24 = cot x = 00;25 = cot 8* = cot = 
a 2 2 
25 mis not afraction 27= $22+4+ az = (atu = 5 29 162 — 8027 + 1282 — 6424; 4; 2 
$1 |zo| <1 83 Az = 1, one-step convergence for quadratics 35 al — = 5.28: ¢9 = = 1.86 


87 1.75 < 2* < 2.5:1.75 < a* < 2.125 398:;3<a2* <4 41 Increases by 1; doubles for Newton 
45 21 = 29 + cotzp = 29 + & gives Zo = 21 + cota, =aj+7 49 a = 2, Y's approach 3 


Section 3.8 The Mean Value Theorem and )’H6pital’s Rule (page 152) 


1c=y/$ 3Noc 5c=1 7 Corner at 4 9 Cusp at 0 
A1sec?z—tan?z=constant 186 15-2 17-1 19m 21-2 28Not > 
25 —1 27 1; 47222 has no limit 29 f'(c) = “a1; c= V7 


810 = 2" — tags + ahr(aqy 2” — tn)? gives M fe} 33 (Oh AM; singularity a5 AG} E87 


CHAPTER 4 DERIVATIVES BY THE CHAIN RULE 


Section 4.1 The Chain Rule (page 158) 


Iz=y°,y= 2? —3,2' = 62(z? — 3)? 82z=cosy,y = 25,2’ = —327 sin 2° 
52=/y,y=sing,2'=cosz/2Vsinz Tz =tany+(1/tanz),y = 1/2,2' = (>t) sec?(+) — (tanz)—? sec? z 
9z=cosy,y=a?+2+1,2' = —(22 +4 1)sin(z? + 2+ 1) 1117 cos 17z 13 sin(cos x) sin z 
15 =? cos z+ 2zsin x 17 (cos /z + 1)3(2 + 1)-1/? 19 3(1 + sin z)~1/2(cos z) 21 cos(=+_)(>$382 
28 82" = 2(2?)?(2z?) (22) 25 2(2 + 1) + cos(z + x) = 22 +2-—cosz 
27 (x? + 1)? +1; sin U from 0 to sin 1; U(sin x) is 1 and 0 with period 2x; R from 0 to z; R(sin z) is half-waves. 
29 o(z) = +2, A(z) = 27 + 2;k(z) = 3 31 f'(f(z)) f'(z); no; (—1/(1/z)?)(-1/z?) = 1 and f(f(z)) =z 
33 a(52 + 8) +8; 32+ 14; 36 16 35 f(9(z)) = 2, 9(f(y)) =y 
87 f(a(2)) = zz, 9(f(z)) =1- 2,4(F(2)) = 2 = of ale), ol fale) = s2z = Fol f(2))) 
89 f(y) =y—1,9(z) =1 43 2cos(z? + 1) — 4x? sin(z? + 1); —(z? — a 8/2; _ (cos /z)/4z + (sin /z) /425/2 
45 f'(u(t))u’ (¢) 47 (cos? u(z) — sin? u(z)) 49 2zu(z) + ag? du 51 1/4Vi-Vi-z Vi—a 
BS8df/dt BB f'(g(z))g'(z) = 4(2°)°32? = 122'! = 7 3600;2;18 593; 4 


A-6 Answers to Odd-Numbered Problems 


Section 4.2 Implicit Differentiation and Related Rates (page 163) 
n— - d 
1-2 1 fy” : 3 a =1 5 du = Fig) 7 (y? = 2xy)/ (x? = 2zy) or 1 9 any OF er, 
11 First a = —¥£, second du — =@ 13 Faster, faster 15 222! = yy’ + 2’ = Hy! = y’ sind 
17 sec?6=52- 19 sno 500\/1+ (4%)? =. 21 H = —8; Y = 2/3; 00 then 0 
23 V = arth; B= AY = — in/sec 25 A= labsing, 44 =7 27 1.6 m/sec; 9 m/sec; 12.8 m/sec 
29-2 31% = = 2d; an = 4 cos? 92; 9" = s088y" — 4 cos? O sin O(y’)? 
Section 4.3 Inverse Functions and Their Derivatives (page 170) 
1z= 48 8 2=/yt1 ( unrestricted — no inverse) 5z= rE Tz=(1+y)/* 
9 (z unrestricted — no inverse) y= >> 13 2< f-'(z) <3 15 f goes up and down 


17 f(z)g(z) and F{a) 19 m40;m>0;|m|>1 21 @ = 524, . = zy 4/5 
23 au = 827; $2 = 3(1+y)~ a/s 25 2 = Grip, = roaE 27 yidy? +C 
= al -1 = 
29 f(9(z)) = -1/32° i” (y= Figg" (2))=2 392/¥3 = 41 1/6 cos9 
43 Decreasing; = = aide <0 45 F; T; F 47 g(x) =a", fly)=y",2= (ipin 


49 g(x) = 2°, f(y) = y + 6,2 = (2 - 6)/° 1 g(x) = 10°, f(y) = logy, z = log(10") = y 
538 y= 2°,y" = 62,d*z/dy’ = —2y 8/8; ne ,sec /m? 55 p= a -—1,0<y<1 


57 max = G= 8 yt/5 Gq = byi/s 59 y7/100 


Section 4.4 Inverses of Trigonometric Functions (page 175) 
10,5,0 3 5,0,7 5 x is outside [—F, 5 Ty = —V3/2 and 3/2 
9 sing = V/1— y?;/1— y? and 1 12 Meg cone = 1» Sag — 2h = bee 
18 y=0:1,-1,1,y=1:0,0,4 15 F; F; T; T; F; F 17 #& = dimes 19 2 =3 
1-2 id 
dz _ 2sin 12 _ ysin7ty dz _. 1 = du = 2 
213 = Werth 23 1 Vine 25 Fy ies 2Tu=1s0 4G; =0 Sl secz = Vy*4+1 
= d 
33 3,1,3 35 -y/V/1—y? 37 }sec £ tan % 39 ae 41 = mer} 
43@otr, 4Tua4sin''y 4907 51-7/4 


CHAPTER5 INTEGRALS 
Section 5.1 The Idea of the Integral (page 181) 


11,3,7,15,127 3—}—i-1=1-1 5 f;-fo= t= 7 32 for 2 <7,72—4forz>1 


9 bt & ava 11 Lower by 2 13 Up, down; rectangle 15 Va+ Az — Jz; Aa; 2; /z 
17 6; 18; triangle 19 18 rectangles 21 62 — $a? —10;6-—2 23 od 25 27; 2; so 


Answers to Odd-Numbered Problems A-7 


Section 5.2 Antiderivatives (page 186) 


12° + 226; 8 3 2,/z;2 5 $4/5(1 + 21/9); $(1 4 22/3) 7 —2cos z— 3 cos 2x; 8 — 2cos1— 4cos2 
9 zsinz + cosz;sinl+cos1—1 11 4 sin? x; sin? 1 13 f =C;0 15 f(b) — F(a); fr —fe 
178+ 19 2(14+ V3); 2(3+-V3);2  218;298;00 28 f(z)=2/2 25 3, below —1; 1,8 

27 Increase - decrease; increase - decrease - increase 

29 Area under B — area under D; time when B = D; time when B — D is largest 33 T; F; F; T; F 


Section 5.3 Summation Versus Integration (page 194) 
100 
1 28516 8 127; 2"+1 1 5 ya = 2550; } (27 — 1) = 10, 000; >t 1)*+1/k = — 
t=1 
7 > a,x”; done 9 5.18738; 7.48547 11 2(a? +6?) 9 .182"-1;4-2 A158; T 
k=0 


eae: 19 f, = 1;n? + (2n4+ 1) = (n+1)? 

2la+b+c=1,2a+ 4b + 8c = 5,3a+ 9b + 27c = 14; sum of squares 23 Szo0 = 80200; E409 = .0025 = 4 
25 Si00,1/3 © 350, B100,1/3 & -00587; Sio0,3 = 25502500, E100,3 = .0201 27 v, and v2 have the same sign 
29 v, = 9,2 = 12, 50 = 21 $1 At N =1,2'-? is not 1 33 0;2(v, +--+ + u,) 


35 Aa) ru(jA2) 87 f(1)— f(0) = ff Ldz 


j=1 
Section 5.4 Indefinite Integrals and Substitutions (page 200) 

12(2+2)9/7+¢C 3 (2 +:1)"t!/(n +1) + C(n # -1) 5 (2? +1)§+C 7—icostz+C 
9-Leost2z+C Alsin7t+C 18 4(1+¢7)9/?-(1427)/74+C0 152/2+2+C 
WFsecr+C 19-cosz+C 21 329+4+329/2 = a3—2(1-22)9/2? thy = V2n 

27 42? 29 asinz + bcosz 31 4$2°/2 33 F; F; F; F 35 f(x — 1);2f(2) 

87 z—tan“!2 89 fidu 41. 4.977 + Cit + Cp 43 f(t + 3); f(t) + 3¢; 3 f(t); § £(3t) 


Section 5.5 The Definite Integral (page 205) 


1C =—f(2) 8 C= f(3) 5 f(t) is wrong 7C=0 9 C = f(-a) — f(-b) 


11 u= 27 41; fo yids = wry? = Bal 18 u= tang; fp udu=2 
15 u= seca; [Yu du= 2 (same as 18) Wu=t,c=1,de= =H; 1/3 =du 


19 S=3(341)*+3(1+1)4;s = $(0) + 3(3 +1)* 

21S = 3[(5)> + 19 + (3)% + 29); s = 3 [0% + (3)> + 15 + (8)] 

23S = 4[(32)* + (8)* + (28)* + 24 25 Last rectangle minus first rectangle 

27 S = .07 since 7 intervals have points where W = 1. The integral of W(z) exists and ae zero. 

29 M is oo so Problem 25 aes S — s = Az(1—0); area from graph up to y = 1 is 3 -1+3 a: 3 te-= 
A1+d¢+34+--)= oy= 2; area under graph is }. 

31 f(z) =3+ Jy u(z)dz; f(z) = fy u(z)dz 38 T;F;T;F;T;F;T 


A-8 Answers to Odd-Numbered Problems 


Section 5.6 Properties of the Integral and Average Value (page 212) 


1d= tp a*da = 1 equals ct ate =+(1)/* 8 6= +1 fF cos? dz =} equals cos*c atc = * and 3 
5o= PP a¢ = 2 equals 4 at c= 2 7 fp v(2z)dx 9 False, take v(x) < 0 


11 True; if, v(z)do+ 2-3 JP v(2)dz = 1° v(z)dz 18 False; when v(x) = x? the function 2? — 2 is even 
15 False; take v(x) = 1; factor 3 is missing 175= aS v(z)dz 19 0 and 2 

21 v(z) = Cz; v(x) = C. This is “constant elasticity” in economics (Section 2.2) 23V-0;V—->1 

25 1 f?(a—2)dz =a+ lifa> 2;3 J? |a-2|dz= 3 area = a+ 1 if a < 2; distance = absolute value 

27 Small interval where y = sin # has probability a, the average y is J. sind 4@ = 2 
29 Area under cos@ is 1. Rectangle 0 < 6 < §,0<y< 1 has area a Chance of falling across a crack is i a 


314,3,---,4;105 388 1 f* 22006 28tde = 1. 220 £9 sin 28 = Vave 


2 
*. 


35 Any v(z) = veven(z) + voaa(z); (2 + 1)® = (32? + 1) + (2° + 32); 45 = Go - 
37 16 per class; £&; E(x) = 1800 = 228 39 F; F; T; T 


at ste) = { oy 28 \+c:s0)- 0) =84$=9 


Section 5.7 The Fundamental Theorem and Its Applications (page 219) 


1 cos? x 30 5 (x)? (22) = 227 7 vo(z +1) — v(z) 9 sin’ _ Jy fo sin? t dt 
11 fF v(u)du 180 152sinz? 17 u(z)v(z) 19 sin™* (sin z) cos z = zcosz 

21 F; F; F; T 23 Taking derivatives v(x) = (acos zx)’ = cos z — zsinz 

25 Taking derivatives ia ae 1)= ule) so v is even 27 F; T; T; F 

29 fi u(t)dt = fy v(t)dt — So u(t)dt = =45 — zhy (in revised printing) 

31 V = 8°; A = 3s; half of hollow cube; AV = 387dS; 3s? (which is A) 


33 nie = 2n?r3 35 Wedge has a ae rs height of triangle; je? 317d0 = = a 


do fall do tand m/4 _ 
sTr= Ut Zcos? 9? 2cos?6 ~— lo 


89z=y ar i y*dy = ep = §; vertical strips have length 2— /z 
41 Length 2a; width 44; ans ada = 43 The differences of the sums f; = vy +vg+---+,; are fj —fj;-1 = 0; 


45 No, fy a(t)de = £ ay = 4 (0) and ie a(t)dt)dz = f(1) — f(0) — # (0) 
Section 5.8 Numerical Integration (page 226) 


1 2A2(v9 — vn) 3 , 15625, 302550, .0625,.2025 5 Lg  .1427, Ts  .2052, Sg as .2000 

7 p=2: for y=’, 2-(2)7? +3-12 43 9 For y = 2”, error (Az)? from $ — 3,y, = 2Az 
18 8 intervals give aby rd ft oy] = gag < -001 15 f"(c) is y'(c) 17 00; .683, .749, .772 + = 
19A+B+C=1,iB+C= eae 3; Simpson 

21 y=1and z on (0,1): L, =1and $—3+,R, =1 and 3 + 34, so only 3L, + 3 Rn gives 1 and 

28 Tyo F 500, 000, 000; Too # 50, 000,000; 25, 0007 

25a=4,b=2,c=1; Jo (42? + 22+ 1)dz = 12; Simpson fits parabola 27 c= 735 
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CHAPTER6 EXPONENTIALS AND LOGARITHMS 


Section 6.1 An Overview (page 234) 


1 5;—5;—1; 3; 2;2 5 1;—10;80;1;4;—-1 7 nlog,z 9 2:3 138 10° 
15 0; Isr = 107Jo; 8.3 + logyo 4 17 A=7,b=2.5 19A=4,k=1.5 
214;2;log2 28 y—1=ca;y—10=c(z—-1) 25 (.17" — 1)/(—h) = (10" — 1)/(—A) 


27 y= c7b?; 2" = -1/cy* 29 Logarithm 


Section 6.2 The Exponential e” (page 241) 
1 49¢7* 3 8e8* 5 37In3 7 (2)? In2 9 at 112 13 ze? 15 Gq 
17 "7 cosz+e*cose*™ 19 .1246, .0135, .0014 are close to = 21 3; + 
23 Y(h) =14+ 4;Y(1) =(14+ $) =2.59 9-25 (1+ 4)%@ <e <6 < 8/2 < @?* < 107 < 2? 

3a Tr (2 e)* -2? 
27 = 29242545 31 ane 83 - - % 

3a 

35 2e7/2? 4+ = 37 e~* drops faster at x = 0 (slope —1); meet at s = 1;e77 *fem® < e PfeF < qa for x > 3 
39 y — e* = e*(z — a); need —e* = —ae* ora =1 
41y =27(Inz+ 1) =0 at tmin = 35 y = 2?[(Inz +1)? + 2] >0 
3 £(e-7y) = etd — ety = 080 e~*y = Constant or y = Ce* 

32 : Fa 2-2 a S sing 
45 S|) = =F 47 25)h1 = ad = ins 49 —e" "| =1 51 et **]) =e? —€ 53 716 =0 
55 fue a2 deg = —e-“ + C; S(e")P#¢dz = de" + 57 yy’ = 1 gives dy? =2+C ory= V2zr42C 
59 4F = (n — z)2"~“!/e* <0 for z > n; F(2z) < constant 9 61 Stan © 117; (£)° © 116;7 digits 
Section 6.3 Growth and Decay in Science and Economics (page 250) 
1 t? + yo 3 yoe** 5 10 e4#;¢ = Into 7 Lett 49.75; t = 861 lic = 2;4 = Ini0 
13 88 In(t) = abe = B2;¢=2In(2) ate = SURE) — 19 e° = 3 50 yo = 971000 = 1000 


21 p= 1013 e; 50 = 1013 7; ¢ = 4 In(=8%); p(10) = 1013 et = 1013/3895 = ,/(1013)(50) 

28c= in2;(4)8 = 7 25 y = yo — at reaches y, at t = “°—"1; then y= Ae~%/v1 27F; F; T; T 
29A=1,B=-1 31e°-—1 831—e7* 35 6;6 + Ae~*: 6 — Ge~7#, 6 + 4e—7#. 6 

4 39 ye~*; y(t) = te’ 41A=1,B=-1,C=-1 43 €975 > 075 45 s(e—1); s{e~1) 


AT (1.02)(1.03) — 5.06%;5% by Problem 27 49 20, 000 e(?0-T)(.95) — 34, 400 (it grows for 20 — T years) 
51s = —cye™/(e* — 1) = —(.01)(1000)e®°/ (eS — 1) 53 yo = (1 - e~:008(48)) 


55 et = 1.20 s0 c = 420 57 24€965 =? 59 To —oo; constant; to + 00 
61 & = 60cY; # = 60(—-Y + 5); still Y =5 


63 y = 60e* + 20,60 = 60e!?° + 20,c = 35; In(48); 100 = 60e + 20 at t = 2 In( 88 650 
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Section 6.4 Logarithms (page 258) 

1i 8 cay 5 Inz T 2282 — cots 9 iiint+C 18 In$ 

15 ind 17 —In(In 2) 19 In(sinz) +C 21 — In(cos3z) + C 23 d(Inz)?+C 

27 Iny = 4 In(z? +1); # = at 29 @ = sine cosy 

31 qu = ete 33 Iny = e* Ina; & = ye*(Inz + 4) 35 Iny=—1 soy=i,%=0 370 


39 ~i 41 secz 47 .1;.095; .095310179 49 —.01; —.01005; —.010050335 

51 )’Hépital: 1 53 1. 55 3—21in2 57 Rectangular area $+---+2< Abs a =Inn 
59 Maximum at e 610 63 logiy¢ or 5 65 1—2;1+2ln2 

67 Fraction is y = 1 when In(T + 2) —In2 = 1 or T = 2e —2 69 y' = wey? —y=1- ae never equals 1 
71 Inp = 2ln2;LD 27 In2;ED p = e7!"?, p! = In2 et "2 

75 24 = 47-yInzg=alny— ne a are ne decreases after z = e, and the only integers before e are 1 and 2. 


Section 6.5 Separable Equations Including the Logistic Equation (page 266) 
1 Tet — 5 8 (222 +41)1/8 52 Tien ce? 9($+ 9)? Wy. =0;¢= + 
15 z=1+e',yis in 18 17 ct =1n3,ct = 1n9 

19 b= 10-®,c = 13-1079; y,. = 13 - 10%; at y= & (10) gives n} = ct +In 20%, so t = 1900 + 12 = 2001 
21 y* dips down and up (a valley) 23 sc=1=sbrsos=4,r=¢ 
25 v= eof = 40 27 Dividing cy by y+ K > 1 slows down y’ 


dk ck cy 
29 ay — HKD 7 GER Oe 


31 2 = =; multiply e¥/* ¥ = e-+t/K evo/K (2) by K and take the Kth power to reach (19) 


83 ¥ = (3—y)?3 54, =tt gy =2att = 3 

35 Aeb+ D=Aeé+ B+ Dt+t—> D=~-1,B =—-1;y=A+B gives A=1 

37 y — 1 from yo > 0, y > —oo from yo < 0;y — 1 from yo > 0,y — —1 from yo < 0 

39 f csudy — f dt — In(siny) =t+C =t+Inb. Then sin y = de’ stops at 1 when t = In2 


siny 
Section 6.6 Powers Instead of Exponentials (page 276) 
Li-z+S-24- Sltc+ S48 4--- 5 1050.62; 1050.95; 1051.25 
7 14n(=1) + seth (at)? 41-142 9 square of (1+ +)"; set N = 2n 
11 Increases; In(1+ 4) — 4; >0 13 y(3) = 8 15 y(t) = 4(3°) 17 y(t) =t 


19 y(t) = $(3* — 1) 21 (Va!) fax lyst ifa=1 23 yo = 6 25 yo = 3 
a 31 10.38% 33 100(1.1)?° = $673 


27 —2,-10, -26 > —0o0;—-5,-—42,-44 — -12 29 P= 
35 as =965 37 1000(1.120_1)= 57,275 39 y.=1500 41 2;(23)5? = 2.69;e 


1 
4 
43 1.01421? = 1.184 — Visa charges 18.4% 


Section 6.7 Hyperbolic Functions (page 280) 

diet cena, a = 3 sinh 22 7 sinh nz 9 3sinh(3z + 1) 11 sinh = —tanh az sech 
‘ x ° 15 

13 4cosh pane - Ta (sechv'2? + 1 Ps seen eh 

19 cosh(Inz) = 5(2+ =) =latz=1 BA ea as a 23 0,0, 1,00, 00 


25 4 sinh(2z + 1) 27 Lcosh® z 29 In(1 + cosh z) 31 e* 
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33 fydz=f sinh ¢(sinh ¢ dt); A= 3 sinh tcosht — fy dz; A’ = 3;A4=0 att=0s0 A= dt. 

41 M=r4+Va24+1y= Inf +-Va? + 1] AT + In|2+2| 49 sinh™' x (see 41) 51 —sech™*z 
53 41n3;00 55 y(z) = 4 cosh cz; +coshcL — + 

57 y! =y — 3y?; 2 (y')? = dy? — y? is satisfied by y = dsech? = 


CHAPTER 7 TECHNIQUES OF INTEGRATION 


Section 7.1 Integration by Parts (page 287) 
1 ~zcosz+sinz+C 3 —ze“* —e 7 +C 5 2? sinz + 2zcosz—2sinz+C 
7 £(22+ 1)In(2z+1)—-2+C 9 3e7(sinz —cosz)+C 11 atzo (asin ba — beosba) + 0 


1 


i) 


3 (sin(In z) — cos(In z)) + C 15 z(Inz)? —-2z2Inz+22+C 17 zsin-*'2+V1—2?7+C 

19 3 (2? + 1)tan"'c-2+C 21 z>sinz + 3z?cosz — 6zsinz — 6coszt+C 

23 e?(z° — 327+ 62-6)+C 25 ctanz +In(cosz)+C 27 -1 29 —3e-7 414 31 -2 
33 3Inl0-6+2tan'3 85 u=2",v=e* STu=2",v=sine 89u=(Inz)",v=2 


41 u=<csinz,v =e? > fesing dz in 9 and ~ f zcosz e*dz. Then u = —2xcosz,v = e* + f e® cosz dz 

. . . ad . 

in 10 and — fa at z e*dz (move to left side): $(zsinz— xcosz+cosz). Also try u = ze”, v = —cosz. 
43 f dusinu du = 3(sinu—ucosu) = 4(sinz? — 2? cos 2”); odd 


45 3- step function; 3e*- step function 49 0;26(x)| — f 6(x)dz = —1; v(z)6(z)| — f v(x)6(z)dz 
51 v(z) = f f(z)dz 


58 u(z) = ¢ fo v(z)da; 7-(2—% a); 2 for x < 5, ¢(22 — 2? — 4) forz > 3;% fora < 3,4 for a> $. 
55 u=2?,v=—cosz > —2? cosz+(2z)sinz—f2sinzdz | 57 Compare 23 
59 uw'|d - Ais u'w! — ulw]d + fo uw! = [uw! — u’w)d 
61 No mistake: e* cosh z — e* sinhz = 1 is part of the constant C 
Section 7.2 Trigonometric Integrals (page 293) 
aS ote sens dy tea eee 8 isin?s+C 
ees u?(— sel —}cos* z+ 2cos®z— cos’ t+C 7 2(sinz)9/?+C 
9 1 fsin® 22 dz = rel es Bat 5 cos Qaz)+C lle 13 $(32 + 88) 4+C 
15 2+C 1b Loos! zsingz+é es coe z dz; use sinuicn (5) 
19 IC’? cos” ee nol 7)? cos edz = +++ =: Soin-8...t 7) de 
21 1 =-—sin"~' xcosz + (n—1) ee ? zcos* ¢ dz = —sin” ' zcosz+(n—1) f sin”? x dz — (n— 1)I. 
So nI = —sin"~* z cosz+(n—1) fsin®~? z dz. 
23.0,+,0,0,0,—- 25 —2cos*z,0 27 —3 (e822 + s0s2002) g = gg 1 (sin 2002 | sin22) q 


31 —} cos z,0 83 fo zsinz dz = fr Asin? z dz + A=2 35 Sum = zero = 5 (left + right) 
37 pis even 389 p—q is even 41 sect +C 43 sean ee 45 $secor+C 

AT 3 tan®s—tanz+z+C 49 In|sinz|+C 51 Fonts +C 53 A= V2,—-vV2sin(x + £) 
55 4/2 57 1 59 l-cosztsinz 1 ( 61 pand q are 10 and 1 


1+cos z+sin z 
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Section 7.3 Trigonometric Substitutions (page 299) 


12 =2sin6; fd) =sin"'2+C 8 x = 2sind; f 4cos? 6 d§ = 2sin"*£+4+2,/1-2+4+C 
5 z= sind; f sin?@ d§ = isin" 2—42V1—-2?+C 

7 z= tan; [ cos? 6 dé = gtan-t2+3745 +C 

9 x = 5sec 6; f 5(sec? 6 — 1)dd = Va? — 25 —5sec1 £4+C 

11 2 = sec; fcosd dd = “=! + C 18 z= tand; fcosd dd = 


= 86 db __ = 
15 x = 3sec 0; I Pints 9sin? 6 Sane Te Sere g te 


17 x = sec 6; face? dd = $cc Otand +L nfaec# + tané) +0 = gave? —1+3In(z+ V2? -1)+C 
19 z= tand; f <9 4 — +C= =V#t14¢ 


sin? 0 Soe 6 


x 
wes +C 


21 f =sngde —~6+C=-—cos'24+C; with C = © this is sin" z 
23 f tandsec 4? — _In(cos6) + O = In Vz? +14 C which is 4In(z? +1) +C 


25 2 =asin6; {7/7 a cos? 6 dé = on = area of semicircle 27sin-*sJ, = 2-Z=% 

29 Like Example 6: z = sin# with 9 = 5 when z = 00,9 = $ when z= 2, ake sos) -1+ 4a 
n/2 2 n-1 ee 

Si o= Stands [rj Sacto Slin/2 = § $8 [ Sade = fe eo 


35 x = sec; 3(ef teS) = 3(a + Vz? -1+ aa 3 S(n+Va2?-1+2-V2?-1)=2 
a ae z+C 

39 x = cosh 6; f sinh? 6 dO = 4 (sinh 9 cosh 6 — 0) + C = daz? — 1-4 In(z + Vz? -1)+C 
412=tanhdé; f dé =tanh-'x+C 43 (x — 2)? +4 45 (x — 3)? -9 47 (z+ 1)? 


49u=2-2, f ey = jf tan”! $ = } tan (252) + Chu=2-3, f Hs = Fn 453 = fn 2 +C; 
usstif#ed= shite 
5lu=a2+5; Sat uses u = asec 6 a is gen esedtal eee 


58 cos@ is negative (—/1 — 2?) from % to 8%; then Jo - fat a ie V1—a?dz = x = area of unit circle 
55 Divide y by 4, multiply dz by 4, same ie da 


57 No sin”! x for z > 1; the square root is imaginary. All correct with complex numbers. 


Section 7.4 Partial Fractions (page 304) 


i 


1A=-1,B=1,—Ina+h(z—1) +6 s-4-c4 5 ae - acura 


7345 93-58 11-2-34+25, 18-*8+12-12+44 
15 404+ 3 + SP A= -}, B= },0=0,D=-} 
17 Coefficients of y: 0 = —Ab+ B; match constants 1= Ac; A=1,B=28 
19 A= 1, then B=2 and C=1;f 2%, + f Get — 
In(z — 1) + In(x? ee) nese +241) =In(z° - 1) 
21a meri f ea au. — f @ = In(44) + C = In(4) + 
23 u=cos6; f = =-3f @¢- 3S a = $m(1— 4) - bn(i+4)= 21m Feon9 tC. We can reach 
2 jy (e=cono) In 452282 = In(csc @ — cot 0) or a different way $n 7} poh = In 72S, = — In +e? — 


— In{csc # + cot 6) 
25 u=e*;du= e*dz =uda;f Gthidu=fi 2du u 4 f du = —2In(1 — e7) +. ne? +C= —2In(1—e7)+24+C 
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27 2+1=u7,de = 2u du; f *%4 = f[2- 72 -]du=2u—2In(1+u)+C= 
2/z+1—-2h(it+Vz+1)+C 
29 Note Q(a) = 0. Then 375 = aj-o1) ~ aia by definition of derivative. At a double root Q’(a) = 


Section 7.5 Improper Integrals (page 309) 


zn P= an 8 —2(1 — z)*/2]5 = 2 5 tan 2)° a2 =F 7 i(Inz)?]§ = —00 
9 oat — 2]§ = 11 In(In(In z))|78) =0o 18 Ate + sin x cos z)]8° = co 


1 6° diverges for every p! 17 Less than 1 Ba =} 


5 
ore than [> sats ved= — tan- 1a1p - aie = ie 
21 Less than f° e~*dz = }, greater than —} 
28 Less than f° e@dz +e fe (@-W'dz=eF +e fre du=e? + a4 
25 fo sintgda 4 f° oe than aad =2 27 p! = p times (p — 1)!;1 = 1 times 0! 
29 u=2,dv = ze ie os =1/m 81 f° 1000e~-1#dt = —10, 000e~-1*/g° = $10, 000 
83 W = —GMmjo — aye = = imvé if vo = Jer 


35 fy $2 = fo et ds = So l® = as 
87 JZ/? (sec z — tan z)dz = [In(sec z + tan z) + In(cos a)|¢/? = [In(1 + sin a)|a/? = In2. 
The areas under sec z and tan z separately are infinite 89 Only p= 0 


CHAPTER 8 APPLICATIONS OF THE INTEGRAL 
Section 8.1 Areas and Volumes by Slices (page 318) 


1s? ~3=1 gives s = +2; f? alba — 3)|dz = 2 

oe = 2 =9 gives y = +3; ibs 3l9- y?|dy = 36 

5 xt — 22? = 22? gives s = +2 (or z= 0); f? 3[2z? — (a* — 22?)|da = 428 

Ty = 2? = —2? + 182 gives z = 0,9; ito [(—2? + 182) — ve = 243 

9 y =cos z= cos? z when cos z = 1 or 0, = 0 or ¥ or - Jo"? (cos 2 — cos? a)dz=1-—% 
11 e” = e?*~1 gives z = 1; fo le? — e?*—ldz = (e- 1) — (=e ) 

18 Intersections (0, 0), (1,3), (2, 2); Jo (32 — aldz+ SPl4 —2z—a\dz=2 
15 Inside, since 1 — 2? < V/1— 2; f°, [V1 — a? — (1—2?)|dx = E a 
17V = f° nytdz = f° nb?(1- 25 )dx = Anta. around y axis V = 42 

z = 2,y = 0 around y axis gives a circle not in the first football 

19V = fj Qxzsinzdz=2n? 21 fo n(8 — z)*dz = Size; f° 2nx(8 — z)dz = £12" (same cone tipped over) 
28 de n-1?dz— So n(a4)?da = 8; [> aa(1 ~ «')z dx = 2% 
25 fr, 7(3")dz — frig n(2)?de = = > abe, 5 Sys 2023 — Age = = 2s 
27 sf a|(x?/3)? — (2°/?)"|az = = er, f Qnz(22/3 — 23/2)dz = sx * (notice zy symmetry) 
29 2? = R?-y?,V = ie ‘ n(R? — y*)dy = (Rh? — he 
31 f° (2Va? — 2?)?de = 180% = 88 fe (2V1— y)'dy = =2 87 f A(z)dz or in this case f a(y)dy 
39 Ellipse; /1 — 2? tan}; 11 — 2”) tan 6; 2 tan 
41 Half of xr7h; eectshglie 43 i (5? — 2?)\dz = 420 45 f> (4? — 1?)dz = 300 


5. rotating 
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47 fo x((b — y)? — a?)dy = (6° — 3a7b + 2a°) 49 Kr x(3 — 2)?da; i, 2ry(2)dy + im 2ry(3 — y)dy 
51 [?x(%)2dy= =") ss 960ncm 558 57 2 

59 2x = 1 ff 2ny(2— /y)dy= 2 63 3me 65 Height 1; [\ 2nz dz = xa; cylinder 

67 Length of hole is 2/6? — a? = 2, so 6? —a? = 1 and volume is “* 69 F; T(?); F; T 


Section 8.2 Length of a Plane Curve (page 324) 


1 fo fit (32/2)2dx = S[(42)9/? — 1] = GS-8 8 f) 14 aa? + Dds = fp (1+ 2?)dz = $ 
B fp i+ (2? — da)Pde = fo (e? + dale = @ 

T ft fis (2 = hada = fi (atl? + ta 7)dz = 

9 Jo? Vocos* t sin?t + 9sin’t cos? t dt = ce 3costsint dt = 3 
11 f7/? \/sin? ¢ + (1 —cost)?dt = f"/? /2— 2cost dt = "129 int dt =4-2V2 
18 f) V+ 2¢+1dt=fo(t+i)dt=2 15 f" Vi+cos?zdx=3.820 17 ff Vit dds = 2.003 


19 Graphs are flat toward (1,0) then steep up to (1,1); limiting length is 2 


21 92 — V36sin? 3¢ + 36cos?3t=6 28 f) 26 dy = V26 


25 £ Vile — evr + 4 dy = 2, 2(% +e7¥)dy = 2(e¥ ~ e“¥) 2}, =e — BL 
Using x = cosh y this is J V1 + sinh? y dy = f cosh y dy = sinh y|!, = 2sinh1 
27 Ellipse; two y’s for the same z 29 Carpet length 2 4 straight distance /2 
31 (ds)? = (dz)? + (dy)? + (dz)?; ds = y/ (42)? + (4)? + (42)2dt; 
ds = Vsin’ t + cos? t + 1dt = V2 dean curve = helix, shadow = circle 
88 L= fs V1 + 4x2dz; fe V1+ 22dz = i V1 + 4u? 2du = 21; stretch zy plane by 2 (y = x? becomes ¥ = (2)?) 


Section 8.3 Area of a Surface of Revolution (page 327) 


1 fy 2n/z,/1+ (334)?da = fo tny/2+ idx = 49% 3 2 f> 2n(72) 50d = 147/50 
5 ft, 2eV4—a7\/14 sde= fi 4ade=8e 7 fo dxay/1 + (22)2dx = E(1 + 40?)9/2)2 = £[179/2 — 1] 


9 ih QnaV/2de = 9nV/2 11 Figure shows radius s times angle # = arc 27R 
18 2xrAs = x(R + R')(s — s') = rRs — rR's' because R's — Rs' =0 
15 Radius a, center at (0, 6); (42)? + (44)? = a?, surface area ii 2n(b + asint)a dt = 4x7ab 


17 ii 2nay/1+ Gash de = fr 3peds, = x? + 2m (write 22 — 2? = 1— (x — 1)? and set x — 1 = sin 8) 
19 ine 2na,/1+ A;dz (can be done) 
21 Surface area = f° 2n24/1+ Adz > f°? 2842 = 2x In 2]? = co but volume = fr° x(2)?dz = x 


28 Sc 2m sintV2sin’ t + cos? ¢t dt = So 2x sin tV2 — cos? t dt =f 2nV/2—utdu = 
muv2 — u? + 2msin* ee =2a+77 


Section 8.4 Probability and Calculus (page 334) 
1 P(X <4)=2,P(X =4) = %,P(X>4)= 3 f>° p(z)da is not 1; p(x) is negative for large z 
1.01 


5 fo° e*dz = 4; f.” e~*dx x (.01)2 ly Tira ei ct se Sa 
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Qpa=Zlt yA -+e-TH=4 UL SS TA =F n(1 + 27)|§ = +00 
13 [5° aze~** dz = [—ze~** 9° + fo” edz = 4 
15 ff Bites = Bean“; [Zenda = 1 e-F fo aed = AeA AT J er ANDdg m= e280 wt 


19 Exponential nes than ome 60 years — So° O1e—91* dg = 1 — e~ 6 = 45 
2ly= = three areas © 5 1 each because y —o to pu is the same as yx to w+o and areas add to 1 
23 —2u f xp(z)dz+ yp? f p(s)de = -2n- pt p?-1= —p? 


25 w=O0-54+1- ee. 2 =1; o? = (0-1)?-24+ (1-1)? 34 (2-1)?-3 = 2. 
Also 35 n?pp, — py? Sy te Hee z-1=2 

27 p= Jo ze */*da —*Pas =2;1- ahd =e des : +4 le: = ? 

29 Standard deviation (yes — no re pone =5 7008 = a Poll showed 870 28 peaceful. 
95% confidence interval is from 22 — 4 an 38 + a0 3, or "03% to 100% ecul 


31 95% confidence of unfair if more than 2g = Tits = 2% away from 50% heads. 
2% of 2500 = 50. So unfair if more than 1300 or less than 1200. 

33 55 is 1.50 below the mean, and the area up to p — 1.5a is about 8% so 24 students fail. 
A grade of 57 is 1.30 below the mean and the area up to pt — 1.30 is about 10%. 


1000 _ 1000 1 1 

35 .999; 999000 = (1 — +4.) 1 because (1— 4)” — 3. 

Section 8.5 Masses and Moments (page 340) 

1g= Sz=4 SF=88 Te=2=y sea a g=Ue=y7 z= E,y= Hs 
UGe=L=7 21I= deegae AER Le =-2fzpdz+2tfpdx=0fort=% 
23 South Dakota 25 2n7a7b 27 M, =0, My = 29 2 31 Moment 

33 I = > mar2; $  mar2w2;0 35 14nt; jared 33 


37 2; solid ball, solid cylinder, hallow ball, hollow cylinder 39 No 


41 Tw V1+4+J by Problem 40 so T & V1.4, V1.5, \/5/3, V2 


Section 8.6 Force, Work, and Energy (page 346) 


1 2.4 ft Ib; 2.424... ft Ib 3 24000 lb/ft; 833 ft lb 5 10z ft Ib; 102 ft Ib 7 25000 ft Ib; 20000 ft Ib 
9 864,000 Nkm 115.6-107 Nkm 13 k = 10 lb/ft; W = 25 ftlb 15 f 60wh dh = 48000w, 12000w 
17 }wAH?; 3wAH? 19 9600w 21 (1- v5) 3/2 23 (800) (9800) kg 25 + force 


CHAPTER 9 POLAR COORDINATES AND COMPLEX NUMBERS 
Section 9.1 Polar Coordinates (page 350) 


1 (1,2 3 (2, = 5 (1/2, 57) 7 (0,2) 9 (/10, 10) 11(V3,-1) 132/72 
15 \/r? + R?2 — 2rRcos(6 — ¢) 

ITO<r<0,-2<0< 0K r<o,ms 6 < Wm V4<r< V5,059< 20,05 1r<0,-E<SO< 
19 y = stané,r = zsec 8 21¢=f,allrjyr= 


® (a 


1 é _ . 
sin O+coa6?" — cos @ + sin 8 


23 o2+y?=y 25 c=rsin cos, y =rsin? 6,2? + y? = y 


arty =aty,(s— 3)? +(y- GP = (2) 29 2= ao Y = cavtams 91 (2? +97)? = 2 


A-16 Answers to Odd-Numbered Problems 


Section 9.2 Polar Equations and Graphs (page 355) 


1 Line y= 1 8 Circle z? + y? = 2z 5 Ellipse 327 + 4y? = 1— 22 7 x,y,r symmetries 

92 are: only 11 No symmetry 18 x, y,r symmetries! 
15 2? + y? = 6y + 82 — (x — 4)? + (y— Boe = 5%, center (4,3) 17 (2,0), (0,0) 
1w9r=1- 9 = 8 r=142 9-500) 2lr=2,9=4 Bi ti £47 
28 (x,y) = (i, 1) “os r = cos 56 has 5 petals 27 (2? +y?— 2)? =a? + y? 
29 (x? + y?)> = (z?-y")? 31 cos? = —3, sind = v8 y= aV3 = -~2 332=4,r=-8 35.967 
Section 9.3 Slope, Length, and Area for Polar Curves (page 359) 

1 Area 5 3 Area 5 Area = 7 Area = — 4 9 frei g(3 cos? 8 a {2+¢086)") gg = 
11 Area 87 13 Only allow r? > 0, then 4{o"* ds 26 d6=1 152+ 4 

17 6 = 0; left pone r= 0= = +28 gg = -iy= = +3 

19 rs = 40,000; ¢[rvr? + c? + c? In(r + Vr? + c?)]* = 40, 000.001 
21 tan = tand 23 z = 0,y =1 is on limacon but not circle 25 41n(2a+WV1+ 4m?) + mV1 + 41? 
27 32 29 4 (base)(height) ~ 5(rAé)r 3142/2 = 88 2n(2 - V2) 35 & 39 sec 0 


Section 9.4 Complex Numbers (page 364) 

1 Sum = 4, product = 5 5 Angles 2 roe az on 7 Real axis; imaginary axis; i axis z > 0; unit circle 
9ced=54+101,5= ae 11 2cos§,1;-1,1 13Sum=0, product = ~1 15 r*e4*? sheo*, Ae 40 

17 Evenly spaced on circle around origin 19 e*t e*# 21 ete? €® 23 cos 7t, sin 7t 

29t= — 28, y= —e7/V8 31 F; T; at most 2; Re c<0 33 +e7*? 2 = + cos6,y= —F sin 9; tne" /? 
CHAPTER 10 INFINITE SERIES 

Section 10.1 The Geometric Series (page 373) 

1 Subtraction leaves G — xG = 1 or G = ;4+, 34; 2; 48534 52-14+3-20+4-32?+---= G25 


7 .142857 repeats because the next step divides 7 into 1 again 

9 If q (prime, not 2 or 5) divides 10” —10™” then it divides 10N-™ —1 11 This decimal does not repeat 
18 81,2220 15 545 75 86197 acb 23 tan~}(tanz) =z 

25 (ltct+2?+2°--)(1-ct+a2?-2°.--)=14+22?+24+.--- 

27 2(.1234...) is 2-35 > Got = Hi 0123...is1-pooqray = sr 489 29S qo = 1 
31-In(i-.1)=-In.9 33 41n42 seal sTy= 

39 All products like a,b2 are missed; (1+1)(1+1)4#1+1 41 Take z = 3 in (13): In 3 = 1.0986 


43 In 3 seconds the ball goes 78 feet 45 tanz = 2; (18) is slower with z = 2 


Answers to Odd-Numbered Problems A-17 
Section 10.2 Convergence Tests: Positive Series (page 380) 


1 3424... is smaller than 1+ $+--- 


3 Qn = 8n — Sn—1 = aoa 8 = lj an = 4,8 = ja, = n 2H = In 2-9) = ape = In? 
5 No decision on y b, 7 Diverges: 735(1+3+---) ‘9 > Toon? ssipanas: Diss z7 is larger 
11 Converges: Lys islarger § 18 Diverges: 7 is smaller 15 Diverges: x , is smaller 


17 Converges: > + ga is larger 19 Converges: > x is larger 21 L=0 23 — 0 25 L=5 
27 root (2=4)" + L= 4 a s= he on survivor) $1 If y decreases, 59 y(t) < fT y(z)dz < rr" y(t) 


33 pe et< yn e "dz = 1;2 Lis ste oa 35 Converges faster than f at ect 
87 Diverges because f.° 54% = Lin(2? + ye =0o 89 Diverges because f° 2° "dz = 27]? = 00 


41 Converges (geometric) because f,°(£)*dz < 00 48 (b) ["** 42 > (base 1) (height — ) 

45 After adding we have 1+ 4 +---+ 4 (close to In 2n); thus originally close to In 2n — In n = In 2# = In2 
47 i bee dz — 1 __l_ = 009 49 Comparison test: tt < a,; if a, = an then sin aa = = 0 but a, = 00 
51 a, = n-8/2 53 a,= 2 55 Ratios are Ma 4,1,4,--+ (no limit LZ); (fe)1/2* = Vai ; yes 

57 Root test ie —+L=0 59 Root test L= 15 61 Divergence: N terms add to In uta — co 

63 Diverge (compare )>+4) 65 Root test L= 2 67 Beyond some point $* *<1lor dp, Ze bn 


Section 10.3 Convergence Tests: All Series (page 384) 


1 Terms don’t approach zero 3 Absolutely 5 Conditionally not absolutely 7 No convergence 

9 Absolutely 11 Noconvergence 13 By comparison with >> |a,| 
15 Even sums b+ i + a+ - +» diverge; a,,’s are not decreasing 17 (b) If a, > 0 then sy is too large so s— 8, < 0 
19 s=1- i; below by less than 4 

a Ss : : . 
21 Subtract 204+ 3+---)=32(4+A+4+---) = 3% from positive series to get alternating series 
23 Text proves: If }>|a,| converges so does )> an 
25 New series = (4) -$+(2)-}---=4(1-3443-4---) 27 3ln2: add In 2 series to } (In 2 series) 
29 Terms alternate and decrease to zero; partial sums are 1 + Ry +h —Inn>y 
31 .5403? 38 Hint + comparison test 85 Partial sums a, — ap; sum —ao if a, — 0 
37 iePice res but product is not 1+ 2+-- 
39 Write z to base 2, as in 1. seal which keeps 1+ § 1 and deletes 4 re i, . 
41i er a +--+ adds to Bs = 7 and can’t eee m4 
43 5 “sink = cot 4 (trig identity) = tan (% — $);s = ) < = —log(1—e*) by 10a in Section 10.1; 
take i es part 


Section 10.4 The Taylor Series for e”,sinz, and cosx (page 390) 


11+22+ G2)" +...; derivatives 2"51+2+ 2 4--: 3 Derivatives 17;1+%2+--- 
5 Derivatives 2"n!;1+ 22+ 427 +--- 7 Derivatives —(n — 1)!;-—z — a. _ ee ee 
9 y=2-e* =1-2-#-... Ily=2-®4+---=sing Ba Ge eae 
15 atid eh whe ig ted 17 —(z —1)§ 19 1-(z—1)+ (z-1)?- 
-1)? -1)° - 
21 (e-1)- Gye _. = In(1 + (2—1)) 23 ertel-t merit (2-1) + GP 4 


25 2+ 227 + 228 orbs 4 at 22-f +2 S1it¢2?+% $31+2-2 


A-18 Answers to Odd-Numbered Problems 


35 co slope; 1+2(2-1) s72-S +2 s9e4+54+2 arite+S 4314+02-2? 
16 —16 iY -i 5 

45 cos§ = “+2 sing = £ “ze ° 47 99th powers —1,-t, 24/4, 4 

49 e~*"/3 and ~1; sum zero, product —1 5315 ,15 + 2m 55 2e* 


Section 10.5 Power Series (page 395) 


114424 (42)? +---;r=$;2= 4 $e(1—-z+%—--);r=00 
5 Ine + In(1+ 2) = 14+ 2 - 3(2)? +---;r=e;2 =—e 

7 |23*| < 1 or (-1,3); 5% jz — al < 1;—In(1— (z — a)) 
111+2 +4 +--+; add tol atz=0 18 a ,43,--- are all zero ig) 


© 


17 f'8)(c) = cose < 1; ahaa terms might not decrease (as required) 

n+l nth 
19 f=; roa |Ral $ ae Ra = 2-=5;(1-¢)* =1~ 3 

n+1 
21 flrtt)(g) = ae |R,| < ayet (siz) — 0 when z= 3 andl-—ec> 3 
28 Ro = f(z) — f(a) — f'(a)(x — a) — $f" (a)(x — a)? so Ro = Ry = RY =0 at z=, Ry’ = f"; 
Generalized Mean Value Theorem in 3.8 gives a<c <cg <cy <2 

251+ 327+ 3(2?)? 27 (-1)";(-1)"(n + 1) 
29 (a) one friend k times, the other n — k times, O< k <n; 21 ts (16 — 1)1/4 1.968 
35 (1+ .1)h! = 1(1.1)(.1) + SRY)? w 11105 3714 4 Sp HE gyda? + 825+ Sat 
43z°—iet+ 20% 451424 32422 9 47.2727 pier Ca ines 4 §1r=1,r=£-1 


CHAPTER 11 VECTORS AND MATRICES 
Section 11.1 Vectors and Dot Products (page 405) 


1 (0,0, 0); (5,5, 5); 3; —3;cos@ = —1 $ 2i— j — k;-i — 7j + 8k; 6; —3;cos# = —4 
5 (v2, —v1); (v2, —v1, 0), (vs, 0, —v1) 7 (0,0);(0,0,0) 9 Cosine of 6; = cjéetion of w on v 
11 F;T;F 18 Zero; sum = 10 o’clock vector; sum = 8 o’clock vector times Lvs 
15 45° 17 Circle 2? + y* = 4;(z — 1)? + y? = 4; vertical line z = 2; half-line z > 0 
19v = —3i+ 2j,w Beane Ww 21d=-6;C =i-2j+k 
23cos 9 = qi 0088 = Fi 0088 = =i 25A-(A+B)=1+A-B=1+B-A=B-(A+B); equilateral, 60° 
27a=A-1b=A-J 29 aa t) and (—sint, cost); (cos 2t, sin 2t) and (—2sin 2t, 2 cos 2t) 
31C=A+B,D=A-B;C-D=A-A+B-A-A-B-B-B=r*-?r*?=0 
33 U+ V — W = (2,5,8), U— V + W = (0, -1,—2), -U+V + W = (4,3, 6) 
35c and Va? + 6?;b/a and Va? + 6? + c? 
37M, = 3A+C,M2 =A+3B,M;3 = B+ 5C;Mi +M2+Msz = 3(A+B+C) = 
398 <3-3;2y/ey<ety 41 Cancel a?c? and b?d?; then b?c? + a?d? > 2abed because (bc — ad)? > 0 
43 F; T; T; F 45 all 2\/2;cos @ = -} 


Section 11.2 Planes and Projections (page 414) 


1(0,0,0) and (2,—1,0);N = (1,2, 3) 3 (0,5,6) and (0,6,7); N = (1,0,0) 
5 (1,1,1) and (1,2,2); N = (1,1,-1) Tet+ty=3 QOr+2y+z2=2 


Answers to Odd-Numbered Problems A-19 


11 Parallel if N - V = 0; perpendicular if V = multiple of N 

13 i+j+k (vector between points) is not perpendicular to N; V - N is not zero; plane through first three 
istt+y+z2=1;2+y-—z=3 succeeds; right side must be zero 

15 az + by + ez = 0; a(% — xo) + b(y — yo) + c(z — 20) =0 17 cos 6 = fo V1 

19 ZA has length 4 21 P = 3A has length 3|A| 23 P = —A has length |A| 25P=0 

27 Projection on A = (1,2,2) has length gS; force down is 4; mass moves in the direction of F 


29 |Plnin = iN = distance from plane to origin 31 Distances a and a both reached at (3, 3,—$) 


S8it+j+kt=-$5(3,-3,-shi 


35 Same N = (2, —2, 1); for example Q = (0,0, 1); then Q + 2N = (4, —$, 44) is on second plane; 2|N| = 2 
37 3i + 4j; (3t, 4t) is on the line if 3(3t) + 4(4t) = 10 or t = 22; P = (38, 42), |P| =2 
39 22 + 2(12 — 22)(—3) = 0 so x = 32 = $532 + 4y = 10 gives y= # 


41 Use equations (8) and (9) with N = (a,b) and Q = (21,1) 43t= ABB onto A 
45aVL= aL; - $Lr1j;aVF = $Lir + £Lyir 
47V -L; = 2-1;V-Ly = —-3-1, V- Lr; = —3 — 2; thus V- 2i = 1,V - (i- V3j) = —4, and V = si+ 3/3; 


Section 11.3 Cross Products and Determinants (page 423) 


10 8 3i — 2j — 3k 5 —2i + 3j — 5k 727i+12j-—17k 
9A perpendicular to B; A, B, C mutually perpendicular 11|AxB| = /2,AxB=j-k 13AxB=0 
15 |A x B|? = (a? + a2)(b? + 62) — (a,b) + agbz)? = (a,b2 — agb,)?; A x B = (a1b2 — agb)k 
17F; T; F; T 19N = (2,1, 0) or 21+j 21lz-—yt+z=2s0oN=i-j+k 
23 [(1, 2,1) — (2,1, 1)] x [(1,1,2) — (2,1, )) =N=i+j+kzt+y+z2=4 
25 (1,1,1) x (a,b,c) = N = (c — b)i+ (a — c)j + (b — a)k; points on a line if a = b = c (many planes) 
27 N =i+j, plane z+ y = constant 29 N =k, plane z = constant 


zy 2 
31/1 1 0|/=2-y+z2=0 88i—3j;-i4+3j;-31-j 35 ~1,4,-9 
12 1 
ag a3] _ a, a3 a, ag 
39 +c, bs by c2 by by |+23| by bs 


41 area? = (Jab)? + (fac)? + ($c)? = ($|A x BI)? when A = ai — 8j, B = ai — ck 

43 A = $(2-1—(-1)1) = 3; fourth corner can be (3,3) 

45 ai + aaj and bi + bj; |aybo ae a2bi|;A xB=.---+ (aib2 = a2b1)k 

47 A x B; from Eq. (6), (A x B) x i = —(a361 — a1b3)k + (a162 — a2b1)j;(A-i)B-(B-ijA = 
a1 (b1i + boj + bgk) — b1(a1i + aoj + agk) 

49N = (Q- P)x(R-P)=i+j+k; area $V3j;2+y+z=2 


Section 11.4 Matrices and Linear Equations (page 433) 


seceacnoea[S]oe[t]s2[ 1] seeaveu[t]-a[1][ 3] oe 


52 = 2y,y = anything, D = 0,2y ; +u : = : 7 no solution, D=0 
8 -1 3 8 
ar tk — —24 _ = A 8 0 


A-20 Answers to Odd-Numbered Problems 

15 ad— be = -2 80 AT} = : ne 

19 Multiples of each other; in the same direction as the columns; infinite 

21 dy = .34,d2.=4.91 283 962+ .02y = .58, 042 + .98y = 4.92;D = 94,2 = .5,y=5 

25 a= 1 gives any x = —y;a = —1 gives any r= y 

27 D=-2,A-1=-} 3 ES ;D = -8,(2A)~ = $471; D= +, (A+)! = original A; 
D = —2 (not +2), (-A)-' = -A';D=1,l-1 =I] 


7 5 5 11 3.5 4 2 
294B=| | Ba=[§ 5 | 2e=| 3 5 |.ca=[3 | 


ae+bg oc aecf +aedh + bgcf + bgdh 


17 Are parallel; multiple; the same; infinite 


= (ad — be)(eh — fg) 


31 AB= ce+dg cf+dh ? —afce—afdg cae bhee — bhdg 


= Oe 
83 Al = . Q B= a : | 85 Identity; B-1A7? 37 Perpendicular; u = v x w 
2 


39 Line 4 + ¢, errors —1,2,—1 41 d,; — 2d2+d3=0 43 A~+ can’t multiply O and produce u 


Section 11.5 Linear Algebra (page 443) 
1 2 2 x 0 1 -1 0 x 0 
1/23 5//y {= |0 3/0 1 -1|/|y |=] 0 
0 2 2 z 8 1 0 -1 z 1 
1 0 0 
5 det A = 0, add 3 equations > 0 = 1 75at+ib+0c=d,A+=|0 $ 0 
0 0 3 
9bxc;a-bx c= 0; determinant is zero 11 6, 2, 0; product of diagonal entries ° 
1-2 4 o 2 -$ 
18 A-+= | 0 z -1},B-1=|0 -3 1 15 Zero; same plane; D is zero 
0 0 3 1 0 0 
8 4 1 
17 d = (1, -1,0); u = (1, 0,0) or (7,3,1) 19 AB= | 40 26 O |, det AB = 12 = (det A) times (det B) 
18 12 0O 
2 3-3 
21A+C=] -1 4 = 6 |, det(A+C) is not det A + det C 
0 -1 6 


28 p= (2)(3)- (0(6) =1,q= L484 (0)(0) = 2 25 (A~+)—? is always A 
27 —1,-1,1,1,;(y, 2, z), (z, y, 2), (y, 2, 2), (z, z,y) 29 2!= 2,4! = 24 
312z2= y= —$3,2=3;2= Ly= Te -} 

33 New second equation 3z = 0 doesn’t contain y; exchange with third equation; there is a solution 

85 Pivots 1,2,4, D = 8; pivots 1,—1,2, D = —2 87 a12 = 1,421 = 0, > ay;b,, = row i, column k in AB 
39 411422 — @12@21 #0; D=0 


CHAPTER 12 MOTION ALONG A CURVE 
Section 12.1 The Position Vector (page 452) 


1 v(1) =i+ 3j; speed /10; 3 wy - quid = 0st ; tangent to circle is perpendicular to = cost 


5Bv=ei-etj=i-jyy-1=—-(z—-1);zy=1 


Answers to Odd-Numbered Problems A-21 


= (1, 2,4) + (4,3, 0)¢;R = (1, 2,4) + (8, 6, 0)t; R = (5, 5,4) + (8,6, O)é 

9R= (2+t,3,4-%);R = (2+ 6, 3 4—#); the same line 

11 Line; y = 2+ 2t,2 = 2+ 3t;y = 2+ 4t,2 = 2+ 6t 

18 Line; /36+9+ 4 = 7;(6,3,2); line segment 15 251; 2 17 cs=ty=mt+b 

19 v =i- 4j,|v| = V1+¢-4, T = v/|v|;v = (cost — teint)i + (sint + tcost)j; |v| = V1 + t?; 
T =v/|v|;v =i+ 2j + 2k, |v| = 3, T = iv 

21 R= —sint i+ cost j+ any Ro; same R plus any wt 

23 v = (1—sint)i+ (1 —cos¢)j;|v| = V3 — 2sint — 2cost, |v|min = V3 — 2V2, |Vimax = V3 + 2V2; 
SR ee ain Cee 

25 Leaves at (2, ¥2);v v = (-v2,V2);R (2, 2) + v(t - £) 

27 R = cos “ai + sin oj + ek 

29 v = sec? ti + sect tant j; |v| = sec? tV/1 + sin” t;a = 2sec? ttant i+ (sec? t + sect tan? t) j; 
curve is y? — 2? = 1; hyperbola has asymptote y = z 

$1 If T = v then |v| = 1; line R = ti or helix in Problem 27 


_ (2t, 0) O<t<} (3-2¢,1) 1<t¢< 2% 
$8 (2(),4)) = Cy oe—1) b<t<i (04-2) 2<t<2 
35 a(t) = 4cos£, y(t) = 4sin£ 37 F; F; T; T; F 39 4 =tané but 4 # tant 


41 v and w; v and w and 4; v and w, v and w and u; not zero 
43 u = (8,3, 2); projection perpendicular to v = (1,2, 2) is (6, -1,—-2) which has length /41 
45 «= G(t),y = F(t);y = 2?/°;t = 1 and t = —1 give the same z so they would give the same y;y = G(F71(z)) 


Section 12.2 Plane Motion: Projectiles and Cycloids (page 457) 


1 (a) T = 16/gsec, R = 128V8/g ft,¥ = 32/gft (b) 19v2; 128VS 96 () 92,9,128 §=— gg = 1.2 or 83.5 


5y=2-42? =Oatc=2)y=ztanz—- %(-4)? = bean Cerne: 
9 up & 11.2, tana s 4.32 11 u = J/gk = V980 mae larger 13 v2 /2g = 40 meters 
15 Multiply R and H by 4; dR = 2u2 cos 2ada/g,dH = vg sinacos a da/g 


ae 144 = ances elite d) Ottsin 6 
Wt= 12/2 sec; y= 12 -— ae w —2.1 m; + 2.1m 19 T= Te LE 


21 Top of circle 25 ca(1—cos@),casin@;@ = m, = 27 After 0 =: 2 = na+ vot and y = 2a— gt? 29 2;3 
$1 Stna’ | 57243 83 =cos#+6sin§,y=sinf—Ocosd 35 (a= 4) 6x 


37 y = 2sin@ — sin 20 = 2sin 6(1 — cos 6); 2? + y? = 4(1 — cos 6)?;r = 2(1 — cos 6) 


Section 12.3 Curvature and Normal Vector (page 463) 


1 asap $3 50 (line) 7 asa 9 (—sin ¢?, cos ¢”); (— One — sin ¢?) 


11 (cost, sint);(—sint,—cost) 18 (—2sint, 2 cost, 4);|v| =5,« = 8 longer; tan = 4 


15 a ialr=eond 17K=3,N=i 19 (0,0); (—3,0) with 4 = 4; (—1,2) with 2 1 ay 
21 Radius +, center (1,4,/4,-1)forn<1 283U-V' 25 jg (sin t i — cost j +k) 273 
29 N in the plane, B = k, r =0 31 fees sees seine $5a=7teT+ 241 N 


STa= AGT + ylegN 89 [F? + 2(F")? — FFM /(F? + F2)9/2 
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Section 12.4 Polar Coordinates and Planetary Motion (page 468) 


1j,-i;i+j=u,—ue  $(2,-1);(1,2) 5v = 3e?(u, + ug) = 3e9(cos3 — sin 3)i + 3e°(sin 3 + cos 3)j 
ee = —20sin 5t i+ 20cos 5¢t j = 20 T = 20 ug; a = —100 cos 5¢ i — 100sin 5¢ j = 100 N = —100 u, 


9 roe + 24 a= a a Paes #) 11 ae = .0004 radians/sec; h = r? = = 40,000 
13 mR x a;torque 15 77/9(GM/4n?)/5 17. 40703 /T?G 9 _4nt(1s0)1077 be 


9 (3651)7(24)(3600)?(6. 67)10—11 
23 Use Problem 15 2ate= aip.a-c= Bqp solve for aa 
27 Kepler measures area from focus (sun) 29 Line; r= 1 
$3 r= 20 — 24,0 = 2% vy = —2u, + (20 — 2t)2%ug; a = (2¢ — 20)(24)?u, — 4(22)ug; fo ° [v|de 


CHAPTER 13. PARTIAL DERIVATIVES 


Section 13.1 Surfaces and Level Curves (page 475) 


$3 z derivatives co, —1,—2,—4e~* (flattest) 5 Straight lines 7 Logarithm curves 

9 Parabolas 11 No: f = (z+ y)” or (az+ by)” or any function of az + by 13 f(z,y) =1-2?-y? 
15 Saddle 17 Ellipses 42? + y? = c? 19 Ellipses 52? + y? = c? + 4cxz + 2? 
21 Straight lines not reaching (1,2) 23 Center (1,1); f= 2? +y?-1 25 Four, three, planes, spheres 
27 Less than 1, equal to 1, greater than 1 29 Parallel lines, hyperbolas, parabolas 
81 4 : 482 — 3x” = 0, x = 16 hours 33 Plane; planes; 4 left and 3 right (3 pairs) 


Section 13.2 Partial Derivatives (page 479) 


2. 2 3.2 93 —2y ._ 2 ~2 
134 2xy?; -1+ 2yz 8 3z%y? — 22; 22°y — e¥ 5 Guy) Gu? 7 rey we 
9ataiaty Nays ae 18 2,3,4 15 6(z + ty), 61(z + cy), -6(z + zy) 


z7+y*7? 27+y 
17 (f =2)fee = aero s fey = ca Stu Fy = ay" 2" 19 —a* cos axcos by, absin arsin by, —b? cos ax cos by 
21 Omit line z = y; ‘all positive numbers; f, = —2(z — y)~°, fy = 2(z— y) “> 
28 Omit z = ¢; all numbers; =, -4,, eH, (ea? 
252>0,t>0and z=0,t> 1 and oa. | —2,:+-,t=e,e7,---3 fe = (Int)z!™*-!, f, = (Inz)t™ 27 
27y, 2, f=G(z)+H(y) 29 9L = Hy (zy) = yo(zy) 


81 frrz = 6y*, fuyy = 62°, fesy = = poe = ee = 1829"; fyye = = fyzy = = Jayy = = 1827 y 

33 g(y) = Aecs/7 35 g(y) = AecY/? + Be~cv/? 

37 fe = —2f, for = fyy = —e~ 7 sinzsin y;e *** sin 2z sin 3y 

39 sin(z + t) moves left 41 sin(x — ct), cos(z + ct),e*~* 

43 (B — A)h,(C*) = (B — A)[f,(b, C*) — fy (a, C*)] = (B — A) (6 — 2) fyz(c*, C*); continuous fzy and fyz 
45 y converges to 6; inside and stay inside; d, = ./(zn — a)? + (yn — 6)? — zero; dy <eforn > N 


47 €, less than 6 49 f(a,b); 4; or Gauge 51 (0,0) = 1; f(0, 0) = 1; not defined for z <0 


Answers to Odd-Numbered Problems A-23 
Section 13.3 Tangent Planes and Linear Approximations (page 488) 


lz-1l=y-1;N=j-k 3z—2= (2-6) — 2(y—3);N= - 2j-k 
5 2(2— 1) +4(y — 2) +2(2 — 1) =0;N = 21+ 4 +2 ee N=i-k 
9 ae _ 2zo(z — 20) — 2x%0(x — zo) — 2yoly — yo) = 0; (0,0, 0) satisfies this equation because 


22 — 22 — y2 = 0 on the surface; cosé = Nik = Vanite A (surface is the 45° cone) 
11 dz = 3dz — 2dy for both; dz = 0 for both; Az = 0 for 32 — 2y, Az = .00029 for z*/y?; tangent plane 
18 z= 29 + F,t; plane 6(z — 4) + 12(y — 2) + 8(z — 3) = 0; normal line z = 4 + 6t, y = 2+ 12t,z = 3+ 8t 
15 Tangent plane 4(z — 2) + 2(y — 1) + 4(z — 2) = 0; normal line z = 2 + 4t, y = 1+ 2t,2z = 2+ 4¢; (0,0, 0) 
att=—3 
17 dw = yodz + tody; product rule; Aw — dw = (z — 20) (y — yo) 
19 dI = 4000dR + .08dP; aP = $100; I = (.78)(4100) = $319.80 


21 Increase = 2¢ — 28 = 33,, decrease = 2 - 2% = wiidA =F Adz — dy; 3 23 Ad = ae 

25 Q increases; Q, = — 250 nH=5 =5 P, =-—.2Q, = 50, Py = —.2Q: = 2;Q = 50 — 289(s — 4) - 8(¢— 10) 
ree ; 2 tae P, = ~Q, = Qe +Q=Qs +40. wes ip = 4 

27 s=1,t = 10 gives Q = 40: Po = 26.560, 41=0.41 39, = —20,Q, = —5,P, = 20,R = 

ae ee nes 2y—1);2=1,y=4,2z=0 

$1 Az =—3,Ay= 3321 = toy =—3; meer) », 


88 3a7Az— Ay=-a— a gives Ay = —Az = ats; lemon starts at (1/./3, —1/V3) 
aoe + 3a7Ay =a+a° 

35 If 2° = y then y® = 2°. Then 2° = z only if z = 0 or 1 or —1 (or complex number) 

87 Az = —29 + 1, Ay = —yo + 2, (21, y1) = (1,2) = solution 


2 = 0 = ares 
s9G=H=77 41J= : ev Ag = —1+e7*, Ay = -1— (tn, —1+e7**)e"™ 
43 (21, x1) = (0, 9), (- aia) Gi 0) 

Section 13.4 Directional Derivatives and Gradients (page 495) 


1 grad f = 2ai — 2yj, Duf = V3z— y, Duf (P) = V3 
8 grad f = e* cosy i—e*siny j, Duf = —e*siny, Duf(P) = — 
= Ja? + (y— 3), grad f = Fi+ 485,Duf =$,Duf(P)= tg — Tgrad f= Ptr i+ i 
9 grad f = 6zi+ 4yj = 61+ 8j = steepest direction at P; level direction —8i + 6j is perpendicular; 10, 0 
11 T; F (grad f is a vector); F;T 138u= (Ta Ta): Duf = Va? + b? 
15 grad f = (e*~¥,—e7-¥) = (e~*, -e~*) at Pju= (Jes =a), Duf = V2e7! 
17 grad f = O at maximum; level curve is one poms 19 N = (-1,1, —1), U = (-1, 1,2), L = (1, 1,0) 
21 Direction —U = (—2,0, —4) 23 -U = (=== jean aot —— is 
25 f = (z+ 2y) and (= + 2y)?;i + 2j; straight lines s + 2y = constant (perpendicular to i+ 2) 
27 grad f = +(Jz, 2); grad g = +(2V5, v5), f = (Fp — 7h) + C,g = ol +C 
29 @ = constant along ray in direction u = sit), grad 9 = ayltal et, ;u- grad 6 = 
31 U = (fz, fy, f2 + £2) = (-1,—2,5); -U = (—1, —2, 5); tangent at aie ae (2,1,6) 
33 grad f toward 2i+j at P,j at Q,—2i+j at R; (2,3) and (24, 2); largest upper left, smallest lower right; 
Zmax > 932 goes from 2 to 8 and back to 6 


A-24 Answers to Odd-Numbered Problems 


35 f= 5V (= 1)? + (y— 1); (3, 3, 0,0 = (75 aa) 

37 ae C now shows level curves; |grad f| is varying; f could be zy 

39 x? + zy;e”—¥; no function has of — = y and - = ~gz because then fry # fys 

41 v = (1,2t); T=v/V1+4 402; £ =v. (2¢, at?) = = 2t+ 49; © = (2¢ + 405) /V1 4 40? 

48 v = (2,3);T = vei =¥ (220 + 4t, —2yo — 6t) Reese oe ue 

45 v = (e*, 2e7?, — e~');T = yy; grad f = Aaa e = ie), £ =14+2- 1f= 

47 v = (—2sin 2t, 2 cos 2t), T = (— sin 2t, cos 2t); grad f = ‘a 2), = -2 sin? 2t + 2cos? 2t, x = id. 
zero slope because f = 1 on this path 

49 z2—-1=2(2-4)+3(y—5);f=14+2(2-4)+3(y—5) 51 grad f-T=0;T 


Section 13.5 The Chain Rule (page 503) 


1 fy = cf, = ccos(z + cy) Siete ate @ 5 3g? 3a de + 3g? 92 a a 7 Moves left at speed 2 
9 42 = 1 (wave moves at speed 1) 


s ‘s > 3 . « < 
11 gal (a +ty) = f"(2+ iy), Sef (wt ty) =? f" (x + ty) 
80 fro + fuy = 0; (z + ty)? = (2? _ y) +1(2zy) 
18 £ = 22(1)+ 2y(2t)= 20+ 409 1s Layee pe =-1 ay H = wey yy 


19 V = hart?h, & iv = ach dr 4 net a = 36x 


aD _ 105 60 45 
21% = iors (60) BE “rites (45) = V2 mph; 4 ap = Sis ccar en) + Fast raor (45) ss 74 mph 
28 F = ur 5h t ugh + us Zh 25 Sf = 1 with z and y fixed; £ =6 


27 fi = fat + Ff, (2t); fit = fast F fa + 2fyet = 2fy = fezt + fyx(2t))t + te a5 2(fayt f fyy (2t))t ae 2fy 


29 of = pe oe + GLY = 5 cos 6 + $f sind, @ is fixed 


a 1 =n 
Siae= aae a eee ee 


—20r _1_ 
ox r 


3 (2), = Jae: (cos z)(22), = 1; first answer is also Fa = 
35 f, = fz cos 6+ fy sin 8, fro = —f, sin 6+ fy cos 6+ fr2(—rsin 0 cos 6) + fry(—r sin? 6+r cos? 6) + fyy (r-cos 8 sin 6) 


37 Yes (with y constant): 92 = ye*¥, 32 = ae = a 39 fe = fate t+ fyys fee = fart? + 2foyteye + fyyy? 
= 5 on  O — Of dy _1 
41 ($4), = sf + $f ou = a— ~ $4; (2), = 0; (28), = Le = 2s 


43 1 45 f=y? so f, =0, fy = 2y = 2rsind; f =r? so f, = Ir = 2/2? + y?, fo =0 
47 Qu = fatu + fy¥u =fet+ fyi Qu = farZy + fyYu = f,— fyi Guu = frau + feyYu + fyz2u + fuyYu 
> fez + 2fry ah tuys Juv = farty + fay Yu > fux®u = fuy¥o = tea _ 2 fay ce uy: Add Suu + Gov 49 False 


Section 13.6 Maxima, Minima, and Saddle Points (page 512) 
1 (0,0) is a minimum 3 (3,0) is a saddle point 5 No stationary points 7 (0,0) is a maximum 
9 (0,0,2) is a minimum 11 All points on the line z = y are minima 13 (0,0) is a saddle point 


15 (0,0) is a saddle point; (2,0) is a minimum; (0, —2) is a maximum; (2, —2) is a saddle point 

17 Maximum of area (12 — 3y)y is 12 

19 2(z-+ y) + 2(z + 2y—5) + 2(24+ 3y—4) = ais z= 2; 
2(z+y) + 4(2+2y—5)+6(x+ 3y— 1 BN p= = 1 

21 Minimum at (0, $); (0, 1); (0, 1) 


min because Ezz Eyy = (6)(28) > E2, = 12? 


Answers to Odd-Numbered Problems A-25 


23 £ = 0 when tant = V3; fmax = 2 at (3, V3), frrin = —2 at (-3,-*8) 
25 (az + by)max = Va? + 6; (z? + y?)min = ara 270<c< i 
29 The vectors head-to-tail form a 60-60-60 triangle. The outer angle is 120° 31 2+ /3;14+ V3;1+ vs 


85 Steiner point where the arcs meet 39 Best ee for p = oo is equidistant from corners 
41 grad f = (v2 2ap + att aan va uous + Woua 4 uo¥s); angles are 90-135-135 


43 Third derivatives all 6; f = $2° ef Sx yt eae + gy? 

45 (2)"(%)™ In(1— zy)]o,o = n!(n — 1)! for m =n > 0, other derivatives zero; f = —zy — sty” ee _ 
47 All derivatives are e? at (1,1); f s e?[1 + (x — 1) + (y— 1) + $(z— 1)? + (2 - 1)(y—1) + ity - 1)?] 

49 z=1,y=-1: f, =2, fy = —2, fer = 2; fay = 0, fyy = 2; series must recover x? + y? 

51 Line z — 2y = constant; z+y = constant 

53 © fas + tyfey + X fyvlo, 03 fez > Oand fez fyy > ty at (0,0); i = fy=0 55 Az =—1,Ay=-1 
57 f = 27(12 — 4) iiss fmax = 16 at (2,4); line has slope —4,y = 4 has slope =32 = —4 

59 If the fence were not perpendicular, a point to the left or right eeiii be HGier 


Section 13.7 Constraints and Lagrange Multipliers (page 519) 
1 ree ape 2x)?; £ = 22 — 4(k — 22) = 0; (2% , B), 3 X= —4, min = 2, Ymin = 2 
= spas : (2, y) = (+2/8, 0) or (0,+2"/9), frnin = 21/3, d= 4: (z,y) = (£1,+1), fax = 2 


7 = = 4,(z,y) = (2,—3); tangent line is 2z — 3y = 13 

9 Weds eae ete eo) —c)? + (2—6—c)? is minimized at a = —},b = 8,c = 2 
11 (1,~1) and (—1,1);A = —3 

13 f is nota eae when C' crosses to lower level curve; stationary point when C is tangent to level curve 
15 Substinnting: 3 = oe — ah = 0 and L = fmin leaves Goin =A 
17 z? is never eee. ‘(0,0); 1 = A(—3y?) but y = 0;g = 0 has a cusp at (0,0) 
19 2z = d1 + Az, 4y = Ag, 2z = Ay — Ag, o+yt+z2=0,2—z=1 gives 4; = 0,A2 = 1, fmin = 3 at (4,0,—4) 
21 (1,0,0);(0,1,0);(A1,2,0);2=y=0 28 Sf and $f;.=0 
25 (1,0,0), (0, 1,0), (0,0, 1); at these points f = 4 and —2 (min) and 5(max) 
27 By increasing k, more points are available so fmax goes up. Then A = Hain >0 
29 (0,0); A = 0; fmin stays at 0 
$15 = d1 + A2,6 = A1 +.A3, Az > 0, Az < 0; subtraction 5 — 6 = Az — Az or —1 > O (impossible); 

z = 2004, y = —2000 gives 5z + 6y = —1980 

33 22 = 4)1 + Ag, 2y = 4A1 + Az, Az > 0,A3 > 0,42 + 4y = 40; max area 100 at (10,0)(0,10); min 25 at (5,5) 


CHAPTER 14 MULTIPLE INTEGRALS 
Section 14.1 Double Integrals (page 526) 


18,2 Simg 52 TL Of 119%, [2  dedyt fro finde dy 

13 fro Son tiny dz dy = 15 fie Y yetyds = ATS fal? dz dy = fo’? fo, dy dz = 3 

19 ff, dz dy= f°, Peedydzt+ fj fedydz=9 21 fF ya de dy + fe Syj2 42 dy = fo 2" dy dz =8 
28 fp fo" dy da+ J? fo dy dz = fo fr de dy=b 25 f (a,b) — f(a,0) — f(0,6) + F(0, 0) 


A-26 Answers to Odd-Numbered Problems 


27 fo fo(2z—3yt+ldzdy=2 29 f? f(z)de=fP fi 1dydz $1 50,000m 
33 f° f2a2dzdy=%4 = p2fi/¥? fv? 1azdy=% 
--1 fil 
37 3, D7, De = f(— 2) d 2) is exact for f=1,2,y,zy 39 Volume 8.5 41 Volumes In 2, 21n(1 + 2) 
43 f pe w4dz dy = i Siren fe S a¥dy dz = So 3 a t=ldz =1n2 
45 With long rectangles > y;AA = )) AA= 1 but ffydA=3 


Section 14.2 Change to Better Coordinates (page 534) 


1 feri* for dr do = % 3 S = quarter-circle with u > 0 and v > 0; J) fY*” ” du du 


Y1—v? 
5 R is symmetric across the y axis; fo fY” ” 


. udu du = 3 divided by area gives (i, 0) = (4/3, 4/3m) 


7 2 fo vhs Joe Ph dy dz; zy region R* becomes R in the z*y* plane; dz dy = dz*dy* when region moves 
_ | @2/Or* Ax/0* |_| cos@* —r*sinO* |, p3m/4 pl a ye ape 
od dy/Or* dy/de* sin6* r* cos 6* Sef Sor dr* dé 


11 ly = ff_ ada dy = fon' fo r? cos? 6 rdrdé= & -—3;1,= B+ iihb= A 

18 (0,0), (1,2), (1,3), (0,1); area of parallelogram is t 

152=u,y = ut 3v+ uv; then (u,v) = (1,0), (1,1), (0,1) give corners (z, y) = (1,0), (1,5), (0, 3) 
17 Corners (0,0), (2,1), (3,3), (1,2); sides y = $2,y =22-3,y= 32+ 3,y = 22 

19 Corners (1,1), (e?, e), (e?, e°), (e, e7); sides = y?,y = 27/e9, x = y?/e®, y = 2? 


21 Corners (0,0), (1,0), (1,2), (0,1); sides y=0,2=1,y=1+2?,2=0 
De TD 1 pl Derry geuey 3ut3v fl rl o,3u43u 
23 J = 12\7 3, area [, fj, 3du dv =3;J = put2y  ggut2u | = 3e ito So 3e dudv= 
Jo (e8 +9" — 2”) du = 2(e6 — 2e9 + 1) 
> : ; La ee 1 0 = 
25 Comers (2,9) = (0,0), (1,0), (1, f(0)) (0, £(0))5 (4.1) sives == 3,= 1B I=| bay Joy [= se 


27 B? = 2 f7l4 passin’ or? de dp = 7/4 (ent/sin?o _ 1) a9 

29F=ffrdr d6/ ffrdr dé = fo 8a cos? 8 d6/ma? = 320 $1 it, r?r dr dé =F 
33 Along the right side; along the bottom; at the bottom right corner 

35 ff zy dzdy= fo fo (ucos a —vsina)(usina + vcosa)du dv = 4(cos? a ~ sin? a) 


37 ig Pye dr do = 2&(56 — 48) 39 xz = cosa — sina, y = sina + cosa goes tou=1,v=1 
Section 14.3 Triple Integrals (page 540) 


1 
Ifo fo So dz dy dz = § 
30<y<2<2<1and all other orders zzy, yzz, zy, zyz; all six contain (0,0,0); to contain (1,0,1) 


BSS, dz dy dz =8 Ths Sidedydz=4 9 of’, f) fidzaydz=4 

uf, f 2—2z ral ¥-3* dn dy dz = 2 1s fi? f° 2- ik ¥~ 38 ae dy dz = i 

15 fo fo 2 Va *°—¥" de dy dz = ih So JNO dz dy dz = 6m 19 fo fo J M0 dzdydz=n 
21 Corner of cube at (Ys, Ya Ja); sides vai area 55 


23 Horizontal slices are circles of area mr? a — z); volume = i, n(4 — z)dz = 87; centroid 
has £=0,9=0,2= fo en(4 — z)dz/8x = 4 


Answers to Odd-Numbered Problems A-27 


2I=% ~ gives zeros; al x ft fe Bey dz, 3t = fo fo f dz dz, 23- <- 4. = fo fdz 
a7 PD Pyle? +28de dy dz = 18; [ff atav = 453 ff fle Ee Prav = 38 


29 k fe fi dudydz=6 $1 Trapezoidal rule is second-order; correct for 1,2, y, z, zy, 2z, yz, Zyz 


Section 14.4 Cylindrical and Spherical Coordinates (page 547) 


1 (r,9, 2) = (D, 0,0); (0, 6, 6) = (D, F,0) 8 (r,9, 2) = (0, any angle, D);(,¢,0) = (D,0, any angle) 

5 (z,y,z) = (2, —2, 272); (r, 6, z) = (2V2,-,2/2) 7 (x,y,z) = (0,0, —1); (r,9, 2) = (0, any angle, —1) 
9¢=tan-*(£) 11 45° cone in unit sphere: 2&(1 — 4) 18 cone without top: 72 
15 1 hemisphere: 7 17 x 19 Hemisphere of radius x : 2x4 21 x(R? — 27); 4ar/R? — 1? 
28 2a° tana (see 8.1.39) 27 22 = eS = _ueat side” = cos a 
$31 Wedges are not exactly similar; the error is higher order => proof is correct 


33 Proportional to 1+ }(./a? + (D — h)? — Va? + D?) 
a 
$5 J= b = abc; straight edges at right angles 37 
c 


39 La 32 41 p®; p*; force = O inside hollow sphere 


cos6 —rsing 0 
sin? rcos@ O 
0 0 1 


=r 


CHAPTER 15 VECTOR CALCULUS 
Section 15.1 Vector Fields (page 554) 


1 f(z,y) = 2+ 2y 3 f(z,y) = sin(z + y) 5 f(z,y) = In(z? + y*?) = 2Inr 


7F = xyi+ #5, f(z,y) = =" 9 af = 0 so f cannot depend on z; streamlines are vertical (y = constant) 
11 F = 3i+j 18 F =i+2yj 15 F = 2zi-2yj 17 F = e*-¥ i - e7 -¥jj 

2 
19 # = -1; y=—-2z+C 21 = —2;2? +y=C 23 4 == =z Set+y=c 25 parallel 


a7F = 52i+%j 29F = =e (at' 93) eBMSeyen (2-1) i+ ys) 
‘ = =2 . dy _ = 
s1F = Qyi- rj 38 is a = —aias = yas =? 
35 3f — Sf ar — Ma, of = SL u: f(r) = C gives circles 
37 1 F (no equipotentials); T; F (not multiple of zi+ yj + zk) 
39 F and F +i and 2F have the same streamlines (different velocities) and equipotentials (different potentials). 


But if f is given, F must be grad f. 


Section 15.2 Line Integrals (page 562) 


fo Vi? + dt = V5; fj 2dte=2 8 fi t?V/2dt + f21-(2-t)dt= 42 

5 Se" (-3 sin t)dt = 0 (gradient field); i —9sin? t dt = —9x = — area 

7 No, zy j is not a gradient field; take line z = ¢, y = ¢ from (0,0) to (1,1) and f é?dt A 4 
9 No, for a circle (2xr)? 4 0? + 0? lif =2+ 4y; f(0,1) — f(1,0) = —2 

18 f = 327y?; f(0,1)-f(1,0)=0 Ib f=r= vert (0,1) — f(1,0) =0 

17 Gradient for n = 2; after calculation ae ON — 2-2 

19 z = acost, z = asint,ds = a dt, M = f""(a + asint)a dt = 2xa? 


A-28 Answers to Odd-Numbered Problems 


21 s=a cost,y = asint,ds =adt,M = fie a? cos” t dt = na*, (Z, g) = (0,0) by symmetry 
a3 -T = 2126 _ +8) = 32i44j =6ti+4j,ds = 2V1 + tdt, F-Tds = (6ti+4j)-(7*h avis tat = 


Vatae? 1487? Vite? 
20t dt; F - dR = (6ti + 4j) - (2 dti + 2¢ dt j) = 20¢ dt; work = f” 20¢ dt = 30 
25 If eu) = Nts) then M = ay+b,N =az-+c, constants a,b,c 
27 F = 42j (work = 4 from (1,0) up to (1,1) 29 f=(2—2y|{g)=—-1 81 f = [ay*I{O') = 1 
83 Not conservative; f, (ti— tj) (dt it dtj) = f 0 dt =0; J, (t7i — tj) - (dt i+ 2¢ de j) = fo —t?dt = —1 
35 oe = az, 2Y = 22+ b, so a = 2,6 is arbitrary 37 ne = 2ye* = Ns faye? 


39 OM — =3y — ON: fp = \/n? + y? = [xi + yj| 
41 F = (z—y)it+ (zt+y)j has Sf =-1,9% =1,n0 f 48 25050 


Section 15.3 Green’s Theorem (page 571) 


1 Jo" (acost)acost dt = na?; N, — M, = 1, f f dz dy = area ma? 
8 fo cdz+f)2dz=0,N, ~My =0,ffOdzdy=0 
4 


5 fx?ydz= Sa cos t)?(asint)(—asint dt) = = 2" (sin 2t)*dt = ce 


N, — M, = —27, f f (—z?)dz dy = 5” Jo —1? cos? 6 r dr do = —* 
7 fxdy—ydz= f* (cos?t+ sin” t) dt = 1; ff (1+ 1)dz dy = 2 (area) = 1; f 2?dy — zy dx =441; 
Jo fo (22+ 2)dx dy = 3 


9: {70 cos‘ tsin” t + 3sin‘ t cos? t)dt = 4 is 3.cos” tsin” tdt = 3% (see Answer 5) 


11 fF dR =0 around any loop; F = #i+ “j and fF -dR= fo" [sin t cost +sintcost|dt = 0; 
“a oN gives ff[Odz dy 
13 z = cos 2t, y = sin 2t, t from 0 to 2m; fo" —2sin? 2¢ dt = —2x = —2 (area); 
an —2dt = —4x = —2 times Example 7 
15 f Mdy — Ndz = J," 2sint cost dt = 0; f [(Mz + Ny)dz dy= ffOdzdy=0 
17M =2,N=%, f Mdy—Ndz= Jo" (cos? t-+sin? t)dt = 2m; ff (Mz+N,)dz dy = sfG-344-4)dz dy = 
Jf Adz dy= ff drdé=2n 
19 f Mdy — Ndz = f —2*y dz = it —27(1—2)dz = ato | ie edz dy= 4 
21 f f{(Mz + Ny)dz dy = ff div F dz dy = 0 between the circles 
23 Work: fadz+bdy= ff[(2 - 50 )dx dy; Flux: same integral 
25 g =tan~'(“) = @ is undefined at (0,0) 27 Test My = N, : 27da + y*dy is exact = d(42° + Ly) 
29 div F = 2y — 2y = 0;g = zy? 31 div F = 2z + 2y; nog 33 div F = 0;g = e* siny 
35 div F =0;g = ve 
37 N, — M, = —2z, —6zy,0,2z — 2y, 0, —2e*¥; in 81 and 88 f = $(2° + y°) and f = e* cosy 
39 F = (32? — 3y)i — 6zy j;div F = 0 41 f= 21 —627y? + yt; 9 = 4a7y — 42y? 
43 F = e* cosy i— e*siny j;g = e* siny 
45 N = f(z), f Mdz+ Ndy= f> f(t)dy+ SP f(O)dy = F(1) — £0); [J (Ne — My)dz dy = 


Sf %4dz dy = ie 8f dz (Fundamental Theorem of Calculus) 


Answers to Odd-Numbered Problems A-29 
Section 15.4 Surface Integrals (page 581) 
1 N = —2ci — 2yj +k; dS = V/14+ 42? + 4y? dz dy; f, fe V1+ 4r? r dr dé = = (179/2 — 1) 
8 N=-i+j+k;dS = V3 dz dy; area /3x 
_  =si- a ae drd0 _ as 

BN = stale +k; dS = Vi-=—y? Vi-=—y? = fo" Vi-r? n(2 V2) 
IN =-7+k;dS = 5/2 dz dy; area Leia 
one (7 ~— 27)i — Qayj +k; dS = 1+ (y? — 27)? + 427y2da dy = \/1+ (y? + 2?)2dz dy; 

On 3 VI+ ro de do = 5 + Tota) 
11 N= = 214+ 23 +k; ds = See dy; 3(area of ae with 22 + 2y < 1) =? 
18 navVa? + h? 15 Ci ¥ cy(V3 dz dy) = 
17 Se" [ihe sin 2 Wicoaibein 0608 (sin d¢ dé) = 19 A = i+j+2k;B =j+k;N = —i-jik;dS = V3 du dv 
21 A= —sinu(cosv i+ sinvj) +cosuk; B= ap eee oar (3 + cos u) cos v j; 

N = —(3 + cos u)(cos ucosv i+ cos usinvj+sinuk); dS = (3+ cosu)du dv 
23 S{(-MSE - NGL 4+ P)dz dy = f f(—22? — 2y? + 2)dz dy = f f - 2 (r dr d#) = — 
25 F-N=—z+y-+z=0 on plane 
a7 N=-i-j+k,F=(v+u)i-uj,{f¥F-NdS = ff—vdudv=0 
29 ff ds = se (3 + cos u)du dv = 12x? 31 Yes 83 No 
35 A=i+ f'coséj+ f'sind k;B = —fsin§ j+ fcosdk;N = f f'i— fcosé j— fsin@ k;dS = |N|dzd6 = 
f(z) 14 f? dz dO 

Section 15.5 The Divergence Theorem (page 588) 
sagt ied | ace Stee) ete If ave a0 Rane ee a 


7F-N=/? SSine? 2d = 4ra4 9 div F = 2z, - ts ak Ps 2pcos $(p? sin ¢ dp d¢ dé) = dmat 

11 fo So So (2 + Uda dy dz = a* + a°;—2a? + 2a? + 0+ a4 +0448 

18 div F = §, ee 15 divF=1;fffidV=2;fffiav =} 

17 div (B) = ae R av +R grad4+ = 3 — R-gradp 

19 Two See n radial out, n radial i in, n = k on top, n = —k on bottom, n = rere on side; 
n = —i, —j, —k,i+ 2j + 3k on 4 faces; n = = on top, 2 = eae k) on cone 

21 V = cylinder, ff {div F dv = ff(9% +4 on )dz dy (2 integral = 1); ff F -ndS = 
jf Mdy — Ndz, z integral = 1 on side, F - Ge 0 top and bottom; Green’s flux theorem. 

23 div F = =°¢@ — —4nG; at the center; F = 2R inside, F = 2(%)°R outside 

25 divu, = 2,g= 78, ffE-ndS=ffidS=4n 27 F (divF =0);F;T(F-n< 1); F 

29 Plane circle; top half of sphere; div F = 0 


Section 15.6 Stokes’ Theorem and the Curl of F (page 595) 


lcurl F=i+j+k 8 curl F=0 5 curl F=0 7 f=k(z+y+2)? 
9 curl z™i = 0; 2”j has zero curl ifn =0 11 curl F = 2yi; n =j on circle so ff F -ndS =0 
18 curl F = 21+ 2j,n=i,ff curl F-ndS = ff2dsS = 20 


A-30 Answers to Odd-Numbered Problems 


15 Both integrals equal f F -dR; Divergence Theorem, V = region between S and T, always div curl F = 0 
17 Always div curl F = 0 19 f=az+y 21 f =e** 23 F = yk aoe 
25 curl F = (agb2 — agbs)i + (4163 — a3b1)j + (a2b1 — a1b2)k 27 curl F = Qwk; curl F - ide = Qw/V3 
29 F = x(agz + agy)it y(aiz + a3z}j + 2(a,2 + agy)k 
$1 curl F = ~2k, ff -2k Ras = f2" f*/? -2c0s $(sin ¢ dé dd) = —2n; f y dz — 2 dy = 

al sin? t — cos? t)dt = —2r 
33 sil F = 2a,2/f f(aiz + ay + a3z)dS =0+0+ 2a3 f.” 


85 curl F = —i,n = BAK, pp ‘ndS = —ear? 

37 9= ve - = = stream function; zero divergence 

39 div F- aiv (V + W) = div V 0 y = div V so V = wy (has zero curl). Then W = F - V = zyi — wi 
41 curl (curl F) = curl (—2yk) = —2i; grad (div F) = grad 2z = 2i; F,, + Fyy + F.2 = 4i 

43 curl E = eon =asint so E = }(a x R)sint 


45n= jso f Mdz+ Pdz= f f[(9@ — 22 )dz dz 47 My = My + M.fy + Pyfet+ Pefyfe + Pfhay 
49 fF -dR=ff curl F -nds;ff¥ -ndS= fff divF dv 


*/? cos sing dé dé = 2nas 


CHAPTER 16 MATHEMATICS AFTER CALCULUS 
Section 16.1 Linear Algebra (page 602) 


1 All vectors c is $8 Only z =0 5 Plane of vectors with xz, + z2 +23 =0 


3 0 
1 Ip = 0 | -Alee-+ 20) = FE J+ | 0 9 A(z, + 2) = b+ 0 = 5; another solution 


HEHE 


101 
isco" =| 0 1 2 |icro~ [2 5 |i bys) (2 by 3) is impossible 
1 2 5 


11 


15 Any two are independent 17 C and F have independent columns 


2 -1 
= -121 
19det F=3 21F =3[ 3 a 


2-A 1 


2s det(F — A1) = det | 1 2-d 


]=@-ap-1=9- ar? ota torans, 


Answers to Odd-Numbered Problems A-31 


PSX 8 phe ee eee ee ee ot eee ee 1]_,f1 
20 det | : al -sr=0if1=oora=54[ 7 ]=0] T],4[ 3 ]=5| 3 | 


sin=|_} a 33 F if 6 #0; T; T; F (e** is not a vector); T 


Section 16.2 Differential Equations (page 610) 


1 3Be** — Be** = 8e* gives B= 4: y = 4e* Sy=3-—2t+t? 5 Ae’ + 45 = 7 att =Oif A=3 
1” Add y = Ae* because y’ + y = 0; a A=-—1s80 —e-* +3 —2t+#? =2 at t=0 
gy=5 aiy=t; by PHépital lim * —= ld 
11 Substitute y = Ae’ + Bte* + CG. coat f Dsint in equation: B= 1,C = 1,D= =-—},any A 
18 Particular solution y = Ate’ + Bet; y/ = Ate’ + (A+ B)e* = c(Ate’ + Be‘) + tet 
gives A=cA+1,A+B=cB,A= p> B= a= 
15 Ae4* + 6AUe** + 5e** = O gives A7 + 6A +5 =0,(A+5)(A+ 1) =0,A = —1 or —5 
(both negative so decay); y= Ae~* + Be~** 
17 (A? + 2d + 3)e** = 0,A = —1 + —2 has imaginary part and negative real part; 
y = Ael~1+V2i)t 4. Bel—1-V28)t. y = Ce—t cos /2t + De~* sin /2t 
19 d = 0 no damping; d = 1 underdamping; d = 2 critical damping; d = 3 overdamping 
21A=-$4 VP —Ae j is repeated when b? = 4c and A = —£; (tA? + 2A)e** + B(tA + )e** + cte** = 0 
when 3? 4d -+e=0 and 214+6=0 
28 —acost — bsint — asint + bcost + acost + bsint = cost ifa = 0,b = 1,y = sint 
25 y = Acos3¢ + Bcos 5t;y" + 9y = —25B cos 5t + 9B cos 5t = cos 5t gives B = 53; 
Yo = 0 gives A= 4 ie 


27 y = A(coswt — sesuolg? y” = —Aw? coswt + Awé cos wot, y” + wey = coswt gives A(—w? + w) = 1; 
breaks down when w? = w? 

29y = Be*;25B+3B=1,B=4 Siy=A+ Be=4+ht 

83 yf’ — 25y = ey" + y =sint; y” = 1-+ t; right side solves homogeneous equation so particular 
solution needs extra factor t¢ 

35 e*,e7*, ee BT y = 7 + 2te—7*; y(2x) = (1+ 4n)e~** wO 

39 y = (4e7"t — r2e~4#/") /(4 — 7?) +1 asr 0 48h<2;h<2.8 


Section 16.3 Discrete Mathematics (page 615) 


1 Two then two then last one; go around hexagon $ Six (each deletes one edge) 

5 Connected: there is a path between any two nodes; connecting each new node requires an edge 
13 Edge lengths 1,2,4 
15 No; 1,3,4 on left connect only to 2,3 on right; 1,3 on right connect only to 2 on left 174 
19 Yes 21 F (may loop); T 25 16 


Table of Integrals 


antl In |u| : : 
1f u"dz = antl) nti) except for f dz = — All the integrals 1 - 17 involve u=azr+ 6b 
n n+ 
af cu" de = ster — Stray except for feje= 3-4! and f 2g? = $7 + OH 


3f #22 = as a(S - 2bu +b? In |u|) 4f = 3" ade es 5f %# ade _ A, (in |u| + 28 — b2 ) 


2uF 
eof S=Fflnlz| Ff R=-k+ Smile of S=L-AlY] of 4 = —- 24+ Vly 
1of ude = gu? af aude = Mus? af x? fu de = Mises asters) 8/2 

af #42 = ax) 15f 2e side 2(30%s?—dabet 804) 7 

= pnERIO a ies i7f ees 
1i8f Tastheray = = ieeaa In | £2+$| 19f GaGa = x (2 In lax + | - 4 In |ex + d|) 
20f V2? + a? dz = Vx? Ha? + © In[x + V2? + a 21f ofcq = In|2 + V2? + a?| 
a2f Yee" dz = Vx? + a? — aln (st¥2*ts") a3f 5% de = V2? — a? — asec"! z 


24f 72, = t/e? £0? = @ In|2+ V2? £ a?| apf $2 dx = —Y2#0? 4 In |e + Vx? $a} 
26f 2?V'x? + a? dx = 2 (22? + a?)V2? + a? — “ In|2 + V2? £ a?| 27 age rer = Vera? 


28f (x? + a)8/? dx = £(2x? + 5a”) Va? + a? + Sat In |2 + Vx? + a?| 29f (FEa?)57? Ln = a 
sof Va? — a? de=3Va?—a?+%sin 2 gif ie =sin 2 32 er fyes = aE 
33 YE de = Va? at — aln|4YE=B| gf ata? a? de = £(20? — a?) VF oP 4 sin 
35[ tag = YEE sof Mee de = -V8E! — sin 2 g7f 2 = 1 n |tvee? 
38f Taras = — ~£Va? — 224 2 2” sin7 12 39f (a? — 27)3/2de = Z(5a? — 22 2)/a? — 2? + Sa 3a* gin-} z 
40 Srcsinaz = aera tan”? |v 552 tan (F - #)| » >? 4if wika = Ren a 
42) relics = cette In | tegen Pemar|, bP <o? asf 7-#255 = btan(¥ + #) 
44) srteas = WS tan~! $= tan 92 » >? 45S 77225; = i tan¥ 

46f brscss az eae In |sthcoseat reas ?<¢? 47f iene = —g cot 2 

48f sin! az dz = 2 sin} az + 1V/1 — atx? 49f x" sin” az dz = ae si Pan eer) 
50f tan™* az dz= 2 tan™! ax— A In(i+a?’z”) 51f 2" tan) az dz = 2 heme eel ae 
52f edz = — 53] re%* dz = (ax —1) 54f 27e% dr = £5 (a?z? — 2ar +2) (6% is ea(ind)z) 


55 fs = In|Inagz| Not elementary: fe dz, f e*Inz dz, @, se dz, fiz dz, f de 


Exponentials and Logarithms 


y=? c=logy yre*oar=Iny 
e = lim(1+ 4)" = Oro ay = 2-71828--- 


n=0 rl 
e* = lim(1-+ £)" = rn SF 
Iny= Y Inl=0 Ine=1 


Insy=Inz+Iny Ing? =nlnz 


log, y = (log, 6)(log,y) log, 5 = 1/log,a 


eZtY = eT e¥ fF = etind = pIny — y 


Vectors and Determinants 


A = aji+ ag j+ ask 

|A/? = A- A = a? + a2 + a2 (length squared) 

A-B = a1); + agbe + a3b3 = |A||B] cos é 

|A - B| < |A||B| (Schwarz inequality:|cos 4] < 1) 

|A + B| < |A] + |B] (triangle inequality) 

|A x B| = |A||B||sin 6] (cross product) 
ijk i(a2bs — agb2) 

AxXB=]| a, a2 a3 |= +j(a3b1 — aib3) 
by be bg +k(aib2 — agb1) 

Right hand rule 1xj=k, jxk=i, kxi=j 

Parallelogram area = |a1b2 — a26,| = |Det| 

Triangle area = $|a,b2 — a2b,| = 4|Det| 

Box volume = |A - (B x C)| = |Determinant| 


SI Units Symbols 

length meter m 
mass kilogram kg 
time second s 
current ampere A 
frequency hertz Hz ~ 1/s 
force newton N ~ kg-m/s? 
pressure pascal Pa ~ N/m? 
energy, work joule J~ Nm 
power watt W ~ J/s 
charge coulomb Cw As 
temperature kelvin . K 


Speed of light c = 2.9979 x10° m/s 
Gravity 


G = 6.6720 x 107-1! Nm?/kg? 


Equations and Their Solutions 


y' = cy yoe™ 

y =cyt+s yoe™ + 2(e%* — 1) 

y! = cy — by? spisw d= Ste 
y_ =—)*y cos At and sin At 

my + dy +ky=0 e*14 and e42¢ or te*1? 
Ynt1 = 2Yn a” yo 

Yntit = n+ 8 ayy + sv} 


Matrices and Inverses 


Az = combination of columns = 5 
Solution = A71b if ATA =T 
Least squares A? AZ = ATb 


Az =Az (A is an eigenvalue) 


ab], d —b 
c d ~ ad-be aE a 


(AB)-! = B-14-1, (AB)? = BT AT 


-1 
bxe 


+ aybgcg3 + agbgcy + agbicg 


ba ba ss fo a1b3c2q = agb1c3 = agbecy 
C1 ¢2 63 
From To Multiply by 
degrees radians .01745 
calories joules 4.1868 
BTU joules 1055.1 
foot-pounds _ joules 1.3558 
feet meters .3048 
miles km 1.609 
feet /sec km/hr 1.0973 
pounds kg 45359 
ounces kg .02835 
gallons liters 3.785 
horsepower watts 745.7 


Radius at Equator 
Acceleration 


R = 6378 km = 3964 miles 
g = 9.8067 m/s? = 32.174 ft/s? 


Sums and Infinite Series Area - Volume - Length - Mass - Moment 
Ttat--- tart = 1-2” 


I-2 Circle mr? Ellipse rab. Wedge of circle 76/2 
l+nz+ aln=t) 2 +:++-+ 2" = (1+ 2)" Cylinder side 2mrh Volume mr7h Shell dV = 2arh dr 
1424---4n= gn(n +1) ne Sphere surface 4ar?_ Volume $ar® Shell dV = 4ar*dr 
aa eee ee n(ntiM2nt 1) Pe ne Cone or pyramid Volume i (base area) (height) 
1+4+4.-.+4 Inn -— oo (harmonic) Length of curve fds = f \/1+ (dy/dz)? dz 

—i41-...=]n2 (alternating) Area between curves f (v(x) — w(z))dz 
poly ioles va=% yaes Surface area of revolution f 2mr ds(r = z or r = y) 


pig = 1+ 24 224--- (geometric: |2| < 1) Volume of revolution: Slices f ry?dz Shells f 2azh dz 


cage = 14224307 +--= £(;4) Area of surface 2(z, y): ff ./1+22 + 22 dx dy 


i¢gg =1-—2+27~--- (geometric for —z) Mass M = ff pdA Moment M, = [f pz dA 
In(l+2)=2- = ey = ee == M,/M,y¥=M,/M Moment of Inertia Iy = f f pz?dA 
sing = 2— 2°/6+ 2°/120—--- (all z) Work W = fe F(z)dz =V(b) —V(a) Force F = dV/dz 
cosa = 1—27/2+24/24—--- (all z) 4 2 hs 

; , Partial Derivatives of z = f(z,y) 
es itad et (em Lis to) 
e** = cosz+isinz (Euler’s formula) Tangent plane z — zo == (2£)(x ~ 20) + (S4)(y — yo) 
a? 

coshza = $(e*+e*) = 145 4--- Approximation Az ~ (2L)Az + ($2) Ay 

: = Ler —-7) = aes 

sree ea ere). Se eee Normal N = (fz, fy,—1) or (Fe; Fy; Fe) 


(cos@+72 sin 8)" = cosnO + ve né Grdient Ops aL 4 2h 
= , "(oye 4... 
f(z) = f(0) + f'(O)z+ f (0) at (Taylor) Directional derivative: Dy f = Vf -u= fru; + fyua 


2? 
f(z,y) = Ft2tfetyfy t+ Fhe + ty fey t+ Chain rule: de = Side, ot oy 


Polar and Spherical 
Vector field F(z,y,z)= Mi+Nj+Pk 


z=rcosé and y=rsin@ Work f{[F.dR Flux f M dy — Ndz 

r= /c? + y? and tand = y/2 ; Divergence of F = V-F = 9% 4 aN 4 oP 

z+ty=r(cosd+¢ sin 6) = ret? j k 

Area [5r7d9 Length f y/rj + r?d¢ Curlof F=VxF=| a/d2 8/dy 3/2 

z= psin¢cosé,y = psingsin 0, z = pcos¢ M N P 

Area dA = dz dy=r dr d§ = J dudv Conservative F = Vf = gradient of f if curl F = O 

Volume r dr dé dz = p? sing dp d¢ dé Green’s Theorem § M dz+N dy = ff(2% — oT) dx dy 
Zu Ly 


Divergence Theorem ff F ndS = fff div F dV 


Stretching factor J = et} an ie 
“ Stokes’ Theorem § F -dR= ff (curl F) nds 


Yu Ye 


An additional table of integrals is included just after the index. 


